




PROCEEDINGS 

OF THE 

ROYAL SOCIETY OF LONDON 


SERIES A. MATHEMATICAL AND PHYSICAL SCIENCES 


VOL 193 


LONDON 

Printed and published for the Royal Society 
By the Cambridge University Press 
Bentley House, N.W.1


21 July 1948 



Printed in Great Britain at the University Press, Cambridge
(Brooke Crutchley, University Printer)
and published by the Cambridge University Press
(Cambridge, and Bentley House, London, N.W.1)

Agents for Canada and India: Macmillan 



CONTENTS 


SERIES A VOLUME 193

No. A 1032. 22 April 1948

PAGE

The lattice spacings of solid solutions of different elements in aluminium.
By H. J. Axon and W. Hume-Rothery, F.R.S.  . . . 1

Matrices of integers associated with self-transformations of surfaces.
By T. G. Room, F.R.S.  . . . 25

Solar radiation in the radio spectrum. I. Radiation from the quiet sun.
By D. F. Martyn  . . . 44

A possible mode of propagation of the 'slow' or tail component in atmospherics.
By A. L. Hales  . . . 60

Magnetization curves for ferromagnetic single crystals.
By H. Lawton and K. H. Stewart  . . . 72

On the inhomogeneity of plastic deformation in the crystals of an aggregate.
By W. Boas and M. E. Hargreaves. (Plates 1 and 2)  . . . 89

An investigation of radio-frequency radiation from the sun.
By M. Ryle and D. D. Vonberg  . . . 98

Natural evaporation from open water, bare soil and grass.
By H. L. Penman. (Plate 3)  . . . 120

No. A 1033. 27 May 1948

Relaxation methods applied to engineering problems. XIII. The flexure and
extension of perforated elastic plates. By R. V. Southwell, F.R.S.  . . . 147

The limits of flame propagation at atmospheric pressure. I. The influence of
'promoters'. By Sir Alfred Egerton, Sec.R.S. and J. Powling  . . . 172

The limits of flame propagation at atmospheric pressure. II. The influence of
changes in the physical properties. By Sir Alfred Egerton, Sec.R.S. and
J. Powling  . . . 190

Bakerian Lecture. Some problems in connexion with the development of a high
speed Diesel engine. By H. R. Ricardo, F.R.S. (Plate 4)  . . . 210

Three-dimensional stress system in isotropic plates. II.
By A. E. Green and T. J. Willmore  . . . 229

On the interpretation of the Foucault test. By E. H. Linfoot  . . . 248

The hydrodynamics of non-Newtonian fluids. I. By R. S. Rivlin  . . . 260

A theory of the yielding and plastic flow of anisotropic metals. By R. Hill  . . . 281










No. A 1034. 2 July 1948

Self-consistent field, with exchange, for nitrogen and sodium.
By D. R. Hartree, F.R.S. and the late W. Hartree  . . . 299

The accuracy of atomic co-ordinates derived from Fourier series in X-ray
crystallography. V. By A. D. Booth and K. H. V. Britten  . . . 305

A study of the interaction of radio waves. By J. A. Ratcliffe and I. J. Shaw  . . . 311

The excitation of nuclei by electrons. By I. N. Sneddon and B. F. Touschek  . . . 344

The mechanism of the carbon dioxide-carbon reaction. By J. Gadsby, F. J. Long,
P. Sleightholm and K. W. Sykes  . . . 357

The mechanism of the steam-carbon reaction. By F. J. Long and K. W. Sykes  . . . 377

The control of the temperature of endothermic reactions in flow systems.
By J. Gadsby and K. W. Sykes  . . . 400

The use of interlacing nets for the application of relaxation methods to problems
involving two dependent variables. By D. C. Gilles (with a foreword by
W. G. Bickley)  . . . 407

No. A 1035. 21 July 1948

The Wellcome Research Institution. By C. H. Kellaway, F.R.S. (Plates 5 to 8)  . . . 435

The electronic structure of conjugated systems. Parts III and IV.
    III. Bond orders in unsaturated molecules  . . . 447
    IV. Force constants and interaction constants in unsaturated hydrocarbons  . . . 456
By C. A. Coulson and H. C. Longuet-Higgins

The thermal etching of silver. By B. Chalmers, R. King and R. Shuttleworth.
(Plates 9 to 12)  . . . 465

The theory of the galvanomagnetic and thermomagnetic effects in metals.
By E. H. Sondheimer  . . . 484

Measurements in the reaction zone of a Bunsen flame. By H. Klaukens and
H. G. Wolfhard. (Plate 13)  . . . 512

The influence of incombustible vapours on the limits of inflammability of gases
and vapours in air. By J. H. Burgoyne and G. Williams-Leir  . . . 525

Decay of isotropic turbulence in the initial period. By G. K. Batchelor and
A. A. Townsend  . . . 539

Interaction of mesons with a potential field. By J. C. Gunn  . . . 569

Index  . . . 580









The lattice spacings of solid solutions of different 
elements in aluminium 

By H. J. Axon, D.Phil. and W. Hume-Rothery, F.R.S.

The Inorganic Chemistry Laboratory, University of Oxford

(Received 14 August 1947; Read 12 February 1948)


Measurements have been made of the lattice spacings of solid solutions of lithium,
magnesium, silicon, copper, zinc, germanium and silver in aluminium. The lattice of
aluminium is expanded by the solution of magnesium or germanium, and contracted by the
solution of lithium, silicon, copper or zinc. No change in lattice spacing can be
detected when silver is dissolved in aluminium, although microscopic examination shows
that a solid solution is formed, and this is confirmed by the absence of any diffraction
lines other than those of the solid solution in aluminium.

If the lattice spacing/composition curve for dilute solutions is extrapolated to 100% of
solute, the resulting lattice spacing refers to a hypothetical face-centred cubic modification
of the solute, and the corresponding closest distance of approach of the atoms is called the
apparent atomic diameter (a.a.d.) of the solute when in solution in aluminium. Previous work
enables the corresponding a.a.d. values to be deduced for the above solute elements when
dissolved in univalent copper, silver or gold, and in divalent magnesium. The differences
between the a.a.d. values of a given element when dissolved in various solvents are
discussed, and it is suggested that they are controlled by the interplay of four factors:
(1) the relative volume per valency electron in crystals of the solvent and solute, (2) the
relative radii of the ions of solvent and solute, (3) Brillouin zone effects, and (4) the difference
between solvent and solute in the electrochemical series. If the line of approach adopted
is correct, it follows that it is only in exceptional circumstances that the so-called
Vegard's law will apply to metallic solid solutions.


1. Introduction 

The lattice spacings of solid solutions of different elements in copper and silver were
investigated by Hume-Rothery, Lewin & Reynolds (1936), and later by Owen &
Roberts (1939), who also examined solid solutions in gold (1945). In these alloys the
solvent metal is univalent and the electron states of the first Brillouin zone are only
half filled. The experimental results show that for any one solvent the lattice
distortion produced by a given atomic percentage of the solutes in any one row of the
periodic table (e.g. Ag, Cd, In, Sn, Sb) increases algebraically with the valency of the
solute. The lattice spacings of solid solutions in magnesium were investigated by
Hume-Rothery & Raynor (1940a), and by Raynor (1942). In these alloys the solvent
metal is divalent, and with two valency electrons per atom, overlapping has occurred
in some directions of the first Brillouin zone of the close-packed hexagonal structure.
In the magnesium-rich alloys, the addition of relatively small amounts of solute of
higher valency produces an overlap in another direction of the Brillouin zone, and
the relations between lattice spacings and composition become more complicated.
The present paper describes an investigation of the lattice spacings of solid solutions
of lithium, magnesium, silicon, copper, zinc, germanium and silver in aluminium,
this metal having been chosen as an example of a trivalent solvent with a
face-centred cubic structure. The above solute elements are the only elements which
dissolve in aluminium to an extent sufficiently great to permit an accurate lattice
spacing/composition curve to be determined. The experimental methods and results
are described in §§ 2 and 3 respectively, whilst in § 4 the theoretical implications
of the results are described, and the validity of Vegard's law is discussed.

2. Experimental details
(a) Metals used 

The aluminium used in the present work was super-purity metal supplied by the
British Aluminium Co. Ltd.; the certificate of analysis showed the following
impurities:

    Silicon                       0.0015 %
    Iron                          0.0010 %
    Magnesium                     0.0010 %
    Copper                        0.0020 %
    Aluminium (by difference)    99.9945 %


The lithium used was from a batch of specially pure metal presented by Professor 
R. S. Hutton. This metal has not been analyzed in the elementary state, but a series 
of magnesium-lithium alloys prepared from it gave no indication of impurities. 
The copper was of 99.995 % purity and was kindly presented by the British
Non-Ferrous Metals Research Association. The magnesium was provided by the Ministry
of Supply, and was of Dominion origin, the purity being of the order 99.97 %. The
silicon was introduced in the form of a high-purity master alloy of aluminium and
silicon which was supplied by the British Aluminium Co. Ltd. The silver was pure
assay silver, and the germanium was specially purified metal, these two metals being
supplied by Messrs Johnson Matthey and Co. Ltd. The zinc was kindly presented
by the Imperial Smelting Corporation, and was of a purity exceeding 99.99 %.

(b) Preparation and heat treatment of ingots 

The ingots used in the present work were prepared by melting the super-purity 
aluminium in crucibles lined with alumina-fluorspar mixture, using a standard flux, 
the solute metal being introduced either in the elementary state, or in the form of 
a master alloy. After thorough stirring with an alumina rod, the alloy was cast into 
heavy copper moulds, to give cylindrical ingots fin. in diameter. The ingots were 
placed in alumina collars in sealed evacuated hard glass tubes, and were annealed 
for periods of the order of 1 week at a temperature of approximately 500° C, after
which they were quenched in iced water. A surface layer of about 1 mm. was removed
on a lathe, and the remainder of the ingot was then used for the preparation of the 
filings. 




(c) Preparation and heat treatment of filings 

From 0.5 to 1.0 g. of filings were prepared from each ingot, using a dead-smooth
file. The filings were cleaned with ether, using the methods described by
Hume-Rothery & Reynolds (1936). The clean dry filings were then placed in thin alumina
thermocouple sheaths in sealed evacuated glass tubes; in this way the filings were
prevented from coming into contact with glass, and contamination by silicon was
avoided.* The exact heat treatment given to the filings depended on the system
concerned. For alloys lying within the range of the solid solution at low
temperatures, the annealing treatment was followed by slow cooling. In many systems the
limits of the solid solution under equilibrium conditions at low temperatures are
so restricted that it was necessary to quench the filings after annealing at a
temperature at which they were homogeneous. In some cases it was sufficient to cool the
sealed glass tube in a blast of cold air, whilst in others more drastic quenching was
necessary in order to retain the homogeneous solid solution. In all such cases the
results were examined carefully to ensure that the measured lattice spacings referred
to the true homogeneous state, and not to a partly decomposed specimen. After
quenching, the filings were washed with alcohol and ether, and were dried in a
vacuum. The dried filings were then passed through an 80-mesh sieve in order to
remove broken glass, and any unduly large metallic particles.

(d) Preparation of X-ray specimen and analysis of filings 

In the present work no results were obtained which suggested that the filings were
of uneven composition, but the possibility of slight transverse segregation† in the
ingots could not be avoided, and it was therefore desirable to carry out the chemical
analyses on samples which were as nearly as possible identical with those of the
X-ray specimen. For this reason the material which had passed the 80-mesh sieve
was used both for the preparation of the X-ray specimen and for the chemical
analysis. Experiment has shown that the use of a finer sieve has very little effect
on measurements of lattice spacing, and it is felt that for the present purpose the
increased accuracy in the analysis‡ outweighs any advantage which might be
gained by the use of a finer sieve.

The X-ray specimens were made by mixing a small portion of the filings with
a solution of Canada balsam in xylol until a pasty mass was obtained. The paste
was then rolled into thin cylinders of diameter about 0.4 mm., and after these had
been dried they were mounted in the X-ray camera.

The chemical analyses of the specimens were carried out by Messrs Johnson
Matthey and Co. Ltd., and the authors must express their thanks to Mr A. R. Powell
for his care and attention. In a few cases particles of glass were found mixed with the
filings, and were weighed separately. All alloys were then analyzed for both
constituents, and no indication of contamination was found. The work was
straightforward except for the aluminium-germanium alloys, for which very low results were
at first obtained for the germanium. This source of error was eventually traced to
the escape of germanium in the form of hydride, and the difficulty was avoided by
dissolving the alloy in a mixture of nitric and sulphuric acids.

* Experiment has shown that aluminium filings pick up silicon when annealed in glass tubes.

† Experience with many polyphase aluminium alloys has shown that slight transverse
segregation is often present in chill castings.

‡ It will be appreciated that if a very fine sieve is used, either the chemical analysis has
to be carried out on a very small sample, or, alternatively, a much larger sample of metal has
to be filed up, in which case the possibility of uneven composition is increased.

(e) X-ray technique, and lattice-spacing measurements 

The Debye-Scherrer photographs were taken in a 19 cm. diameter Unicam camera,
using unfiltered copper radiation obtained from a Metro-Vickers demountable
X-ray unit. With this radiation the pure aluminium gives (511), (422) and (420)
reflexions at angles of 82, 69 and 58° respectively. The films were all of good quality,
and could be measured accurately on a travelling microscope which had been tested
against a standard scale calibrated at the National Physical Laboratory. The lattice
spacings were calculated by standard methods, using the extrapolation function of
Nelson & Riley (1945). During an exposure the temperature was measured by means
of a thermometer placed on the camera, and for convenience all lattice spacings were
calculated for a standard temperature of 25° C. For this purpose the coefficient of
linear expansion was assumed to be 23.1 × 10⁻⁶ per ° C, the value for pure aluminium.*
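The reduction to the standard temperature can be illustrated by a short sketch. It assumes a simple linear expansion law, a(25) = a(T)[1 + α(25 − T)], which is our reading of "standard methods" rather than a formula quoted in the paper; the function name and the example reading of 4.0409 kX at 20° C are hypothetical.

```python
# Sketch only: linear thermal correction of a lattice spacing to 25 deg C.
# ALPHA_AL is the coefficient quoted in the text; everything else is illustrative.
ALPHA_AL = 23.1e-6  # coefficient of linear expansion of aluminium, per deg C


def correct_to_25(spacing_kx, temp_c, alpha=ALPHA_AL, ref_c=25.0):
    """Return the lattice spacing the specimen would have at ref_c.

    Assumes the spacing varies linearly with temperature over the
    small interval between temp_c and ref_c.
    """
    return spacing_kx * (1.0 + alpha * (ref_c - temp_c))


if __name__ == "__main__":
    # Hypothetical film measured with the camera at 20 deg C.
    measured = 4.0409
    print(round(correct_to_25(measured, 20.0), 5))
    # Size of the correction for a 10 deg C interval, in kX.
    print(round(correct_to_25(4.0413, 15.0) - 4.0413, 5))
```

On these assumptions a 10° C interval shifts the spacing by a little under 0.001 kX, so a small error in the assumed coefficient for a dilute alloy affects only the last quoted figure.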

The values given for the lattice spacings are in kX units, and have not been 
corrected for refractive index. This policy has been adopted in order to make the 
data directly comparable with those of most other authors. 
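As a rough check on the reflexion angles quoted above, the Bragg angles can be computed from the cubic lattice spacing. The sketch below assumes a Cu Kα1 wavelength of 1.5374 kX (a value not given in the paper) and uses the Nelson & Riley function in its usual form, ½(cos²θ/sin θ + cos²θ/θ); the function names are illustrative only.

```python
# Sketch only: Bragg angles for a cubic lattice and the Nelson-Riley function.
import math

CU_KALPHA1_KX = 1.5374  # assumed Cu K-alpha1 wavelength in kX units
A_AL_KX = 4.0413        # lattice spacing of pure aluminium at 25 deg C (kX)


def bragg_angle_deg(hkl, a=A_AL_KX, wavelength=CU_KALPHA1_KX):
    """Bragg angle theta (degrees) for reflexion hkl of a cubic crystal."""
    h, k, l = hkl
    d = a / math.sqrt(h * h + k * k + l * l)  # interplanar spacing
    return math.degrees(math.asin(wavelength / (2.0 * d)))


def nelson_riley(theta_deg):
    """Nelson-Riley extrapolation function; it tends to zero at theta = 90 deg."""
    t = math.radians(theta_deg)
    return 0.5 * (math.cos(t) ** 2 / math.sin(t) + math.cos(t) ** 2 / t)


if __name__ == "__main__":
    for hkl in [(5, 1, 1), (4, 2, 2), (4, 2, 0)]:
        theta = bragg_angle_deg(hkl)
        print(hkl, round(theta, 1), round(nelson_riley(theta), 4))
```

With these assumed constants the three angles come out within about a degree of the quoted 82, 69 and 58°. In an extrapolation of this kind the apparent spacing from each line is plotted against the function and carried to zero, which corresponds to θ = 90°.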

3. Experimental results
(a) Pure aluminium

In the course of the present work nine determinations of the lattice spacing of
pure aluminium were made, involving the preparation of five completely different
samples of filings. The measured lattice spacings, reduced to 25° C, lay within the
extremes 4.0412(8) and 4.0414(2) kX, with a mean value of 4.0413(4) kX at 25° C.
This value is in good agreement with the values† 4.0413(6) kX of Jette & Foote
(1935), and 4.0413(1) kX of Wilson (1941), and is within 0.0002 kX of the values
reported by several other investigators.

(b) Solid solutions of lithium, silicon, copper and zinc in aluminium

The lattice spacings obtained in the course of the present work are given in
table 1, and the relations between lattice spacings and composition are shown in
figures 1 and 2. All these elements‡ produce a lattice contraction, and the following
points may be noted.

* Experimental work has shown that the differences between the coefficients of expansion
of the various alloys produce a negligible effect when the correction is for 10° C or less.

† In comparing the different papers care must be taken to see whether the correction for
refractive index has been applied.

‡ The data for the aluminium-lithium alloys are included in figure 2 to avoid overlapping
of points in figure 1.





Table 1

The table contains the measured lattice spacings of the aluminium-rich solid solutions
which have been investigated. All values have been corrected to 25° C.

solute      weight %    atomic %    lattice spacing
element     solute      solute      at 25° C (kX)

Li           0.91        3.44        4.0396(7)
             1.29        4.83        4.0392(7)
             1.93        7.11        4.0381(9)

Mg           0.31        0.35        4.0422(1)
             0.57        0.63        4.0428(6)
             0.73        0.81        4.0435(7)
             1.13        1.26        4.0458(3)
             2.14        2.37        4.0495(7)
             2.69        2.97        4.0611(2)
             3.17        3.50        4.0543(6)
             3.72        4.11        4.0563(9)
             5.06        5.58        4.0630(6)
             6.14        6.76        4.0688(7)
             8.86        9.73        4.0820(9)
             9.62       10.56        4.0860(8)

Si           0.24        0.23        4.0408(1)
             0.36        0.35        4.0406(5)
             0.70        0.67        4.0402(1)
             0.77        0.74        4.0400(4)
             0.81        0.78        4.0399(0)
             0.95        0.91        4.0397(3)
             0.96        0.92        4.0396(4)
             0.97        0.93        4.0396(0)

Cu           0.11        0.06        4.0411(7)
             0.93        0.40        4.0394(0)
             1.82        0.80        4.0372(1)
             2.88        1.23        4.0353(9)
             3.12        1.37        4.0347(8)
             3.71        1.62        4.0335(5)

Zn           1.05        0.43(6)     4.0412(2)
             4.09        1.70        4.0403(8)
             6.45        2.76        4.0396(7)
             6.78        2.91        4.0393(5)
            12.24        5.44        4.0373(3)
            13.33        5.96        4.0369(6)
            14.88        6.72        4.0360(5)

Ge           1.0         0.37        4.0418(8)
             2.34        0.88        4.0426(0)
             2.99        1.13        4.0431(1)
             3.68        1.40        4.0434(7)
             4.85        1.86        4.0444(8)
             5.06        1.98        4.0446(6)

Ag          10.0         2.7         4.0413(3)
            20.0         6.0         4.0413(6)

Pure aluminium of 99.9945 % purity   4.0413(4)
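The relation between the weight and atomic percentages in table 1 can be sketched as follows. The atomic weights are modern rounded values assumed for illustration, not those used by the authors, so agreement to a few units in the second decimal place is all that should be expected.

```python
# Sketch only: convert weight % of solute to atomic % in a binary aluminium alloy.
# Atomic weights are assumed modern values, not taken from the paper.
ATOMIC_WEIGHT = {
    "Al": 26.98, "Li": 6.94, "Mg": 24.31, "Si": 28.09,
    "Cu": 63.55, "Zn": 65.38, "Ge": 72.63, "Ag": 107.87,
}


def weight_to_atomic_percent(solute, weight_pct, solvent="Al"):
    """Atomic % of solute for a binary alloy containing weight_pct of solute."""
    n_solute = weight_pct / ATOMIC_WEIGHT[solute]
    n_solvent = (100.0 - weight_pct) / ATOMIC_WEIGHT[solvent]
    return 100.0 * n_solute / (n_solute + n_solvent)


if __name__ == "__main__":
    # A few rows of table 1: (solute, weight %, tabulated atomic %).
    rows = [("Li", 0.91, 3.44), ("Mg", 0.31, 0.35),
            ("Ge", 4.85, 1.86), ("Ag", 10.0, 2.7)]
    for solute, wt, tabulated in rows:
        computed = weight_to_atomic_percent(solute, wt)
        print(solute, wt, round(computed, 2), tabulated)
```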



In the system aluminium-zinc (figure 1) the lattice spacing/composition relation
is not linear, but is a curve such that the lattice contraction per atomic per cent of
solute increases with the concentration. The points lie well on a smooth curve with
one exception, for which the divergence is 0.0002 kX. In the system
aluminium-copper the lattice spacing/composition relation is linear. One point lies slightly
below the best straight line, but the difference is within the limits of the
experimental accuracy. In the system aluminium-silicon, a straight line can be drawn
through all the points within the limits of experimental accuracy. In this system
the solid solution is restricted, and experiments were made with two different rates
of quenching, in order to ensure that partial decomposition had not occurred. The
points marked A and B were obtained in this way, and their good agreement
suggests that the solid solution had been retained. In the system aluminium-lithium
the previous work of Zintl & Woltersdorf (1935) suggested that the lattice of
aluminium was expanded by the solution of lithium, whereas the data of Vosskuhler
(1937) indicated a contraction. The present work confirms that a contraction of the
lattice takes place, and as will be seen from figure 2, the lattice spacing/composition
line is slightly curved.

Figure 1

Figure 2

(c) Solid solutions of magnesium, germanium and silver in aluminium 

The lattice spacing of aluminium is expanded considerably by the solution of
magnesium and germanium, and is unaltered by silver. The results obtained are
given in table 1, and are shown in figures 1, 2 and 3.



In the system aluminium-magnesium, the limits of the aluminium-rich solid
solution are approximately 16 atomic % magnesium at 450° C, and 3 atomic %
magnesium at 200° C. It was found that decomposition during quenching was
effectively prevented by cooling in a blast of air. As will be seen from figure 3, ten
of the twelve observed points lie on a curve which, after 2 atomic % magnesium, is
practically a straight line, but which shows a definite curvature at the lower
percentages. The two remaining points lie above and below this line by amounts of
the order 0.001 Å, which is equivalent to 0.25 % on the composition axis. It is
thought that most of this discrepancy is due to the analyses rather than to the
lattice-spacing measurements, because the films were of good quality and could be
measured well. A great deal of attention was given to the question of the curvature
at low percentages of magnesium. The form of the lattice spacing/composition curve
naturally suggests the presence of an impurity which is removed on alloying, but it
would be necessary to assume a value of 4.0404 kX for the lattice spacing of the
solvent metal in order to produce a lattice spacing/composition curve which was
free from abnormal curvature at low percentages of magnesium. This would be in
contradiction to the results of all previous investigators, and would also lead to
improbable curvatures in the lattice spacing/composition curves for the other alloys
investigated, unless it were assumed that the hypothetical impurity remained
constant in these. No convincing explanation based on errors in analysis could be
found to explain the abnormal curvature, and in general the synthetic and analyzed
compositions of the specimens were in good agreement. It seems, therefore, that
additions of less than 1 atomic % of magnesium produce a slightly smaller expansion
in the lattice as compared with the effect of larger percentages. The present results
are in very good agreement with two points obtained for 0.01 and 9.01 atomic %
magnesium by Brick, Phillips & Smith (1935), who showed that the use of quenched
rods gave erroneous values owing to the straining of the specimen, although quenched
filings gave accurate results; unfortunately, these investigators did not examine
alloys with lower percentages of magnesium.
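The statement that a spacing discrepancy of the order 0.001 corresponds to roughly a quarter of one per cent on the composition axis can be checked from the slope of the nearly linear part of the magnesium curve. The sketch below fits a least-squares line to seven of the table 1 points above 2 atomic % magnesium; the choice of points and the fitting routine are ours, and the 2.97 atomic % row is left out because its tabulated spacing does not fall near the line.

```python
# Sketch only: slope of the Al-Mg lattice spacing/composition line above 2 atomic %.
# Points are (atomic % Mg, lattice spacing in kX) taken from table 1.
MG_POINTS = [
    (2.37, 4.04957), (3.50, 4.05436), (4.11, 4.05639), (5.58, 4.06306),
    (6.76, 4.06887), (9.73, 4.08209), (10.56, 4.08608),
]


def fit_line(points):
    """Ordinary least-squares straight line; returns (slope, intercept)."""
    n = len(points)
    mean_x = sum(x for x, _ in points) / n
    mean_y = sum(y for _, y in points) / n
    sxx = sum((x - mean_x) ** 2 for x, _ in points)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in points)
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


if __name__ == "__main__":
    slope, intercept = fit_line(MG_POINTS)
    print("expansion per atomic % Mg:", round(slope, 5))
    # Composition shift equivalent to a spacing error of 0.001.
    print("atomic % equivalent to 0.001:", round(0.001 / slope, 2))
```

On this selection the equivalent composition shift comes out a little above 0.2 atomic %, which is of the same order as the 0.25 % quoted.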

In the system aluminium-germanium, the lattice spacing/composition relation is
very slightly curved, and as will be seen from figure 2 the experimental points are 
quite consistent. As explained on p. 4 the analyses of these filings gave rise to 
difficulties, and much additional work had to be carried out before the problem 
was solved. As a result of this work it was shown that the intended compositions of 
these alloys were attained to a high degree of precision, and the point marked with 
an open square which refers to an unanalyzed specimen has therefore been included. 

In the system aluminium-silver the addition of 6 atomic % of silver produces no
measurable change in the lattice spacing of aluminium. Only two alloys have been analyzed,
and the points for these are shown in figure 1, but three measurements were also
made for other alloys. The results confirmed that no detectable change in lattice
spacing could be observed. The microstructures of the annealed and quenched
alloys were therefore examined, and were found to indicate a homogeneous solid
solution. The constancy of the lattice spacing appears therefore to be a genuine
characteristic of the solid solution, and does not indicate a negligibly small solid
solubility of silver in aluminium. This was confirmed by the absence of extra lines
on the X-ray films.

4. Discussion

(a) The apparent atomic diameters 

The results described in § 3 show clearly that the lattice spacing of aluminium is
expanded by the solution of magnesium and germanium, and contracted by the
solution of lithium, silicon, copper and zinc. In the alloys with silicon, copper and
silver the lattice spacing/composition curves are straight lines, and if these are
extrapolated to 100 atomic % of solute we obtain what may be called the apparent
lattice spacings a of the solute in a hypothetical face-centred cubic structure, and
the corresponding apparent atomic diameters are equal to ½(a√2); we shall use the
abbreviation a.a.d. for this quantity. In the alloys with lithium and zinc, the lattice
spacing/composition curves are not exactly straight lines, but the curvature is so
slight that tangents can easily be drawn to the curves at zero concentration of the
solute, and the corresponding a.a.d. values can be obtained. In the alloys with
magnesium, the abnormal curvature referred to on p. 8 makes the above
treatment less certain, and we have therefore estimated two a.a.d. values, one referring
to the curve at zero concentration, and the other to the almost linear portion of
the curve.
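The extrapolation described above can be sketched numerically for the two systems with linear relations and a measurable slope, copper and silicon. The code fits a least-squares line to the table 1 points (with the pure aluminium value as the zero-concentration point), carries it to 100 atomic % of solute, and divides the resulting spacing by √2. The fitting routine is our own choice; the authors drew their lines graphically.

```python
# Sketch only: apparent atomic diameter (a.a.d.) by linear extrapolation to 100 % solute.
import math

PURE_AL = (0.0, 4.04134)  # (atomic % solute, lattice spacing in kX)

# (atomic % solute, lattice spacing in kX) from table 1.
CU_POINTS = [PURE_AL, (0.06, 4.04117), (0.40, 4.03940), (0.80, 4.03721),
             (1.23, 4.03539), (1.37, 4.03478), (1.62, 4.03355)]
SI_POINTS = [PURE_AL, (0.23, 4.04081), (0.35, 4.04065), (0.67, 4.04021),
             (0.74, 4.04004), (0.78, 4.03990), (0.91, 4.03973),
             (0.92, 4.03964), (0.93, 4.03960)]


def fit_line(points):
    """Ordinary least-squares straight line; returns (slope, intercept)."""
    n = len(points)
    mean_x = sum(x for x, _ in points) / n
    mean_y = sum(y for _, y in points) / n
    sxx = sum((x - mean_x) ** 2 for x, _ in points)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in points)
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def apparent_atomic_diameter(points):
    """Closest distance of approach in the hypothetical f.c.c. solute (kX)."""
    slope, intercept = fit_line(points)
    apparent_spacing = intercept + 100.0 * slope  # extrapolated to pure solute
    return apparent_spacing / math.sqrt(2.0)


if __name__ == "__main__":
    print("Cu in Al:", round(apparent_atomic_diameter(CU_POINTS), 2))
    print("Si in Al:", round(apparent_atomic_diameter(SI_POINTS), 2))
```

Because the extrapolation runs from a few atomic per cent to 100, small changes in the fitted slope move the result in the second decimal place, which is why the test of such a sketch can only be agreement to about 0.01 to 0.02 kX with the tabulated a.a.d. values.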

In this way the a.a.d. values for the above elements, when dissolved in aluminium,
can be obtained, and existing data enable us to deduce the corresponding a.a.d.
values for Al, Si, Zn, Ge and Ag when dissolved in copper, and for Mg, Al, Cu, Zn
and Ge when dissolved in silver, and for Al, Cu, Zn and Ge when dissolved in gold.
Since copper, silver and gold are univalent solvents the first Brillouin zone is only
half filled, and the a.a.d. values deduced from the lattice spacings of dilute solutions
are therefore free from any complications resulting from b.z.* effects, and refer to
a crystal structure (face-centred cubic) which is the same as that of aluminium.
For solid solutions in magnesium the variation of the a-spacing with composition
enables a.a.d. values to be obtained for lithium, aluminium and silver. These refer
to a condition after a b.z. overlap,† and since the crystal structure is normal
close-packed hexagonal (c/a = 1.6235), the co-ordination number is 12 as in the
face-centred cubic structure. For abbreviation we use the symbol a to denote the
distance between an atom and its six neighbours in the basal plane, and the
symbol d for the distance between an atom and its six other close neighbours,
three in the layer above and three in the layer below. In the ideal case where the
axial ratio equals √(8/3), a and d are equal.
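The two neighbour distances in a close-packed hexagonal crystal follow from simple geometry: the out-of-plane neighbours lie at d = √(a²/3 + c²/4). The sketch below evaluates this for the ideal axial ratio √(8/3), for magnesium (axial ratio about 1.6235) and for zinc. The rounded a-spacings of 3.20 kX for magnesium and 2.66 kX for zinc, and the zinc axial ratio of 1.856, are assumed values used only for illustration.

```python
# Sketch only: neighbour distances a and d in a close-packed hexagonal crystal.
import math

IDEAL_AXIAL_RATIO = math.sqrt(8.0 / 3.0)  # about 1.633


def hcp_neighbour_distances(a, axial_ratio):
    """Return (a, d): in-plane and out-of-plane nearest-neighbour distances.

    d joins an atom to the three atoms above (or below) it, which sit over
    the centres of basal triangles at a height of c/2.
    """
    c = a * axial_ratio
    d = math.sqrt(a * a / 3.0 + c * c / 4.0)
    return a, d


if __name__ == "__main__":
    print("ideal:", hcp_neighbour_distances(1.0, IDEAL_AXIAL_RATIO))
    # Assumed rounded values for illustration.
    print("Mg:", hcp_neighbour_distances(3.20, 1.6235))
    print("Zn:", hcp_neighbour_distances(2.66, 1.856))
```

For magnesium the axial ratio is slightly below the ideal value, so d is marginally smaller than a; for zinc the ratio is well above it and d exceeds a by about 9 %.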

The above procedure enables us to compare a.a.d. values of the elements concerned
when in dilute solid solution in univalent copper, silver and gold, divalent magnesium
and trivalent aluminium. All these solvents crystallize with structures of
co-ordination number 12, and so far as the crystal structures of the solvents are
concerned, the conditions for comparison are favourable, and comparison may also be
made directly with the atomic diameter (a.d.) values (i.e. the closest distances of
approach) in the crystals of the elements aluminium, magnesium, copper, silver
and gold.

* For abbreviation the letters b.z. are used for Brillouin zone.

† The variation of c-spacing in solid solution in magnesium cannot be used to calculate
a.a.d. values because as little as 1 atomic % of a trivalent element causes a new b.z. overlap
in this direction (see Hume-Rothery & Raynor 1940; Raynor 1942).

Lithium crystallizes in the body-centred cubic structure of co-ordination number
8, whilst silicon and germanium crystallize in the diamond type of structure of
co-ordination number 4, and it is clear that the different co-ordination numbers
introduce a further complication in comparing the a.d. and a.a.d. values. According
to Goldschmidt, contractions of 4 and 12 % result from changes of co-ordination
number 12→8 and 12→4 respectively. Examination of the original Goldschmidt
papers (1928) shows that many of the so-called Goldschmidt atomic diameters were
obtained from X-ray crystal data for metallic phases which are now known to
involve b.z. overlaps, so that the whole position is much more complicated than was
originally imagined (see p. 15). The above contractions of 4 and 12 % must therefore
be regarded only as giving a rough indication of the magnitudes involved, and for
this reason we have not tabulated values of the Goldschmidt atomic diameters,
although we shall refer to these occasionally. The data obtained are summarized in
table 2, which also gives the a.a.d. values of gallium, cadmium, indium and tin referred
to later. Examination of table 2 will show that for the solid solutions in aluminium
the following principles are apparent:

(a) The tetravalent elements silicon and germanium when in solid solution in
aluminium give a.a.d. values greater than those for solid solutions of the same
elements in univalent solvents, and greater than the a.d. values. The effect is so
marked that the a.a.d. values would be greater than the a.d. values increased by
12 %, i.e. greater than the Goldschmidt atomic diameters.

(b) The univalent elements lithium, copper and silver when in solid solution in
aluminium give a.a.d. values smaller than those for solid solutions in univalent
elements, and smaller than the a.d. values; this effect is particularly marked for
lithium.

Table 2

                                                    solutes
solvent                 Li     Mg            Al     Si     Cu     Zn          Ga     Ge             Ag     Cd     In     Sn

copper                  -      -             2.71   2.59   -      2.70        2.75   2.78           2.95   3.10   3.20   3.26
silver                  -      2.94 to 2.95  2.97   -      2.63   2.76        2.83   2.90           -      3.03   3.10   3.18
gold                    -      -             2.80   -      2.63   2.74        2.83   2.93           -      3.00   3.07   3.14
magnesium (a)           3.11   -             2.81   -      -      -           2.83   -              2.64   3.00   3.08   3.13
magnesium (d)           3.00   -             2.85   -      -      -           -      -              2.53   -      3.16   -
aluminium               2.82   3.15 or 3.04  -      2.73   2.51   2.85        -      2.96 or 2.98*  2.86   -      -      -

interatomic distances   3.03   3.19          2.88   2.35   2.55   a = 2.66    -      2.44           2.88   -      -      -
in element                                                        d = 2.90
co-ordination number    8      12            12     4      12     6           -      4              12     -      -      -
in element

Table 2 shows a.a.d. values for solid solutions of
Al, Si, Zn, Ge, Ag, Ga, Cd, In and Sn in copper,
Mg, Al, Cu, Zn, Ge, Ga, Cd, In and Sn in silver,
Al, Cu, Zn, Ge, Ga, Cd, In and Sn in gold,
Li, Al, Ag, Ga, In and Sn in magnesium,
Li, Mg, Si, Cu, Zn, Ge and Ag in aluminium,
together with values for the interatomic distances in elemental crystals of Li, Mg, Al, Si, Cu, Zn, Ge and Ag.

* The two estimates for the a.a.d. value of Ge in Al arise from the curvature of the lattice spacing/
composition curve in a manner similar to that already discussed for the solution of Mg in Al.




(c) The divalent elements magnesium and zinc when dissolved in aluminium give
a.a.d. values which are slightly larger than those for solid solutions in univalent
elements. The a.a.d. values from solutions in aluminium are smaller than the
distances of approach in the crystals of zinc (d = 2.907 kX) and magnesium
(d = 3.20 kX) in the direction of the main b.z. overlaps. The a.a.d. value of zinc
from solution in aluminium (2.85 kX) is larger than the a distance of approach in
the crystal of zinc (2.66 kX); this point is discussed later.
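Principles (a) to (c) can be checked directly against the entries of table 2. The sketch below transcribes the relevant values (in kX) into small dictionaries, whose names are ours, and tests each inequality; the 12 % allowance for the change of co-ordination number 12→4 is the Goldschmidt figure quoted in the text.

```python
# Sketch only: numerical check of principles (a), (b) and (c) using table 2 values (kX).
AAD_IN_AL = {"Li": (2.82,), "Mg": (3.15, 3.04), "Si": (2.73,), "Cu": (2.51,),
             "Zn": (2.85,), "Ge": (2.96, 2.98), "Ag": (2.86,)}

# a.a.d. values of the same solutes in the univalent solvents Cu, Ag and Au.
AAD_IN_UNIVALENT = {"Mg": (2.94, 2.95), "Si": (2.59,), "Cu": (2.63, 2.63),
                    "Zn": (2.70, 2.76, 2.74), "Ge": (2.78, 2.90, 2.93),
                    "Ag": (2.95,)}

# Closest distances of approach (a.d.) in the elements themselves.
AD_ELEMENT = {"Li": 3.03, "Si": 2.35, "Cu": 2.55, "Ge": 2.44, "Ag": 2.88}


def principle_a(solute):
    """Tetravalent solutes: a.a.d. in Al exceeds univalent-solvent values and 1.12 x a.d."""
    smallest = min(AAD_IN_AL[solute])
    return (smallest > max(AAD_IN_UNIVALENT[solute])
            and smallest > 1.12 * AD_ELEMENT[solute])


def principle_b(solute):
    """Univalent solutes: a.a.d. in Al is below univalent-solvent values and the a.d."""
    largest = max(AAD_IN_AL[solute])
    others = AAD_IN_UNIVALENT.get(solute, ())
    return largest < AD_ELEMENT[solute] and all(largest < v for v in others)


def principle_c(solute):
    """Divalent solutes: a.a.d. in Al is larger than in univalent solvents."""
    return min(AAD_IN_AL[solute]) > max(AAD_IN_UNIVALENT[solute])


if __name__ == "__main__":
    print("(a)", {s: principle_a(s) for s in ("Si", "Ge")})
    print("(b)", {s: principle_b(s) for s in ("Li", "Cu", "Ag")})
    print("(c)", {s: principle_c(s) for s in ("Mg", "Zn")})
```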

(b) Vegard's law

According to Vegard's law (1928) the lattice spacing/composition curve for a
solvent and solute of the same crystal structure is a straight line joining the values
for the two pure metals, and a similar relation with a correction for effects of
co-ordination number exists for the a.d. values when the two metals have different
structures. This law was originally (1921) advanced for salts in which oppositely
charged ions are drawn together by Coulomb forces until their electron clouds
overlap and produce intense repulsive forces which balance the attraction. It was
later (1928) extended to metals, and although it is now recognized that many
exceptions exist, there is still a general tendency (cf. Huang 1947) to regard
Vegard's law as a valid approximation. It appears to us that this view is incorrect,
and that in a metal where the attractive forces are due to the electron gas, it is only
in exceptional circumstances that we shall expect Vegard's law to hold. We may
illustrate the principles involved by considering the solid solution of lithium in
aluminium. Lithium itself crystallizes in the body-centred cubic structure with
a = 3.50 kX, and a.d. = 3.03 kX. The Li⁺ ion is very small compared with the
closest distance of approach between two lithium atoms, so that the ions play a
negligible part in determining the interatomic distances. The volume per valency
electron, for which we may use the symbol V_e,* is 21.5 (kX)³, so that the electron gas
of the valency electrons is very dilute. The detailed calculations of Bardeen (1938)
show that the electron density of the valency electrons is reasonably constant over
a large fraction of the total volume, and that it is only near to the ions that the nodes,
etc., of the wave function produce marked variations in the valency electron
density. The metal has a high compressibility (K = 8.9 × 10⁻¹² cm.²/dyne) which is
due mainly to the valency electrons, and only affected very slightly by the extremely
small ionic overlap. We may therefore regard the crystal of metallic lithium as
consisting of small Li⁺ ions swimming in a very dilute valency electron gas, which is
responsible both for the attraction holding the structure together, and for the
repulsion which gives rise to the resistance to compression. As the attractive forces
are due to the electron gas as a whole we cannot think of directed forces between
adjacent atoms, but we may perhaps represent the relations involved by a model
such as that of figure 4. In this figure the different atoms are represented by squares

* For abbreviation we shall omit the units (kX)³ in referring to the V_e values of different
elements.




held together by very weak springs,* V_a representing the weak attraction of the
dilute valency electron gas, and held apart by the very weak springs V_r representing
the repulsion due to the valency electron gas. The much stronger springs I_r represent
the repulsion of the electron clouds of the ions which is not yet effective because of
the large interatomic distances. The interatomic distance 3.03 kX in lithium refers
to a body-centred cubic structure, but the general condition of affairs would be
the same in a hypothetical face-centred cubic structure, for which Goldschmidt's
co-ordination factors would give an interatomic distance of 3.12 kX.


Figure 4

In the face-centred cubic structure of aluminium a.d. = 2.86 kX and the atoms
are closer together than in lithium. There are now three valency electrons per atom,
and V_e = 5.51, so that the valency electron gas is on the average about four times as
concentrated in aluminium as in lithium. This concentrated electron gas is also less
uniformly distributed than the dilute electron gas in lithium or sodium. As we
proceed along the series Na→Mg→Al, the V_e values become progressively smaller,
and the valency electron densities progressively greater, and at the same time the
valency electrons tend to be more concentrated near to the ions. The melting-point
of aluminium (650° C) is much higher than that of lithium (152° C), whilst its
compressibility is much lower (K = 1.37 × 10⁻¹² cm.²/dyne). The attractive and repulsive
forces are thus much stronger in aluminium, and are due mainly to the valency
electrons, for the Al³⁺ ionic radii are smaller than the distances between the atoms
in metallic aluminium, although the ionic overlap is greater than in an alkali metal.
We may therefore represent the condition of affairs in aluminium by a diagram such
as that of figure 5, where the valency attractive springs V_a and repulsive springs V_r


* The springs will be non-linear and will not obey Hooke’s law. 







are much stronger than the corresponding springs in figure 4 for lithium, and the 
ionic repulsion springs are more nearly touching. 
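
The V_e figures quoted for lithium and aluminium follow from the crystal structure and the closest distance of approach. The Python sketch below reproduces them under stated assumptions: the function name and the structure table are illustrative and do not come from the paper, and the cube-edge relations a = 2d/√3 (body-centred cubic) and a = d√2 (face-centred cubic) are the standard geometrical ones.

```python
import math

# Atoms per cubic unit cell, and the factor converting the closest distance
# of approach d into the cube edge a (standard geometry; names are illustrative).
STRUCTURES = {
    "bcc": {"atoms_per_cell": 2, "edge_over_d": 2.0 / math.sqrt(3.0)},
    "fcc": {"atoms_per_cell": 4, "edge_over_d": math.sqrt(2.0)},
}

def volume_per_valency_electron(d_kx, structure, valency):
    """Return V_e in (kX)^3 for a cubic metal with closest approach d_kx."""
    s = STRUCTURES[structure]
    edge = s["edge_over_d"] * d_kx
    volume_per_atom = edge ** 3 / s["atoms_per_cell"]
    return volume_per_atom / valency

ve_li = volume_per_valency_electron(3.03, "bcc", 1)  # lithium, univalent
ve_al = volume_per_valency_electron(2.86, "fcc", 3)  # aluminium, trivalent
print(f"Li: V_e = {ve_li:.1f}  Al: V_e = {ve_al:.2f}  ratio = {ve_li / ve_al:.1f}")
```

With the rounded distances used here the lithium figure comes out slightly below the 21.5 obtained from a = 3.50 kX, and the ratio of the two volumes is a little under four, in line with the statement that the electron gas is about four times as concentrated in aluminium.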

We can now readily understand why the a.a.d. of lithium (2.82 kX) when dissolved
in aluminium is so much smaller than the a.d. value (3.03 kX). For the a.d. value
of 3.03 kX refers to the equilibrium of the very weak forces represented by the
weak springs of figure 4, but when one of these atoms is substituted for an
aluminium atom, the much stronger forces of the aluminium lattice, represented
by the stronger springs of figure 5, will cause an almost complete collapse of the
weak springs. In this connexion there is a difference between the behaviour of
a 1-dimensional row, and a 2- or 3-dimensional lattice. In a 1-dimensional row,
a single atom could exert its own influence, but in 2- or 3-dimensional lattices a
distortion round one atom affects the interatomic distances between other atoms.
We can therefore understand why the a.a.d. (2.82 kX) of lithium when dissolved
in aluminium is nearly the same as the a.d. of aluminium itself (2.86 kX), and
since the ion of lithium is smaller than the ion of aluminium we can perhaps
also understand why the a.a.d. of lithium in aluminium is slightly smaller than
2.86 kX.


Figure 5


If the above line of argument be accepted, it will be realized that it is only in
exceptional cases that the so-called Vegard's law can be expected to hold for metals.
So far as we have been able to discover, the system calcium-strontium is the only
alloy system in which a linear relation holds between the lattice spacing and atomic
percentages of the two metals. It is perhaps significant that in this case the V_e value
for strontium is larger than that for calcium, whilst its ionic radius is also larger.
When strontium is dissolved in calcium we expect the valency electron effect to
produce a smaller a.a.d. for the same reason as that described above for lithium in
aluminium, but the larger ion of strontium may produce a counterbalancing
expansion.
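
The size of the departure from Vegard's law for lithium in aluminium can be put into numbers with a short Python sketch. It assumes that the a.a.d. is the value reached when the initial straight-line part of the observed spacing/composition curve is extended to 100 % solute; the function name and the 5 atomic % composition are hypothetical choices made only for illustration, while the three distances are those quoted in the text.

```python
def vegard_distance(x_solute, d_solvent, d_solute):
    """Linear (Vegard) interpolation of the mean interatomic distance."""
    return (1.0 - x_solute) * d_solvent + x_solute * d_solute

D_AL = 2.86          # a.d. of aluminium, kX (from the text)
D_LI_FCC = 3.12      # lithium corrected to co-ordination number 12, kX (from the text)
AAD_LI_IN_AL = 2.82  # a.a.d. of lithium dissolved in aluminium, kX (from the text)

x = 0.05  # hypothetical composition: atomic fraction of lithium
predicted = vegard_distance(x, D_AL, D_LI_FCC)
# Assumption: the a.a.d. is the straight-line extrapolation of the observed
# curve to pure solute, so the same formula gives the observed initial trend.
observed_trend = vegard_distance(x, D_AL, AAD_LI_IN_AL)
print(f"Vegard: {predicted:.3f} kX   from a.a.d.: {observed_trend:.3f} kX")
```

On these figures Vegard's law would require the aluminium lattice to expand on adding lithium, whereas the a.a.d. implies a slight contraction, which is the discrepancy the argument above sets out to explain.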



(c) Factors controlling the apparent atomic diameters 

The exact a.a.d. values of one metal in different solvents will clearly depend on 
the exact details of the atomic fields and cannot be predicted, but we suggest that 
a general interpretation may be obtained by considering the following controlling 
factors. 

(1) The V_e effect

In the above description we have seen why the a.a.d. of lithium when dissolved
in aluminium is so much smaller than the a.d. for metallic lithium. In this case the
V_e values for lithium and aluminium are 21.7 and 5.5 respectively. The V_e values for
the remaining elements are given in table 3, and we suggest that as a first
approximation these may be used to indicate the direction of effects analogous to those described
for lithium and aluminium. That is to say, if the V_e value of the solute is greater than
that of the solvent, we expect the a.a.d. to be made smaller, and, conversely, if the
V_e of the solvent is larger than that of the solute we expect the a.a.d. to be made
larger. This simple effect may then be obscured by the complicating factors described
below.

Table 3

element     V_e for co-ordination     element     V_e for co-ordination
            no. 12                                no. 12

Li          21.7                      Ga          c. 6.7
Cu          11.7                      In          c. 8.9
Ag, Au      17.0                      Si          3.2
Mg          11.6                      Ge          3.6
Zn          6.7                       Sn          5.6 grey tin
Cd          9.0                                   5.5 white tin
Al          5.51
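
The rule of thumb for the V_e effect can be written as a small lookup. The Python sketch below is only an illustration: the function name and the tolerance used to call two V_e values "about equal" are hypothetical, and only those entries of table 3 that are stated without a "c." qualifier are included.

```python
# V_e values for co-ordination number 12, transcribed from table 3.
V_E = {"Li": 21.7, "Cu": 11.7, "Ag": 17.0, "Au": 17.0, "Mg": 11.6,
       "Zn": 6.7, "Cd": 9.0, "Al": 5.51, "Si": 3.2, "Ge": 3.6}

def ve_effect(solute, solvent, tolerance=0.2):
    """Direction in which the V_e effect alone is expected to shift the a.a.d."""
    difference = V_E[solute] - V_E[solvent]
    if abs(difference) <= tolerance:  # hypothetical threshold for "about equal"
        return "negligible"
    # A solute with the larger V_e is squeezed by the denser solvent electron gas.
    return "smaller a.a.d." if difference > 0 else "larger a.a.d."

for solute, solvent in [("Li", "Al"), ("Zn", "Ag"), ("Ge", "Cu")]:
    print(f"{solute} in {solvent}: {ve_effect(solute, solvent)}")
```

The result gives only the expected sign of the V_e effect; the ionic radius, Brillouin zone and electrochemical factors discussed in the text may outweigh it.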




(2) The effect of ionic radius 

We suggest that if the ionic radius of the solute is larger than that of the solvent,
the effect is to increase the a.a.d. of the former, and, conversely, if the ionic radius
of the solute is the smaller, its a.a.d. tends to be smaller. This effect is, however,
only marked if a relatively large ion is being forced into a structure in which its
electron cloud begins to overlap the electron cloud of the ions of the solvent, e.g. when
silver is dissolved in copper. Figure 6 shows the univalent ionic radii of the elements
concerned, and these may be taken (Hume-Rothery & Raynor 1940) as the least
unsatisfactory values if the 'size' of an ion is to be represented by a single number;
they must not, of course, be taken to indicate a sharp boundary surface to the ion.
Thus in metallic copper, the electron clouds of the ions overlap appreciably, and
are responsible for the low compressibility, although the univalent radius of the
Cu⁺ ion is only 0.96 kX as compared with a half distance of approach of 1.27 kX.
It has also been emphasized (Hume-Rothery & Raynor 1938) that in the elements
following an alkali metal (e.g. Na→Mg→Al→Si) the electron clouds of the ions
shrink less rapidly than the interatomic distances, so that the crystal structures




become less 'open' with increasing valency, whereas in the elements following copper,
silver and gold (e.g. Cu→Zn→Ga→Ge) the ion shrinks the more rapidly and the
structures become increasingly more open.



Figure 6. Atomic and ionic radii of the elements, plotted against atomic number of
element. (Reproduced from Phil. Mag. 1938, 26, 136.) ● ½ (interatomic distance in the
crystal of the element). ⊗ Zachariasen univalent ionic radii. ○ Pauling univalent ionic
radii. × Goldschmidt empirical ionic radii.

(3) The effects of Brillouin zone overlaps

It has been shown by Jones (1934) that the effect of a B.z. overlap is to produce
an expansion of the crystal in the direction concerned. Hume-Rothery & Raynor
(1940) and Raynor (1942) have shown that when 3- and 4-valent elements are
dissolved in magnesium, the lattice spacing/composition curves for the a-spacings are
smooth, whilst those for the c-spacings show a kink in the direction of an expansion
after a small percentage of solute (see figure 7). This was taken to imply that in pure
magnesium a B.z. overlap had already occurred in the a direction, but not in the
c direction, although the latter overlap occurred after a very slight increase in the
electron concentration. Hume-Rothery & Raynor suggested that the a.d. in
aluminium (2.86 kX) was greater than its a.a.d. when dissolved in copper (2.71 kX)
and silver or gold (2.79-2.80 kX), because no B.z. overlap was concerned in the
univalent solvents. From table 2 it is clear that the position is not really so simple;
thus the a.a.d. value for indium dissolved in magnesium (3.08 kX) is almost the same
as the values (3.07, 3.11 kX) for solutions in gold and silver,* whilst the a.a.d.
value for gallium dissolved in magnesium (2.83 kX) is smaller than those (2.835,
2.838 kX) for solutions in silver and gold. It is, of course, clear that with a lattice
spacing/composition curve such as that of figure 7, extrapolation of the curve
immediately after the overlap will give a higher a.a.d. for the element of higher

* The solid solution of indium in copper is affected by the overlapping of the ions as 
described below. 





valency than the a.a.d. obtained by extrapolation of the curve immediately before 
the overlap, and similarly extrapolation of a curve after an overlap will give a lower 
a.a.d. value for the element of lower valency. If we consider a continually increasing 
electron concentration, it seems probable that the expansion produced by each 
successive b.z. overlap will remain at higher electron concentrations, but the effects 
of sharp peaks and valleys in the N(E) curve on the lattice spacings cannot be 
predicted without further details. We may therefore expect a general tendency for 
a b.z. overlap to produce an expansion, and for increasing and decreasing electron 
concentration immediately after an overlap to produce large and small a.a.d. values 
respectively, but the results are more complicated than those imagined previously. 



solute, atomic % 

Figure 7 

When we consider the lattice spacing/composition curves for 3- and 4-valent
solutes in magnesium, extrapolation of the a-spacing curve, and of the c-spacing
curve after the kink, will give us a and c values for a hypothetical structure with B.z.
overlaps in both a and c directions, and the corresponding a.a.d. values can be 
calculated for the a and d distances of approach. The c-spacing curve before the kink 
is too short for extrapolation. In contrast to this if we consider the corresponding 
curves for a univalent metal dissolved in magnesium, extrapolation will give us 
a and c values for a structure with no overlap in the c direction. 
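
How the extrapolated value depends on which side of the kink is used can be shown with a minimal Python sketch. The spacings below are HYPOTHETICAL numbers invented only to produce a kink in the direction of an expansion; they are not measurements from any alloy, and the function names are illustrative. The section before the kink is extrapolated here purely for comparison, although in practice it is too short for this to be done reliably.

```python
def fit_line(points):
    """Least-squares straight line through (x, y) points; returns (slope, intercept)."""
    n = len(points)
    mean_x = sum(x for x, _ in points) / n
    mean_y = sum(y for _, y in points) / n
    sxx = sum((x - mean_x) ** 2 for x, _ in points)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in points)
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x

def extrapolate_to_pure_solute(points):
    """Extend the fitted line to 100 atomic % solute."""
    slope, intercept = fit_line(points)
    return slope * 100.0 + intercept

# HYPOTHETICAL c-spacings (kX) against solute content (atomic %), with a
# kink at 1 atomic %; invented for illustration only.
before_kink = [(0.0, 5.2000), (0.5, 5.1990), (1.0, 5.1980)]
after_kink = [(1.0, 5.1980), (2.0, 5.1975), (3.0, 5.1970), (4.0, 5.1965)]

c_before = extrapolate_to_pure_solute(before_kink)
c_after = extrapolate_to_pure_solute(after_kink)
print(f"extrapolated c: before kink {c_before:.3f}, after kink {c_after:.3f}")
```

Because the slope after the kink is less negative, the line fitted there extrapolates to the larger spacing, and hence to the larger a.a.d. for a solute of higher valency.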

In the earlier papers of Hume-Rothery & Raynor it was assumed that the
condition of affairs in aluminium was one following after the (200) B.z. overlap of the
face-centred cubic structure, since this was what was indicated by the simple B.z.





theory of Mott & Jones (1937). It has lately been shown by Matyas* that the
symmetry of the s and p states results in an N(E) curve of the type shown in figure 8
in which, although the first zone is very nearly filled, the (200) overlap has not taken
place. The unoccupied states in the first zone lie in thin slabs parallel to the (200)
faces of the zone. This condition of affairs is quite different from that of the simple
theory of electrons in the face-centred cubic structure, according to which the
occupied states would have reached to, and spread out from, the centres of the (200)
faces.



Figure 8 

The new N(E) curve is of very great interest, and offers one explanation of the 
scarcity of elements which form wide solid solutions in aluminium. For with 
elements of both higher and lower valencies, the formation of an appreciable solid 
solution would alter the electron concentration so that the latter corresponded to 

* We have to thank Dr Matyas for allowing us to use these results. 



a lower position on the N(E) curve, and hence to a structure which became relatively
less stable compared with other possible structures whose N(E) curves remained
high. Actually the various equilibrium diagrams suggest that at low temperatures
no element forms an appreciable solid solution in aluminium, whilst at high
temperatures zinc is the only element whose solubility exceeds 20 atomic %. The
complete absence of any solid solutions at low temperatures is not explained by
the above N(E) curve, since with a slight solid solution the electron concentration
would still correspond to a relatively high position on the N(E) curve. There is
also no obvious reason why the trivalent elements gallium and indium do not form
wide solid solutions in aluminium, and the explanation of this must presumably
lie in the details of the atomic fields, as distinct from simple electron concentration
effects.

The N(E) curve of Matyas shows that in pure aluminium there will be slightly
more than one electron per atom in the states of the second zone, and the Fermi
surface will lie well beyond the point at which the (111) overlap begins. So far as
B.z. effects are concerned, we shall therefore expect the aluminium lattice to be
expanded by the (111) overlap,* but in aluminium-rich alloys we shall not be in the
region of electron concentration immediately following an overlap.

(4) The effect of the electrochemical factor 

It has already (Hume-Rothery 1946) been suggested that when the solvent and
solute differ markedly in the electrochemical series the effect is to introduce forces 
of a partly ionic nature, and we may expect this to produce a contraction of the 
lattice which will make the a.a.d. smaller. 

(d) Detailed examination of the apparent atomic diameters in different solvents 

If the preceding arguments are accepted we may expect the a.a.d. values to be
determined by the interplay of (1) the V_e effect, (2) the effect of ionic radius, (3) the
electrochemical factor and (4) B.z. effects. These different factors may either reinforce
or oppose one another, so that the whole problem is very complicated. No suggestion
is made that the details can be predicted, but the following tentative interpretation
of the figures in table 2 is offered as indicating that some of the principles have been
correctly identified. For convenience we may divide the data under the following
sub-headings.

Solid solutions in which the V e effect tends to produce a small a.a.d. value 

Of the systems concerned in table 4, the general tendency is clearly for the a.a.d.
to be smaller than the a.d. An exception is presented by the solid solution of silver
in copper, and this is the only case in which the formation of the solid solution
involves the forcing of a larger ion into a structure which is already of the full type.
This is therefore the system in which the effect of a larger ionic radius in producing
an expansion is likely to be the greatest, and we can understand why the a.a.d.
is 0.07 kX greater than the a.d.

* Hume-Rothery & Raynor argued on the assumption of both (111) and (200) overlaps. 




For the solid solution of lithium in magnesium the a.a.d. values are a = 3.11,
and d = 3.00 kX, as compared with an a.d. value 3.03 kX. This last value refers to
a co-ordination number of 8, and if we use the Goldschmidt correction factor for
change of co-ordination no. 12 → 8, the value for a hypothetical face-centred cubic
modification of lithium would be 3.13 kX. The a.a.d. values for lithium dissolved
in magnesium are thus smaller than the a.d. value when corrected for co-ordination
number. The magnesium-lithium data are from work by Hofmann (1936), and the
experimental errors probably give too large* a.a.d. values, so that the actual
difference between the a.a.d. and corrected a.d. values may be greater than that
given.
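
The co-ordination correction used in this comparison can be sketched in Python. The ratios in the table are the commonly quoted Goldschmidt values and are an assumption here, since the text gives only the corrected distances; the function name is illustrative.

```python
# Ratio of the interatomic distance at a given co-ordination number to that
# at co-ordination number 12. Commonly quoted Goldschmidt ratios, assumed
# here; the paper itself states only the corrected results.
GOLDSCHMIDT_RATIO = {12: 1.00, 8: 0.97, 6: 0.96, 4: 0.88}

def to_coordination_12(distance_kx, coordination_number):
    """Convert an observed interatomic distance to its co-ordination 12 equivalent."""
    return distance_kx / GOLDSCHMIDT_RATIO[coordination_number]

li_fcc = to_coordination_12(3.03, 8)  # lithium: body-centred cubic, 8 neighbours
print(f"lithium, hypothetical face-centred cubic form: {li_fcc:.2f} kX")
for label, aad in [("a", 3.11), ("d", 3.00)]:
    print(f"a.a.d. ({label}) in magnesium lies {li_fcc - aad:.2f} kX below this")
```

With these assumed ratios the corrected lithium distance comes out within about 0.01 kX of the figure quoted in the text, and both a.a.d. values in magnesium fall below it.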


Table 4

solute      solvent     effect of                 A.A.D.                A.D.
and V_e     and V_e     ionic radius

Li 21.7     Al 5.5      contract                  2.825                 3.03
Li 21.7     Mg 11.6     contract                  a = 3.11, d = 3.00    3.03
Cu 11.7     Al 5.5      expand                    2.51                  2.55
Ag 17.0     Al 5.5      expand                    2.86                  2.88
Ag 17.0     Mg 11.6     expand                    a = 2.64, d = 2.83    2.88
Ag 17.0     Cu 11.7     large expansion:          2.95                  2.88
                        ionic overlap
Mg 11.6     Al 5.5      expand                    3.04 or 3.15          a = 3.19, d = 3.20
Zn 6.7      Al 5.5      expand                    2.85                  a = 2.66, d = 2.91


For the solid solution of zinc in aluminium the V_e value of the solvent is only slightly
less than that of the solute, so that we expect the V_e effect to be small. The effect of
the ionic radius will also be small but tends to produce an expansion. The a.a.d.
value of 2.85 kX lies between the two sets of interatomic distance in zinc, but the
crystal structure of the latter is so abnormal that detailed comparison is hardly
justified.†

Solid solutions of copper, zinc, cadmium, gallium, indium, germanium and tin in
silver, copper and gold, and of copper in silver; in these systems the V_e effect
tends to produce a large a.a.d.

The data for these systems are shown in table 5. For the solid solutions of zinc,
gallium and germanium in copper, silver or gold, the V_e effect leads us to expect a
large a.a.d., the expansion becoming greater with increasing valency. We can
therefore understand why the a.a.d. values for a series such as (Cu→Zn→Ga→Ge) when
dissolved in copper, silver or gold increase with the valency. Since the V_e of copper
(11.7) is smaller than that of silver and gold (17.0), the V_e effect will tend to produce
a greater expansion for solid solutions in the latter metals, and as will be seen from
table 5 the a.a.d. values of zinc, gallium and germanium from solid solutions in
silver and gold are uniformly greater than those from solid solutions in copper, so
that the general interpretation is confirmed. The same explanation may be offered
for the fact that when copper is dissolved in silver, the a.a.d. of copper (2.64 kX)
is greater than the a.d. in copper itself (2.55 kX).

* The lithium used contained sodium and potassium, which may cause an expansion of the
magnesium lattice. Further, any oxidation of the specimens would tend to give too large a
lattice spacing, and no precautions against this source of error are described.

† In zinc the main B.z. overlap is in the c direction, and there is a second and smaller
overlap in the basal plane. The main overlap expands the crystal in the direction of the c-axis,
and the elastic constants are such that this expansion tends to contract the distances in the
basal plane, by an effect analogous to that of the Poisson ratio. The value a = 2.66 kX is thus
probably smaller than if no B.z. overlaps had occurred.


Table 5

solute      solvent     effect of                 A.A.D.                A.D.
and V_e     and V_e     ionic radius

Cu 11.7     Ag 17.0     contract                  2.64                  2.55
Cu 11.7     Au 17.0     contract                  2.63                  2.55
Zn 6.7      Cu 11.7     contract                  2.70                  a = 2.66, d = 2.91
Zn 6.7      Ag 17.0     contract                  2.75                  a = 2.66, d = 2.91
Zn 6.7      Au 17.0     contract                  2.74                  a = 2.66, d = 2.91
Ga c. 6     Cu 11.7     contract                  2.75                  2.44 to 2.79
Ga c. 6     Ag 17.0     contract                  2.835                 2.44 to 2.79
Ga c. 6     Au 17.0     contract                  2.84                  2.44 to 2.79
Ga c. 6     Mg 11.6     very slight               2.83                  2.44 to 2.79
Ge 3.6      Cu 11.7     contract                  2.78                  2.44
Ge 3.6      Ag 17.0     contract                  2.90                  2.44
Ge 3.6      Au 17.0     contract                  2.93                  2.44
Cd 9.0      Cu 11.7     large expansion:          3.10                  a = 2.97, d = 3.29
                        ionic overlap
Cd 9.0      Ag 17.0     contract                  3.04                  a = 2.97, d = 3.29
Cd 9.0      Au 17.0     contract                  3.00                  a = 2.97, d = 3.29
In c. 8     Cu 11.7     expand: ionic overlap     3.21                  3.24, 3.37
In 8        Ag 17.0     contract                  3.11                  see p. 21
In 8        Au 17.0     contract                  3.07                  see p. 21
In 8        Mg 11.6     expand                    a = 3.08, d = 3.10
Sn 5.5      Cu 11.7     expand: ionic overlap     3.27                  2.80 grey
Sn 5.5      Ag 17.0     contract                  3.18                  3.02, 3.18 white
Sn 5.5      Au 17.0     contract                  3.14                  -

Gallium has abnormal structures with distances from 2.44 to 2.79 kX.
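
The two trends claimed for zinc, gallium and germanium can be checked directly against the a.a.d. values of table 5. The following Python sketch does so; the dictionary layout and function name are illustrative only, and the numbers are those transcribed from the table.

```python
# a.a.d. values (kX) transcribed from table 5, keyed by solute then solvent.
AAD = {
    "Zn": {"Cu": 2.70, "Ag": 2.75, "Au": 2.74},
    "Ga": {"Cu": 2.75, "Ag": 2.835, "Au": 2.84},
    "Ge": {"Cu": 2.78, "Ag": 2.90, "Au": 2.93},
}
SOLVENTS = ("Cu", "Ag", "Au")
SERIES = ("Zn", "Ga", "Ge")  # valencies 2, 3 and 4

def larger_in_silver_and_gold(solute):
    """True if the a.a.d. in silver and in gold both exceed that in copper."""
    values = AAD[solute]
    return values["Ag"] > values["Cu"] and values["Au"] > values["Cu"]

for solute in SERIES:
    print(solute, "larger in Ag and Au than in Cu:", larger_in_silver_and_gold(solute))

# Increase of the a.a.d. with valency along Zn -> Ga -> Ge in each solvent.
for solvent in SOLVENTS:
    values = [AAD[s][solvent] for s in SERIES]
    print(solvent, values, "increasing:", values == sorted(values))
```

Both checks hold for every entry, which is the sense in which the table is said to confirm the general interpretation.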


Comparison with the interatomic distances in crystals of zinc, gallium and
germanium is not so simple. The a.a.d. values of zinc when dissolved in copper,
silver and gold are smaller than the distance of approach d = 2.91 kX in the crystal




of zinc, and this can be understood because the latter involves the main B.z. overlap.
The value a = 2.66 kX for the closest distance of approach in the basal plane is
affected by the curious elastic properties of the zinc crystal, and can hardly be
compared directly with any of the other values.*

In the complicated crystal structure of gallium, the distances of approach vary
from 2.44 to 2.79 kX, and the V_e effect has produced the expected relatively large
a.a.d. values. In germanium, the crystal structure is of the diamond type with
a.d. 2.44 kX. The a.a.d. values for germanium when dissolved in copper, silver and
gold vary from 2.78 to 2.93 kX, and are thus larger than the a.d. by more than the
12 %, which Goldschmidt postulated on the effect of a change in co-ordination
number 12 → 4. The V_e principle is thus strikingly confirmed, particularly when it is
remembered that the a.a.d. values from solutions in univalent solvents are not
affected by B.z. overlaps, whereas the a.d. value of 2.44 kX for germanium includes
the effect of B.z. overlapping.

For the solid solutions of silver, cadmium, indium and tin in copper, silver and
gold, the effect of ionic radius is now different. The ions of cadmium, indium and tin
are smaller than those of silver and gold, just as the ions of zinc, gallium and
germanium are smaller than those of copper. The ions of cadmium, indium and tin are,
however, larger than, or at least equal† to, the ions of copper, so that the formation
of a solid solution in copper results in the insertion of a larger ion into an already full
structure, and we suggest that it is for this reason that the a.a.d. values of cadmium,
indium and tin, when dissolved in copper, are larger than when dissolved in silver
or gold; the effect is the same as that for silver when dissolved in copper (see p. 18).
If this interpretation is correct, the system silver-copper is of interest in that the
large a.a.d. value of copper in silver is the result of the V_e effect, whilst the large
a.a.d. value of silver in copper is due to the effect of ionic radius.

The relations between the a.a.d. values for cadmium in the univalent solvents,
and the a.d. values in metallic cadmium are analogous to those described above
for zinc. In the case of indium, the a.d. values (d₁ = 3.24, d₂ = 3.37 kX) in the
element are slightly greater than the a.a.d. values in the univalent solvents, and for
solution in divalent magnesium. According to Raynor (1940a) this is due to the
existence of incomplete ionization in the element. It was not, however, clear why
the a.a.d. values of indium from solutions in univalent silver and gold (3.11 and
3.07 kX) were as large as the a.a.d. value (a = 3.08, d = 3.10) for solution in
magnesium, for the latter refers to a position shortly after a B.z. overlap, whereas the
former does not. The data in table 5 show that the V_e effect is greater for the solutions
in silver and gold, and so the large a.a.d. values are explained. The same explanation
accounts for the fact that the a.a.d. value of gallium dissolved in silver or gold is as
large as the a.a.d. values for the solution in magnesium. But when gallium is
dissolved in copper (V_e 11.7) the solvent has almost exactly the same V_e as magnesium

* See footnote on p. 19. 

† The solid solubilities of indium and tin in copper indicate that the effect of the ionic
overlap is considerable (see Hume-Rothery 1946).




(V_e 11.6), and consequently the V_e effects are the same, and the a.a.d. for the solution
in magnesium (2.83 kX) is larger than that for the solution in copper (2.75 kX)
because of the effect of the B.z. overlap in magnesium.*

When tin is dissolved in copper, silver or gold the V_e effect indicates an expansion,
and the larger a.a.d. in copper is the result of the ionic radius of tin being of the same
order as that of copper, but smaller than those of silver and gold. The a.a.d. values
for the solution in silver and gold are considerably greater than the a.d. (2.80 kX) in
grey tin, and slightly greater than this value corrected for the Goldschmidt
co-ordination effect.

Table 6

solute      solvent     effect of                 A.A.D.                A.D.
and V_e     and V_e     ionic radius

Mg 11.6     Ag 17.0     contract                  2.94                  a = 3.20, d = 3.19
Al 5.5      Cu 11.7     contract                  2.71                  2.86
Al 5.5      Ag 17.0     contract                  2.79                  2.86
Al 5.5      Au 17.0     contract                  2.80                  2.86
Al 5.5      Mg 11.6     contract                  a = 2.81, d = 2.85    2.86
Si 3.2      Cu 11.7     contract                  2.60                  2.35
Si 3.2      Al 5.5      contract                  2.73                  2.35
Ge 3.6      Al 5.5      ?                         2.97                  2.44


Solid solutions of magnesium, aluminium and silicon not previously discussed

The data for these systems are shown in table 6, and the following interpretation
may be offered.

When magnesium (V_e = 11.6) is dissolved in silver (V_e = 17.0) the V_e effect indicates
an expansion, which will be opposed by the relatively strong electrochemical factor,
and by the effect of ionic radius. The resulting a.a.d. refers to a solid solution in
which there is no B.z. overlap, and we can understand why the resulting value is
smaller than the interatomic distances a = 3.20 kX, d = 3.19 kX in pure
magnesium.

When aluminium is dissolved in copper, silver or gold, the V_e effect indicates an
expansion which is greater for the solutions in silver and gold. We suggest that it is
for this reason that the a.a.d. values in silver and gold (2.79 and 2.80 kX) are greater
than that for aluminium in copper (2.71 kX). The ionic radius effect indicates an
opposing tendency which is greater for the solutions in silver and gold, but since
this is a contraction it is a relatively small effect. All these figures refer to a.a.d.
values for a univalent solvent in which B.z. complications are absent, and as pointed
out by Hume-Rothery & Raynor (1940), it is for this reason that they are smaller
than the a.d. of aluminium itself (2.86 kX) in which a B.z. overlap has occurred.

* As explained above this condition of affairs does not apply to solutions of indium in
copper and magnesium, because the solution in copper involves a greater increased ionic
overlap.




When aluminium is dissolved in magnesium, the V_e effect indicates an expansion,
and the ionic radius effect a slight contraction. The solution of aluminium in
magnesium results in an increase in the electron concentration shortly after a B.z.
overlap, and we can understand why the a.a.d. values of a = 2.81, d = 2.85 are
greater than the a.a.d. values in copper, silver and gold. Comparison with the a.d.
in aluminium itself is clearly not simple, because both aluminium and magnesium
involve B.z. overlaps.

For the solid solutions of silicon in copper and aluminium the V_e effect indicates
an expansion, which is opposed by the relatively small effect of ionic radius. The
a.a.d. of silicon dissolved in copper is free from B.z. overlap effects, and it may be
for this reason that the value (2.60 kX) is smaller than the corresponding value
(2.73 kX) for the solution in aluminium where the solvent has already been expanded
by one B.z. overlap. Both these values are much greater than the a.d. of silicon
(2.35 kX) which refers to a crystal structure of co-ordination number 4. If this is
increased to allow for the Goldschmidt change in co-ordination number, the resulting
value is 2.66 kX. The a.a.d. value for the solution in aluminium is greater than this,
so that a very considerable expansion has occurred. The a.a.d. value from the
solution in copper is 0.06 kX less than the Goldschmidt value, and this may be
because the value 2.35 kX for elementary silicon is itself expanded by a B.z.
overlap.*

When germanium is dissolved in aluminium the V_e effect produces an expansion,
and the a.a.d. (2.97 kX) is greater than the a.d. (2.44 kX) and greater than the
latter corrected by the Goldschmidt co-ordination factor. In this case, as with
silicon, both aluminium and the a.d. of the solute involve B.z. overlaps, and we can
understand why the a.a.d. is large, and why it is larger than the corresponding values
for solid solutions in copper, silver and gold which were dealt with above.


5. Conclusion 

As emphasized before, no claim is made that the exact a.a.d. values can be
predicted, but it does appear that the values from the different solid solutions given in
table 2 can be understood satisfactorily by consideration of the four factors outlined
in subsection (c) above. In particular, the influence of what we have called the V_e
effect is very striking, and the systems in which it is not the controlling factor are
clearly those in which overlapping ions, or Brillouin zone effects, are present.

The authors must express their thanks to Professor C. N. Hinshelwood for 
laboratory accommodation, and many other facilities which have greatly 
encouraged the present research. One of the authors (W.H.-R.) acknowledges his 
gratitude to the Royal Society for election to a Warren Research Fellowship, and 
for a grant towards the expenses of research work. 


* Since silicon contains 4 valency electrons per atom there must be at least one B.z. overlap. 



References 

Bardeen, J. 1938 J. Chem. Phys. 6, 367. 
Brick, R. M., Phillips, A. & Smith, C. S. 1935 Tech. Publ. Amer. Inst. Min. Engrs, no. 660. 
Goldschmidt, V. M. 1928 Z. phys. Chem. 133, 397. 
Goldschmidt, V. M. 1929 Trans. Faraday Soc. 25, 263. 
Hofmann, W. 1936 Z. Metallkunde, 28, 160. 
Huang, K. 1947 Proc. Roy. Soc. A, 190, 102. 
Hume-Rothery, W. 1946 Atomic theory for students of metallurgy. Inst. Metals. 
Hume-Rothery, W., Lewin, G. F. & Reynolds, P. W. 1936 Proc. Roy. Soc. A, 157, 167. 
Hume-Rothery, W. & Raynor, G. V. 1938 Phil. Mag. 26, 136. 
Hume-Rothery, W. & Raynor, G. V. 1940a Proc. Roy. Soc. A, 174, 457. 
Hume-Rothery, W. & Raynor, G. V. 1940b Proc. Roy. Soc. A, 174, 471. 
Hume-Rothery, W. & Raynor, G. V. 1940c Proc. Roy. Soc. A, 177, 27. 
Hume-Rothery, W. & Reynolds, P. W. 1936 J. Inst. Met. 3, 541. 
Jette & Foote 1935 J. Chem. Phys. 3, 605. 
Jones, H. 1934 Proc. Roy. Soc. A, 147, 390. 
Mott, N. F. & Jones, H. 1937 Proc. Roy. Soc. A, 162, 49. 
Nelson, J. B. & Riley, D. P. 1945 Proc. Phys. Soc. 62, 100. 
Owen, E. A. & Roberts, E. W. 1939 Phil. Mag. 27, 294. 
Owen, E. A. & Roberts, E. A. O'D. 1945 J. Inst. Met. 71, 213. 
Raynor, G. V. 1940 Proc. Roy. Soc. A, 174, 457. 
Raynor, G. V. 1942 Proc. Roy. Soc. A, 180, 107. 
Vegard, L. 1921 Z. Phys. 5, 17. 
Vegard, L. 1928 Z. Kristallogr. 67, 239. 
Vosskuhler 1937 Metallwirtschaft, 16, 907. 
Wilson, A. J. C. 1941 Proc. Phys. Soc. 53, 235. 
Zintl, E. & Woltersdorf, G. 1935 Z. Electrochem. 41, 870. 



Matrices of integers associated with self-transformations of surfaces 

By T. G. Room, F.R.S., University of Sydney 

(Received 15 September 1947) 


Let $c = (c_1, \dots, c_r)$ be a set of curves forming a minimum base on a surface, which, under a 
self-transformation, $\mathcal{T}$, of the surface, transforms into a set $\mathcal{T}c$ expressible by the 
equivalences $\mathcal{T}c \equiv Tc$, where T is a square matrix of integers. Further, let the numbers of 
common points of pairs of the curves, $c_i$, $c_j$, be written as a symmetrical square matrix $\Gamma$. 
Then the matrix T satisfies the equation $T\Gamma T' = \Gamma$. 

The significance of solutions of this equation for a given matrix $\Gamma$ is discussed, and the 
following special surfaces are investigated: 

§§4-7. Surfaces, in particular quartic surfaces, with only two base curves. Self-transformations 
of these depend on the solutions of the Pell equation $u^2 - kv^2 = 1$ (or 4). 

§8. The quartic surface specialized only by being made to contain a twisted cubic curve. 
This surface has an involutory transformation determined by chords of the cubic, and 
has only one other rational curve on it, namely, the transform of the cubic. The appropriate 
Pell equation is $u^2 - 17v^2 = 4$. 

§9. The quartic surface specialized only by being made to contain a line and a rational 
curve of order m to which the line is (m-1)-secant (for m = 1 the surface is made to contain 
two skew lines). The surface has two infinite sequences of self-transformations, expressible 
in terms of two transformations, namely, a sequence of involutory transformations 
and a sequence of non-involutory transformations. 


1. Curves on a surface with self-transformations 


For this investigation we suppose that a surface A has a birational self-transformation, 
$\mathcal{T}$, in which there are no singular elements on A. Much of the work could be 
applied to more general transformations, but the statement of the results is much 
simplified by making this assumption. 

If $\{a, b\}$ denotes the number of points common to two curves a, b on A, and $\mathcal{T}a$ is 
the transform of a, then 
$$\{\mathcal{T}a, \mathcal{T}b\} = \{a, b\}.$$


Let the set of r curves $c_1, \dots, c_r$ on A form a minimum base in terms of which all 
other curves may be expressed by linear equivalences. Then, in particular, the 
curves $\mathcal{T}c_i$ may be so expressed, say 
$$\mathcal{T}c_i \equiv \sum_j t_{ij}c_j, \tag{1.1}$$
where the $t_{ij}$ form a square matrix T of integers (positive, negative, or zero). 
For the inverse transformation, $\mathcal{T}^{-1}$, we shall have also 
$$\mathcal{T}^{-1}c_k \equiv \sum_j \tau_{kj}c_j,$$
where 
$$\sum_j t_{ij}\tau_{jk} = \delta_{ik},$$
so that the matrix T is non-singular. If we regard the curves $c_i$ collectively as the 
elements of a column vector c, we may write (1.1) in the form 
$$(\mathcal{T}c) \equiv Tc. \tag{1.2}$$



Consider next the symmetrical matrix $\Gamma$ formed by the numbers of intersections 
of pairs of the curves $c_i$, including either the number of intersections of each curve 
with another curve of the linear family defined by it, or, if the curve is unique, the 
number of virtual self-intersections, i.e. 
$$\Gamma = (\gamma_{ij}) = (\{c_i, c_j\}). \tag{1.3}$$
(We assume $\Gamma$ to be non-singular.) Then 
$$(\{\mathcal{T}c_i, c_k\}) = (\{\textstyle\sum_j t_{ij}c_j, c_k\}) = (\textstyle\sum_j t_{ij}\gamma_{jk}) = T\Gamma$$
and 
$$(\{\mathcal{T}c_i, c_k\}) = (\{c_i, \mathcal{T}^{-1}c_k\}) = (\textstyle\sum_j \gamma_{ij}\tau_{kj}) = \Gamma(T')^{-1},$$
so that 
$$T\Gamma T' = \Gamma. \tag{1.4}$$

Thus 

1.1. If $c = (c_i)$ is any minimum set of base curves on a surface $\Omega$, and $\Gamma = (\{c_i, c_j\})$ 
is the matrix formed from the numbers of intersections of pairs of the curves, and $\Omega$ has 
a self-transformation $\mathcal{T}$ without singular elements on $\Omega$, under which $(\mathcal{T}c) \equiv Tc$, then 
the matrix T satisfies the equation 
$$T\Gamma T' = \Gamma.$$
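
Theorem 1.1 lends itself to a direct numerical check. The following is a minimal sketch in Python, not part of the original paper: the helper names are illustrative only, and the sample matrices are the intersection matrix and the involution matrix that §8 associates with the quartic surface containing a twisted cubic. 

```python
def matmul(A, B):
    """Product of two matrices given as lists of rows."""
    return [[sum(a * b for a, b in zip(row, col)) for col in zip(*B)] for row in A]

def transpose(A):
    """Transpose of a matrix given as a list of rows."""
    return [list(col) for col in zip(*A)]

def satisfies_1_4(T, Gamma):
    """True when T * Gamma * T' equals Gamma, i.e. equation (1.4) holds."""
    return matmul(matmul(T, Gamma), transpose(T)) == Gamma

# Sample data assumed from section 8: plane section and twisted cubic.
Gamma = [[4, 3], [3, -2]]
P = [[9, -5], [16, -9]]
print(satisfies_1_4(P, Gamma))
```

The check uses exact integer arithmetic throughout, which is all that equation (1.4) requires for matrices of integers. 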

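Now let the base c be replaced by another base d given by 
$$d = Hc,$$
then 
$$\Delta = (\{d_i, d_j\}) = (\{\textstyle\sum_k h_{ik}c_k, \sum_l h_{jl}c_l\}) = H\Gamma H', \tag{1.5}$$
and if, referred to this base, the transformation $\mathcal{T}$ corresponds to the equivalences 
$$(\mathcal{T}d) \equiv Ud,$$
then 
$$\mathcal{T}Hc \equiv H\mathcal{T}c \equiv HTc \equiv UHc,$$
so that 
$$T = H^{-1}UH. \tag{1.6}$$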

As an immediate consequence of equation (1.4) we have 

1.2. $|T| = \pm 1$, 

but this property of T is only a part of the following more general property: 

1.3. In the characteristic equation of T, namely, 
$$|T - Ix| \equiv x^r + k_1x^{r-1} + \dots + k_r = 0,$$
the coefficients satisfy the conditions 
$$(1, k_1, \dots, k_r) = \pm(k_r, \dots, k_1, 1).$$

That is, if $\xi$ is any root of the characteristic equation, so also is $\xi^{-1}$ a root. For 
$$\Gamma(T' - Ix)\Gamma^{-1} = \Gamma T'\Gamma^{-1} - Ix = T^{-1} - Ix \ \text{(from (1.4))} \ = -T^{-1}(Tx - I),$$
so that 
$$|T - Ix| \equiv |T' - Ix| = |\Gamma(T' - Ix)\Gamma^{-1}| \equiv |-T^{-1}|\,|Tx - I| \equiv \pm x^r\,|T - Ix^{-1}|.$$

Let $\dots, \mathcal{T}^{-1}a, a, \mathcal{T}a, \mathcal{T}^2a, \dots$ be the sequence of curves on $\Omega$ determined from a 
curve a by repeated applications of the transformation $\mathcal{T}$, and let a itself be given 
by the equivalence 
$$a \equiv \alpha'c,$$
$\alpha'$ being a row vector of integers (positive, negative, or zero). Then 
$$a_1 = \mathcal{T}a \equiv \mathcal{T}(\alpha'c) \equiv \alpha'Tc,$$
$$a_2 = \mathcal{T}^2a \equiv \mathcal{T}^2(\alpha'c) \equiv \alpha'T^2c, \ \text{etc.}$$

From the characteristic equation of T, 
$$x^r + k_1x^{r-1} + \dots + k_r = 0, \tag{1.7}$$
we derive, for any integral value of s, the matrix identity 
$$T^{s+r} + k_1T^{s+r-1} + \dots + k_rT^s = 0,$$
and thence the general algebraic equivalence relation 
$$\alpha'(T^{s+r} + k_1T^{s+r-1} + \dots + k_rT^s)c \equiv 0$$
connecting the curves $\mathcal{T}^sa$. That is, 

1.4. The curves in any succession of r + 1 in the sequence $a_n = \mathcal{T}^na$, derived from 
a curve a by repeated applications of $\mathcal{T}$, are connected by the equivalence relation 
$$a_{s+r} + k_1a_{s+r-1} + \dots + k_ra_s \equiv 0. \tag{1.8}$$

A recurrence relation with these coefficients connects all sequences of numbers 
associated with the curves (in particular, their orders), but it is not necessarily the 
simplest such relation (see, for example, equation (9.2)). 
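
As an illustration of theorem 1.4 for r = 2, the sketch below (Python, not part of the original paper) forms the orders of the curves $a_n \equiv \alpha'T^nc$ and tests the recurrence on them. The matrix used is an assumption: it is the solution obtained from the formula of §8 with the pair (66, 16) for the base consisting of a plane section (order 4) and a twisted cubic (order 3). The function names are hypothetical. 

```python
def char_coeffs_2x2(T):
    """Coefficients (k1, k2) of x^2 + k1*x + k2 = |xI - T| for a 2x2 matrix."""
    k1 = -(T[0][0] + T[1][1])
    k2 = T[0][0] * T[1][1] - T[0][1] * T[1][0]
    return k1, k2

def orders(alpha, T, m, count):
    """Orders of a_n = alpha' T^n c for n = 0..count-1; m holds the orders of the base curves."""
    out, row = [], list(alpha)
    for _ in range(count):
        out.append(sum(r * mi for r, mi in zip(row, m)))
        # advance the row vector: alpha' T^n -> alpha' T^(n+1)
        row = [sum(row[i] * T[i][j] for i in range(len(row))) for j in range(len(row))]
    return out

T = [[9, 32], [16, 57]]          # assumed sample: section 8 formula with (66, 16)
k1, k2 = char_coeffs_2x2(T)
seq = orders([0, 1], T, [4, 3], 6)  # start from the cubic c_2 itself
residuals = [seq[s + 2] + k1 * seq[s + 1] + k2 * seq[s] for s in range(len(seq) - 2)]
print(k1, k2, seq[:3], residuals)
```

Every residual vanishes, as the Cayley-Hamilton identity behind (1.8) requires; the same loop applies to any other number attached linearly to the curves. 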

Suppose now that $\nu = (\nu_i) = (\{a, c_i\})$, 
then 
$$(\mathcal{T}\nu) = (\{\mathcal{T}a, c_i\}) = (\{a, \mathcal{T}^{-1}c_i\}) = T^{-1}\nu. \tag{1.9}$$

In the problem as usually presented, the transformation $\mathcal{T}$ is defined geometrically, 
and the matrix T corresponding to it and associated with some set of base curves, 
$c_i$, has to be determined. In these circumstances, as the examples in §§8 and 9 will 
show, it is simpler to calculate the numbers $\mathcal{T}\nu_i$ from the numbers $\nu_i$ for a general 
curve, than it is to calculate T directly from the transforms of the curves $c_i$. 

The recurrence relation set out in theorem 1.4 has already been discussed in a 
rather different form (Room (unpublished), part 4, appendix II), and some questions 
have been raised of its significance, i.e. of the extent to which it characterizes the 
surface $\Omega$ and the self-transformation $\mathcal{T}$. Results obtained in this paper tend 
towards the conclusion that at best the characterization is very weak. Thus in §9 





we find two birationally distinct quartic surfaces, with distinct self-transformations, 
both yielding the recurrence relation 

Moreover, even T itself is not characteristic, since in § 8 we find, when the base 
curves are suitably selected, the same matrix corresponding to the simple self- 
transformation of a quadric by lines through a point not on it, and to an involutory 
self-transformation of a quartic surface on which there is a twisted cubic. 

On the other hand, for transformations of the same surface it is clear that: 

1.5. If, on a (regular, normal) surface with a given set of base curves, the matrices 
corresponding to two transformations are identical, then either the transformations 
themselves are identical, or one is the product of the other with a collineation. 

2. An alternative approach 

In this section we consider first the case of curves on a non-singular quartic surface, 
and show how the relation (1.4) may be obtained from the equations that express 
that the genera of curves are invariant under $\mathcal{T}$. We use the following results for 
(non-singular) curves on a (non-singular) quartic surface. 

(i) A curve c of genus p has $2p - 2$ self-intersections, i.e. intersections with 
another curve of the linear family it defines if $p \geq 1$, and virtual self-intersections 
if $p = 0$. This is equivalent to 

(ii) If c is of genus p, then curves equivalent to 2c are of genus $4p - 3$, so that 
curves equivalent to $\alpha c$ are of genus $\alpha^2(p - 1) + 1$. 

Thus if $c_1$, $c_2$, of genera $p_1$, $p_2$, have $\gamma_{12}$ common points, the genus of curves 
equivalent to $\alpha_1c_1 + \alpha_2c_2$ is 
$$\pi(\alpha_1c_1 + \alpha_2c_2) = \alpha_1^2(p_1 - 1) + \alpha_2^2(p_2 - 1) + \alpha_1\alpha_2\gamma_{12} + 1.$$

Write now $\gamma_{ii}$ for the number of self-intersections of $c_i$, so that 
$$\gamma_{ii} = 2(p_i - 1),$$
then 
$$\pi(\alpha_1c_1 + \alpha_2c_2) = \tfrac12(\gamma_{11}\alpha_1^2 + 2\gamma_{12}\alpha_1\alpha_2 + \gamma_{22}\alpha_2^2) + 1.$$

Now take r curves $c_1, \dots, c_r$ (or c) with intersection-matrix $(\{c_i, c_j\}) = (\gamma_{ij}) = \Gamma$, 
the genus of $c_i$ being $\tfrac12\gamma_{ii} + 1$, and an arbitrary row vector, $\alpha'$, of r integers, then the 
genus of curves equivalent to $\alpha'c$ is 
$$\pi(\alpha'c) = \tfrac12\alpha'\Gamma\alpha + 1.$$
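
The genus formula just obtained can be evaluated mechanically. The sketch below (Python, not part of the original paper) does so for an integer row vector; the function name is illustrative, and the sample intersection matrix is an assumption taken from §8 (plane section and twisted cubic), where the values 37 and 10 are quoted for the curves $12c_1 - 6c_2$ and its half. 

```python
def genus(alpha, Gamma):
    """Genus of curves equivalent to alpha'c on a quartic surface: (alpha' Gamma alpha)/2 + 1."""
    n = len(alpha)
    quad = sum(alpha[i] * Gamma[i][j] * alpha[j] for i in range(n) for j in range(n))
    # On a quartic surface every self-intersection number is even (gamma_ii = 2p_i - 2).
    assert quad % 2 == 0, "alpha' Gamma alpha should be even"
    return quad // 2 + 1

Gamma = [[4, 3], [3, -2]]   # assumed sample: plane section and twisted cubic
print(genus([1, 0], Gamma), genus([0, 1], Gamma), genus([12, -6], Gamma), genus([6, -3], Gamma))
```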

Finally, let the curves c form a minimum base on the surface and let them be 
transformed into curves $\mathcal{T}c$ equivalent to $Tc$ under the self-transformation $\mathcal{T}$ 
of the surface, then 
$$\mathcal{T}\alpha'c \equiv \alpha'Tc,$$
$$\pi(\alpha'c) = \tfrac12\alpha'\Gamma\alpha + 1,$$
$$\pi(\mathcal{T}\alpha'c) = \tfrac12(\alpha'T)\Gamma(\alpha'T)' + 1 = \tfrac12\alpha'(T\Gamma T')\alpha + 1.$$
Thus for all values of $\alpha'$ we have 
$$\alpha'\Gamma\alpha = \alpha'(T\Gamma T')\alpha,$$
and thus 

2.1. If a non-singular quartic surface has a self-transformation $\mathcal{T}$, and a minimum 
base c such that $(\mathcal{T}c) \equiv Tc$, and $(\{c_i, c_j\}) = \Gamma$, the condition that expresses that all pairs 
of corresponding curves $\alpha'c$, $\mathcal{T}\alpha'c$ have the same genus is 
$$T\Gamma T' = \Gamma.$$

Consider next how this may be applied to a non-singular surface of order n. 
Statements (i) and (ii) are replaced by (i) a curve c of order m and genus p has 
$2p - 2 - m(n - 4)$ self-intersections, and (ii) curves equivalent to $\alpha c$ have genus 
$$\alpha^2(p - 1) - \tfrac12\alpha(\alpha - 1)m(n - 4) + 1.$$
So that, writing $\gamma_{ii}$ for the number of self-intersections of $c_i$, i.e. 
$$\gamma_{ii} = 2(p_i - 1) - m_i(n - 4),$$
we find that the genus of curves equivalent to $\alpha'c$ is 
$$\pi(\alpha'c) = \tfrac12\alpha'\Gamma\alpha + \tfrac12(n - 4)\alpha'm + 1.$$
If there is a self-transformation $\mathcal{T}$, and c is a minimum base 
$$\pi(\mathcal{T}\alpha'c) = \pi(\alpha'Tc) = \tfrac12\alpha'T\Gamma T'\alpha + \tfrac12(n - 4)\alpha'Tm + 1.$$
But 
$$\pi(\alpha'c) = \pi(\mathcal{T}\alpha'c),$$
and 
$$\Gamma = T\Gamma T', \tag{1.4}$$
so that 
$$\alpha'm = \alpha'Tm,$$
i.e. the orders of all curves on the surface remain unchanged under the transformation, 
thus 

2.2. The only self-transformations of non-singular surfaces of orders greater than 
four are collineations. 

3. Curves on a quartic surface 

Let $\pi$ be a non-singular quartic surface with a minimum base c containing r 
curves, of which the intersection matrix is $\Gamma$. We wish to find what geometric 
meaning may be assigned to solutions in integers of the matrix equation 
$$X\Gamma X' = \Gamma. \tag{1.4}$$

Two elementary properties may be set down at once: 

3.1. If X and Y both satisfy equation (1.4), then their product, XY, also satisfies it. 

In particular, $X^n$, for all integral values of n, satisfies the equation. Secondly, 
since $|X| = \pm 1$ (theorem 1.2), it follows that $X^{-1}$ is also a matrix of integers, so that 

3.2. If c is a minimum base, and X is a solution of (1.4), then Xc is also a minimum 
base, all the curves in which have the same genera as the corresponding curves in c. 

Although in general terms it may be said that X corresponds to a transformation 
of $\pi$ into another quartic surface of the same specification (i.e. having on it the same 
systems of curves as $\pi$), it does not follow that it corresponds to a birational 
transformation of $\pi$ into itself. For example, the matrix X = 



corresponds to a 


transformation of one general determinantal quartic surface into another, but not 
to a transformation of the surface into itself. It happens in this case that $X^3$ does 
correspond to a self-transformation of the surface (see §6 (ii)). 

We may draw two negative conclusions: 

(i) If (1.4) has no solutions in integers, $\pi$ can have no self-transformations (other 
than collineations, since X = I always satisfies the equation), nor can there be 
transformations other than collineations of $\pi$ into other quartic surfaces. 

(ii) If all the curves of c are proper irreducible curves, and a solution of (1.4) is 
such that there is no family of proper irreducible curves equivalent to any one of 
the curves Xc, then there is no self-transformation of the surface. 

Take a particular curve of the set c, say $c_1$. If a curve $\sum_i x_{1i}c_i$ has the same genus 
as it, then 
$$\sum_{i,j}\gamma_{ij}x_{1i}x_{1j} = \gamma_{11}.$$
This is the first equation of the set (1.4). A solution of this in integers will not 
necessarily lead to solutions in integers of the whole set (1.4), so that not all curves of 
the same genus as $c_1$ are necessarily given by solutions of (1.4). 


4. The binary quadratic form and the Pell equation* 


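Let 
$$\Gamma = \begin{pmatrix} a & h \\ h & b \end{pmatrix}, \tag{4.1}$$
where a, b, 2h have no common factors, 
$$d = h^2 - ab > 0, \tag{4.2}$$
and d is not a square. Also let $(\xi, \xi')$ be the least positive integers other than (1,0) 
satisfying the Pell equation 
$$x^2 - dx'^2 = 1. \tag{4.3}$$
Then one solution of the matrix equation 
$$X\Gamma X' = \Gamma \tag{1.4}$$
is 
$$X_0 = \begin{pmatrix} \xi - h\xi' & a\xi' \\ -b\xi' & \xi + h\xi' \end{pmatrix}, \tag{4.4}$$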

and all other solutions for which $|X| = +1$ are positive or negative integral powers 
of $X_0$, namely,† 
$$\pm X_0^r = \pm\frac{1}{2\sqrt d}\left\{(\xi + \xi'\sqrt d)^r\begin{pmatrix}\sqrt d - h & a \\ -b & \sqrt d + h\end{pmatrix} + (\xi - \xi'\sqrt d)^r\begin{pmatrix}\sqrt d + h & -a \\ b & \sqrt d - h\end{pmatrix}\right\}. \tag{4.5}$$

* See, for example, Landau (1927), part I, chapter 7, and part IV, chapter 4. The writer 
is indebted to Mr W. B. Smith-White of the University of Sydney for pointing out the relation 
of the present investigation to results for the binary quadratic form, and for suggestions in 
regard to the Pell equation. 

† There are in effect four forms of each solution, corresponding to the selections of sign in 
$(\pm\xi, \pm\xi')$. All are contained in (4.5). Thus since $(\xi + \xi'\sqrt d)^{-1} = (\xi - \xi'\sqrt d)$, $X_0^{-1}$ is the solution 
$X_0$ with $(\xi, \xi')$ replaced by $(\xi, -\xi')$. 



Moreover, all solutions of 
$$ax^2 + 2hxx' + bx'^2 = a \ (\text{or } b)$$
are included in this form. 

Now, let U be any solution of (1.4) for which $|U| = -1$, then from the equations 
leading to theorem 1.3, namely, 
$$|U|\,|U - Ix| \equiv |Ux - I|,$$
it follows that, for matrices of order 2, 
$$U^2 = I. \tag{4.6}$$

If U, V, with $|U| = -1$, $|V| = -1$, are two solutions of (1.4), then UV is a solution 
with $|UV| = +1$, and thus for some value of r 
$$UV = \pm X_0^r,$$
so that all solutions with negative determinant are of the form $\pm UX_0^r$, where U is 
any one such solution. 

Next suppose that $\Gamma$ is such that the highest common factor of a, b, 2h is 2, that 
$$d = h^2 - ab > 0,$$
and is not a square, and that $(\eta, \eta')$ is the least solution in positive integers, other 
than (2,0), of the modified Pell equation 
$$y^2 - dy'^2 = 4, \tag{4.7}$$
then 
$$Y_0 = \frac12\begin{pmatrix} \eta - h\eta' & a\eta' \\ -b\eta' & \eta + h\eta' \end{pmatrix} \tag{4.8}$$
is a solution of (1.4), and all other solutions with positive determinant are of the 
form $\pm Y_0^r$, and those with negative determinant are formed as before. 

If a, b, 2h have common factors other than 2, these factors must first be removed, 
and the resulting equation (1.4) treated in one of the two ways set out above. 

If $d = h^2 - ab$ is a square, there are no solutions other than X = I with positive 
determinant, and, in consequence, at most one solution with negative determinant. 
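
The construction of (4.4) can be sketched in a few lines of Python (not part of the original paper). The brute-force search for the least Pell solution and the function names are assumptions made for illustration; the sample values a = 2, h = 1, b = -1 are a hypothetical form with d = 3, chosen only because it meets the stated conditions. 

```python
from math import isqrt

def matmul(A, B):
    """Product of two matrices given as lists of rows."""
    return [[sum(a * b for a, b in zip(row, col)) for col in zip(*B)] for row in A]

def pell_least(d, rhs=1, limit=10**6):
    """Least (x, x') with x' >= 1 and x^2 - d*x'^2 = rhs, found by direct search."""
    for xp in range(1, limit):
        x2 = rhs + d * xp * xp
        x = isqrt(x2)
        if x * x == x2:
            return x, xp
    raise ValueError("no solution found below the search limit")

def x0(a, h, b):
    """Matrix (4.4) built from the least solution of x^2 - d*x'^2 = 1, d = h^2 - ab."""
    d = h * h - a * b
    xi, xip = pell_least(d)
    return [[xi - h * xip, a * xip], [-b * xip, xi + h * xip]]

a, h, b = 2, 1, -1                       # hypothetical sample form, d = 3
Gamma = [[a, h], [h, b]]
X0 = x0(a, h, b)
X0t = [list(col) for col in zip(*X0)]    # transpose of X0
print(X0, matmul(matmul(X0, Gamma), X0t) == Gamma)
```

A direct search is adequate for small d; for large d the continued-fraction expansion of $\sqrt d$ would be the usual replacement. 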


5. Quartic surfaces with a minimum base containing two curves 

Let a quartic surface $\pi$ be specialized only by being made to contain a 
non-singular curve of order m and genus p; we wish to find what families of curves lie 
on $\pi$ in consequence of this, and what self-transformations, if any, $\pi$ has. Take as 
base curves on $\pi$: 

$c_1$: a plane section, 

$c_2$: the curve of order m and genus p.* 

The intersection matrix is 
$$\Gamma = \begin{pmatrix} 4 & m \\ m & 2q \end{pmatrix},$$
where $q = p - 1$. 

* The assumption is made here, and in all corresponding places, that no further curves are 
necessary to constitute the base. 



We find first a solution U of the equation 
$$X\Gamma X' = \Gamma \tag{1.4}$$
for which $|U| = -1$. Rewrite (1.4) in the form 
$$U\Gamma = \Gamma U', \tag{5.2}$$
and let 
$$U = \begin{pmatrix} u & v \\ u' & v' \end{pmatrix},$$
where $uv' - u'v = -1$. 

Then, from (5.2), 
$$u' = \tfrac12(mu + qv), \quad v' = -u, \tag{5.3}$$
and from the first equation of the set (1.4), namely, 
$$4u^2 + 2muv + 2qv^2 = 4,$$
we find 
$$u = \tfrac14\{-mv \pm \sqrt{(kv^2 + 16)}\}, \tag{5.4}$$
where $k = m^2 - 8q$. 

A solution of (1.4) is 
$$U = \begin{pmatrix} \tfrac14(w - mv) & v \\ \tfrac18\{mw - (k + 4q)v\} & -\tfrac14(w - mv) \end{pmatrix}, \tag{5.5}$$
where (v, w) is a solution of the modified Pell equation 
$$w^2 - kv^2 = 16. \tag{5.6}$$

The solution has to be selected in such a way that all the terms in U are integers. 
If m is even 
$$W = \begin{pmatrix} 1 & 0 \\ \tfrac12m & -1 \end{pmatrix} \tag{5.7}$$
is a solution for all values of q. 
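
The selection of (v, w) so that every term of U is an integer can be tested directly. The following Python sketch (not part of the original paper) builds the matrix (5.5) with exact fractions; the function names are illustrative, and the two sample cases, (m, q) = (6, 2) and (m, q) = (1, -1), are assumptions chosen to match the determinantal surface of §6 and the surface with a line of §7. 

```python
from fractions import Fraction

def u_matrix(m, q, v, w):
    """Matrix (5.5) for a pair (v, w) satisfying w^2 - k*v^2 = 16, with k = m^2 - 8q."""
    k = m * m - 8 * q
    assert w * w - k * v * v == 16, "(v, w) must satisfy the modified Pell equation (5.6)"
    u = Fraction(w - m * v, 4)
    u_prime = Fraction(m * w - (k + 4 * q) * v, 8)
    return [[u, Fraction(v)], [u_prime, -u]]

def is_integral(M):
    """True when every entry of the matrix is a whole number."""
    return all(entry.denominator == 1 for row in M for entry in row)

def satisfies_1_4(U, m, q):
    """Check U * Gamma * U' = Gamma for Gamma = [[4, m], [m, 2q]]."""
    G = [[4, m], [m, 2 * q]]
    UG = [[sum(U[i][k] * G[k][j] for k in range(2)) for j in range(2)] for i in range(2)]
    UGUt = [[sum(UG[i][k] * U[j][k] for k in range(2)) for j in range(2)] for i in range(2)]
    return UGUt == G

W = u_matrix(6, 2, 0, 4)      # v = 0, w = 4 gives the matrix W of (5.7)
U1 = u_matrix(1, -1, 1, 5)    # k = 9, v = 1, w = 5
print(W, U1, is_integral(W), is_integral(U1))
```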

The solutions of (1.4) with positive determinant can all be reduced to the forms 
given in equations (4.4), (4.5), (4.8), with 
$$a = 4, \quad h = m, \quad b = 2q, \quad d = m^2 - 8q = k.$$

We have first to verify that k is positive. This is a special case of a theorem due to 
Hodge (1937), namely, 'there is exactly one positive term in the signature of the 
intersection matrix of a base for the algebraic curves on a surface.' More simply, 
we may in this case use the theorem: 'the curve of maximum genus for a given 
order on a non-singular surface is either a complete section, or residual to a partial 
plane section.' For a quartic surface and curves of order m we have 
$$\begin{array}{lll} m = 4r, & p_{\max} = 2r^2 + 1, & m^2 - 8(p_{\max} - 1) = 0, \\ m = 4r \pm 1, & p_{\max} = 2r^2 \pm r, & m^2 - 8(p_{\max} - 1) = 9, \\ m = 4r + 2, & p_{\max} = 2r^2 + 2r, & m^2 - 8(p_{\max} - 1) = 12. \end{array}$$

Thus: 

5.1. If an irreducible non-singular curve of order m on a quartic surface has genus 
q + 1, and is not a complete section then 
$$k = m^2 - 8q = 8r, \ \text{or } 8r + 1, \ \text{or } 8r + 4, \ \text{where } r \geq 1.$$



Consider first the cases in which 
$$8 < k = m^2 - 8q = \text{a square} = \lambda^2, \ \text{say},$$
i.e. those for which the only solution with positive determinant is the trivial X = I. 
For other solutions we have, from (5.6), 
$$w^2 - (\lambda v)^2 = 16.$$
If $\lambda^2 \neq 9$, the only pair of values is v = 0, w = 4, giving the matrix W (equation (5.7)) 
for even values of m, and no solution for odd values of m. 

If $\lambda^2 = 9$ we have in addition v = 1, w = 5, and if m = 4s + 1, 
$$U = \begin{pmatrix} s - 1 & -1 \\ s(s - 2) & -(s - 1) \end{pmatrix}$$
is a proper algebraic solution. But the curve given by these values of m and q is 
residual to a plane cubic, and the surface therefore contains a line, which may be 
taken to be the specialization instead of the curve $c_2$ (see §7 (i)), so that this case 
does not require separate consideration. 

Thus there are only two cases for which 
$$k = \lambda^2,$$
(i) m odd, no non-trivial solution, 

(ii) m even, one solution, namely, W. 

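Consider next the solutions for values of k which are not squares. We have to 
take account of common factors among 4, 2m, 2q, and find four distinct cases: 

(iii) m = 2r + 1: 
$$Y = \frac12\begin{pmatrix} \eta - m\eta' & 4\eta' \\ -2q\eta' & \eta + m\eta' \end{pmatrix}, \quad \eta^2 - k\eta'^2 = 4.$$

(iv) m = 2r, q = 2s + 1: 
$$X = \begin{pmatrix} \xi - \tfrac12m\xi' & 2\xi' \\ -q\xi' & \xi + \tfrac12m\xi' \end{pmatrix}, \quad \xi^2 - \tfrac14k\xi'^2 = 1.$$

(v) m = 4r + 2, q = 2s: 
$$Y = \frac12\begin{pmatrix} \eta - \tfrac12m\eta' & 2\eta' \\ -q\eta' & \eta + \tfrac12m\eta' \end{pmatrix}, \quad \eta^2 - \tfrac14k\eta'^2 = 4.$$

(vi) m = 4r, q = 2s: 
$$X = \begin{pmatrix} \xi - \tfrac14m\xi' & \xi' \\ -\tfrac12q\xi' & \xi + \tfrac14m\xi' \end{pmatrix}, \quad \xi^2 - \tfrac1{16}k\xi'^2 = 1.$$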


6. Some special quartic surfaces with two base curves 

(i) The surfaces for which $k = \lambda^2$ 

The geometric result corresponding to the two cases (i), (ii) of §5 is 

6.1. If a quartic surface is specialized only in such a way as to make it contain a 
non-singular curve c of order m and genus q + 1, and $m^2 - 8q$ is a square, then, if m is 
odd, there are no curves of the same genus as c on the surface other than those belonging 
to the linear family defined by c, and, if m is even, none beside that linear family, and the 
family of curves of the same order residual to it. 

If the quartic surface is made to contain a non-singular elliptic curve of order m, 
then q = 0, and the conditions above apply, i.e. 

6.2. If a quartic surface is specialized only in such a way as to make it contain a 
non-singular elliptic curve, c, of order m, then the only elliptic curves on the surface are, 
if m is odd, the pencil determined by c, and, if m is even, that pencil and the pencil of 
curves of the same order residual to it. 


(ii) The determinantal quartic surface 

If a quartic surface contains a Schur sextic curve of genus three, then it contains 
two residual $\infty^3$ families of such curves and is representable by the vanishing of a 
determinant of linear forms. Take as base a plane section $c_1$, and a Schur sextic, $c_2$. 
Then m = 6, 2q = 4, k = 20, and the surface is of type (v) of §5. The solutions of 
the equation (1.4) that have positive determinant are 
$$Y = \frac12\begin{pmatrix} \eta - 3\eta' & 2\eta' \\ -2\eta' & \eta + 3\eta' \end{pmatrix},$$
where $(\eta, \eta')$ is a solution of 
$$\eta^2 - 5\eta'^2 = 4.$$


The smallest significant solution in positive integers is (3,1), leading to 


*.-(_! !)■ _ i)- 


These matrices do not correspond to self-transformations, but 


f. :!) 


corresponds to the well-known self-transformation of the determinantal quartic 
surface, which is represented, roughly, by the interchange of the parts played by 
the rows and columns in the determinant.* 

The principal solution with negative determinant is 


-G o)- 


The principal geometric solution is 


'.*- 0 . -!)■ 


which corresponds to an interchange of the parts played by the two systems of 
Schur sextics. 

* Due originally to Schur (1882), p. 256. See also Jessop (1916, p. 163), and Room 
(unpublished, parts 1 to 5). 
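
For the determinantal surface the algebraic solutions are easily generated. The sketch below (Python, not part of the original paper) evaluates the type (v) formula of §5 for the pair (3, 1), with m = 6 and q = 2, and forms the first few powers of the result. The function names are illustrative, and no claim is made here as to which of these powers has a geometric interpretation. 

```python
def matmul(A, B):
    """Product of two matrices given as lists of rows."""
    return [[sum(a * b for a, b in zip(row, col)) for col in zip(*B)] for row in A]

def mat_pow(A, n):
    """n-th power (n >= 0) of a 2x2 matrix by repeated multiplication."""
    R = [[1, 0], [0, 1]]
    for _ in range(n):
        R = matmul(R, A)
    return R

def type_v_solution(m, q, eta, eta_p):
    """Type (v) matrix of section 5; requires eta^2 - (k/4)*eta'^2 = 4 with k = m^2 - 8q."""
    assert 4 * eta * eta - (m * m - 8 * q) * eta_p * eta_p == 16
    h = m // 2
    doubled = [[eta - h * eta_p, 2 * eta_p], [-q * eta_p, eta + h * eta_p]]
    assert all(e % 2 == 0 for row in doubled for e in row), "entries must be integers"
    return [[e // 2 for e in row] for row in doubled]

Gamma = [[4, 6], [6, 4]]                 # plane section and Schur sextic: m = 6, 2q = 4
Y0 = type_v_solution(6, 2, 3, 1)
powers = {n: mat_pow(Y0, n) for n in (1, 2, 3)}
checks = {n: matmul(matmul(Y, Gamma), [list(c) for c in zip(*Y)]) == Gamma
          for n, Y in powers.items()}
print(powers, checks)
```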



7. Quartic surfaces with two base curves, one of them rational 

Let a quartic surface $\pi$ be specialized only by being made to contain a non-singular 
rational curve of order m; we wish to determine what other rational curves lie on 
the surface in consequence.* 

Take as base curves on the surface 

$c_1$: a plane section, 

$c_2$: the rational curve of order m, 
$$\Gamma = \begin{pmatrix} 4 & m \\ m & -2 \end{pmatrix}.$$

A rational curve on a quartic surface does not define a linear system of curves, 
so that if $c \equiv n_1c_1 + n_2c_2$ is a rational curve, and $n_1 > 0$, $n_2 > 0$, the only curve 
equivalent to c is c itself, namely the reducible curve consisting of the section of $\pi$ by 
a surface of order $n_1$ together with $c_2$ counted $n_2$ times. Again, if $n_1$ is negative and 
equal to $-n_1'$, the equation $c + n_1'c_1 \equiv n_2c_2$ is devoid of geometrical significance, but if 
$$n_1 > 0, \quad n_2 = -n_2' < 0,$$
the curve may be proper, namely the residual curve cut out by a surface of order 
$n_1$ which cuts $\pi$ in $c_2$ counted $n_2'$ times. 

Rational curves on the surface are given by solutions of the equation 
$$4n_1^2 + 2mn_1n_2 - 2n_2^2 = -2,$$
but the only solutions which can lead to proper curves are those for which 
$$n_1 > 0, \quad n_2 < 0.$$
All solutions of this equation are included in the solutions of the matrix equation 
$$X\Gamma X' = \Gamma. \tag{1.4}$$

We need only one basic solution with negative determinant; this may be taken 
to be 
$$U = \begin{pmatrix} 1 & m \\ 0 & -1 \end{pmatrix}. \tag{7.1}$$
For even values of m we have another such solution, namely, 
$$W = \begin{pmatrix} 1 & 0 \\ \tfrac12m & -1 \end{pmatrix}, \tag{5.7}$$
so that 
$$X_0 = WU = \begin{pmatrix} 1 & m \\ \tfrac12m & \tfrac12m^2 + 1 \end{pmatrix} \tag{7.2}$$
is one solution with positive determinant. 

* The freedom of rational curves of order m is 4m, and the number of conditions imposed 
on a quartic surface in making it contain the curve is 4m + 1. Thus the freedom of quartic 
surfaces which contain an unspecified rational curve is one less than the freedom of general 
surfaces, namely, 33. Likewise the freedom of quartic surfaces containing any unspecified curve 
of order m and genus p is 33 (provided $p \neq \tfrac18m^2 + 1$), but such a surface contains a linear family 
of freedom p of such curves. 


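For general solutions with positive determinant we have: 

m odd: 
$$Y = \frac12\begin{pmatrix} \beta' - m\beta & 4\beta \\ 2\beta & \beta' + m\beta \end{pmatrix} \tag{7.3}$$
for any $(\beta, \beta')$ which satisfies 
$$\beta'^2 - (m^2 + 8)\beta^2 = 4, \tag{7.4}$$

m even: 
$$X = \begin{pmatrix} \alpha' - \tfrac12m\alpha & 2\alpha \\ \alpha & \alpha' + \tfrac12m\alpha \end{pmatrix} \tag{7.5}$$
for any $(\alpha, \alpha')$ which satisfies 
$$\alpha'^2 - \tfrac14(m^2 + 8)\alpha^2 = 1. \tag{7.6}$$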

Now consider in more detail the solution for odd values of m. Suppose $\beta$, $\beta'$ are 
both positive and are any solution of (7.4). Then all the elements in Y are positive, 
and the various possible arrangements of sign are 
$$Y = \begin{pmatrix} + & + \\ + & + \end{pmatrix}, \quad -Y = \begin{pmatrix} - & - \\ - & - \end{pmatrix}, \quad Y^{-1} = \begin{pmatrix} + & - \\ - & + \end{pmatrix}, \quad -Y^{-1} = \begin{pmatrix} - & + \\ + & - \end{pmatrix}.$$
Of these only the last can yield a proper rational curve. Next: 
$$UY = \frac12\begin{pmatrix} \beta' + m\beta & m\beta' + (m^2 + 4)\beta \\ -2\beta & -(\beta' + m\beta) \end{pmatrix},$$
$$YU = UY^{-1} = \frac12\begin{pmatrix} \beta' - m\beta & m\beta' - (m^2 + 4)\beta \\ 2\beta & -(\beta' - m\beta) \end{pmatrix}.$$
From (7.4) it follows that $\beta' - m\beta > 0$ and $m\beta' - (m^2 + 4)\beta < 0$ if $\beta > \tfrac12m$, so that of 
the four matrices $+UY$, $-UY$, $-UY^{-1}$, $+UY^{-1}$, only the last can yield a proper 
rational curve, namely, the same curve as is given by $-Y^{-1}$. But $UY^{-1}$ has the 
advantage that it may also give a proper curve to replace the plane section. Thus 
if the surface has a self-transformation the corresponding matrix must be one of 
the matrices $UY^{-1}$. There are corresponding results for the matrices X and U for 
even values of m. Thus 


7*1. If a quartic surface is specialized only by being made to contain a rational 
curve , then it has at most one self-transformation, and that transformation is involutory. 

We conclude this section by discussing briefly the surfaces with rational curves 
of orders m = 0 , 1 , 2, and in § 8 we discuss more fully the surface on which there is 
a twisted cubic. 

(i) m = 0. The surface with an ordinary node 
$$\Gamma = \begin{pmatrix} 4 & 0 \\ 0 & -2 \end{pmatrix}, \quad U = \begin{pmatrix} 1 & 0 \\ 0 & -1 \end{pmatrix}, \quad Y_0 = \begin{pmatrix} 3 & 4 \\ 2 & 3 \end{pmatrix},$$
$$Y_0U = UY_0^{-1} = \begin{pmatrix} 3 & -4 \\ 2 & -3 \end{pmatrix}.$$

The rational curve derived from $Y_0U$, namely, $2c_1 - 3c_2$, an octavic with a 6-ple 
point at the node, is a proper curve. It is the intersection of $\pi$ with the tangent cone 
at the node. $Y_0U$, in fact, corresponds to the involution determined on the surface 
by lines through the node. Next 
$$Y_0^2U = \begin{pmatrix} 17 & -24 \\ 12 & -17 \end{pmatrix}.$$

Since $12c_1 - 18c_2 = 6(2c_1 - 3c_2)$, it seems not improbable that the curve $12c_1 - 17c_2$ 
is a proper curve, and likewise that the rational curves obtained from all matrices 
$Y_0^rU$, for positive powers, are proper. 
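
The coefficients quoted for the nodal surface can be recomputed from (7.1), (7.3) and (7.4). The Python sketch below (not part of the original paper) assumes m = 0 and the least solution $(\beta', \beta) = (6, 2)$ of (7.4); the function names are illustrative. 

```python
def matmul(A, B):
    """Product of two matrices given as lists of rows."""
    return [[sum(a * b for a, b in zip(row, col)) for col in zip(*B)] for row in A]

def solution_7_3(m, beta_p, beta):
    """Matrix (7.3) for a pair satisfying beta'^2 - (m^2 + 8)*beta^2 = 4."""
    assert beta_p ** 2 - (m * m + 8) * beta ** 2 == 4
    doubled = [[beta_p - m * beta, 4 * beta], [2 * beta, beta_p + m * beta]]
    assert all(e % 2 == 0 for row in doubled for e in row), "entries must be integers"
    return [[e // 2 for e in row] for row in doubled]

m = 0
U = [[1, m], [0, -1]]                  # basic solution (7.1)
Y0 = solution_7_3(m, 6, 2)             # assumed least solution of (7.4) for m = 0
Y0U = matmul(Y0, U)
Y0sqU = matmul(matmul(Y0, Y0), U)
# The second row of each product gives the coefficients (n1, n2) of the rational curve.
print(Y0U[1], Y0sqU[1])
```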


(ii) m = 1. The surface with a line 
$$\Gamma = \begin{pmatrix} 4 & 1 \\ 1 & -2 \end{pmatrix}, \quad U = \begin{pmatrix} 1 & 1 \\ 0 & -1 \end{pmatrix}.$$
Equation (7.4) becomes 
$$\beta'^2 - 9\beta^2 = 4,$$
of which the only solution is $\beta' = \pm 2$, $\beta = 0$, giving $Y = \pm I$, and $\pm U$. Thus 

7.2. A quartic surface specialized only by being made to contain a line contains no 
other rational curves. 


(iii) m = 2. The surface with two coplanar conics 

Take the two conics as base curves, so that 
$$\Gamma = \begin{pmatrix} -2 & 4 \\ 4 & -2 \end{pmatrix},$$
and solutions are derived as in §5 (v). The Pell equation is 
$$y^2 - 3x^2 = 1,$$
leading to 

Hi:)- 


In any matrix $X_0^r$, or $VX_0^r$, the elements of one row are both positive and the elements 
of the other both negative, thus 

7.3. A quartic surface specialized only so that it contains two coplanar conics has on 
it no other proper rational curves. 

8. The quartic surface with a twisted cubic 


Let $\pi$ be a quartic surface specialized only by being made to pass through a twisted 
cubic. Take as base curves on $\pi$ 

$c_1$: a plane section, 
$c_2$: the twisted cubic. 

Then 
$$\Gamma = \begin{pmatrix} 4 & 3 \\ 3 & -2 \end{pmatrix},$$
and the algebraic solution of equation (1.4) with positive determinant is 
$$Y = \frac12\begin{pmatrix} \eta - 3\eta' & 4\eta' \\ 2\eta' & \eta + 3\eta' \end{pmatrix},$$
where 
$$\eta^2 - 17\eta'^2 = 4.$$

The least positive integers satisfying this equation are (66,16) so that 
$$Y_0 = \begin{pmatrix} 9 & 32 \\ 16 & 57 \end{pmatrix}.$$

An algebraic solution with negative determinant is 
$$U = \begin{pmatrix} 1 & 3 \\ 0 & -1 \end{pmatrix},$$
and the two simplest solutions which may be capable of geometric interpretation are 
$$Y_0U = \begin{pmatrix} 9 & -5 \\ 16 & -9 \end{pmatrix}$$
and 
$$Y_0^2U = \begin{pmatrix} 593 & -333 \\ 1056 & -593 \end{pmatrix}.$$

$Y_0U$ corresponds, as we see immediately, to a self-transformation of the surface, 
and so yields proper curves, of which the rational curve is given as $16c_1 - 9c_2$. But 
$1056c_1 - 593c_2 = 66(16c_1 - 9c_2) + c_2$, so that there is no proper rational curve 
determined by $Y_0^2U$. Likewise every matrix $Y_0^rU$ yields a curve $\lambda(16c_1 - 9c_2) + \mu c_2$ with 
$\lambda$, $\mu$ positive, and there are therefore no other rational curves on the surface. 
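
The decomposition used in this argument can be reproduced numerically. The Python sketch below (not part of the original paper) rebuilds the solution from the pair (66, 16), forms the products with U, and searches for the multiple $\lambda$ of $16c_1 - 9c_2$ contained in the second curve; the helper names are illustrative. 

```python
def matmul(A, B):
    """Product of two matrices given as lists of rows."""
    return [[sum(a * b for a, b in zip(row, col)) for col in zip(*B)] for row in A]

eta, eta_p, m = 66, 16, 3
assert eta ** 2 - 17 * eta_p ** 2 == 4           # modified Pell equation of this section

# Entries of Y_0 from the formula above (all numerators are even for this pair).
Y0 = [[(eta - m * eta_p) // 2, 2 * eta_p], [eta_p, (eta + m * eta_p) // 2]]
U = [[1, m], [0, -1]]

first = matmul(Y0, U)[1]                          # coefficients of the curve from Y0 U
second = matmul(matmul(Y0, Y0), U)[1]             # coefficients of the curve from Y0^2 U

# Write second = lam * first + mu * c_2, with c_2 represented by (0, 1).
lam = second[0] // first[0]
mu = second[1] - lam * first[1]
print(first, second, lam, mu)
```

Positive values of both $\lambda$ and $\mu$ signal that the curve in question contains $c_2$ as a component and so is not a proper rational curve. 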

Let us consider now the geometric determination of the self-transformation of 
the surface. Through any point of $\pi$ a single chord of $c_2$ may be drawn, and this chord 
meets $\pi$ in one further point, so that chords of $c_2$ establish an involution $\mathcal{P}$ on $\pi$. 
To construct the matrix P corresponding to $\mathcal{P}$ we have to determine from the 
numbers $n_i = \{a, c_i\}$ for a general curve, a, the numbers $\mathcal{P}n_i = \{\mathcal{P}a, c_i\}$ (cf. equation 
(1.9)). 

By degeneration methods we find that the chords of $c_2$ which meet a form a surface 
of order $4n_1 - 2n_2$ on which $c_2$ is $(2n_1 - n_2)$-ple. Thus $\mathcal{P}n_1$, which is the order of the 
residual intersection of this surface with $\pi$, is given by 
$$\mathcal{P}n_1 = 4(4n_1 - 2n_2) - 3(2n_1 - n_2) - n_1 = 9n_1 - 5n_2.$$
Since the transformation is involutory 
$$\mathcal{P}c_1 \equiv \mathcal{P}^{-1}c_1 \equiv 9c_1 - 5c_2,$$
and we may complete the matrix P from equation (1.4), or from $P^2 = I$. We find 
$$P = \begin{pmatrix} 9 & -5 \\ 16 & -9 \end{pmatrix}.$$

$\mathcal{P}c_2$ is the curve cut out by chords of $c_2$ which are tangents to $\pi$ at one of their 
intersections with $\pi$. One tangent of $\pi$ at a point of $c_2$ meets $c_2$ again, and seven chords 
of $c_2$ through a point of $c_2$ touch $\pi$ at their second intersection with $c_2$. Thus on the 
surface of chords of $c_2$ tangent to $\pi$ at one of their intersections with $c_2$, the curve $c_2$ 
is altogether 8-ple, but one of the sheets of the surface touches $\pi$ all along $c_2$, so that 
its total intersection with $\pi$ is $c_2$ counted 9 times, together with $\mathcal{P}c_2$. This is in 
conformity with the figures in the matrix which set out that $\mathcal{P}c_2 + 9c_2$ is the complete 
section of $\pi$ by a surface of order 16. 




To determine the order of the double curve, p, of the involution, we may allow 
$c_2$ to degenerate into two skew lines l, l', and a transversal m. Transversals of l, l' 
which are tangent to $\pi$ elsewhere form a surface of order 8 on which l, l' are 4-ple. 
The line m is double on this surface, so that the residual intersection of the surface 
with $\pi$ is a curve of order 11, counted twice. The lines which meet l and m and are 
tangent elsewhere touch the residual conic in the plane of l and m. Thus the order 
of the complete double curve is 15, and therefore in the case of a non-degenerate 
cubic $c_2$, the chords of $c_2$ which are tangent elsewhere form a surface of order 12 on 
which $c_2$ is 6-ple, and which cuts $\pi$ residually in p counted twice, i.e. 
$$2p \equiv 12c_1 - 6c_2.$$
The genus of $12c_1 - 6c_2$ is 37, so that the genus of p is 10. 

This surface and transformation provide an illustration of how little even the 
matrix corresponding (for a certain base) to the transformation is characteristic. 
Consider the simple involution $\mathcal{D}$ of a quadric A determined by lines through a point
not on A. The effect of $\mathcal{D}$ is to interchange the generators of the two systems, so that
if the base curves $g = (g_1, g_2)$ on A are taken to be one generator of each system,
we have

$$\mathcal{D}g = Dg,$$

where

$$D = \begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix}.$$

But the matrix

$$K = \begin{pmatrix} 2 & 3 \\ 3 & 5 \end{pmatrix}$$

is such that

$$KDK^{-1} = P,$$

so that

$$\mathcal{D}(Kg) = K(\mathcal{D}g) = KDg = PKg.$$

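Explicitly

$$\mathcal{D}\begin{pmatrix} 2g_1 + 3g_2 \\ 3g_1 + 5g_2 \end{pmatrix} = \begin{pmatrix} 9 & -5 \\ 16 & -9 \end{pmatrix}\begin{pmatrix} 2g_1 + 3g_2 \\ 3g_1 + 5g_2 \end{pmatrix} = \begin{pmatrix} 3g_1 + 2g_2 \\ 5g_1 + 3g_2 \end{pmatrix}.$$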


Thus the matrix corresponding to the transformation $\mathcal{D}$ of A, when we take as base
curves on A:

a (3,2)-curve, $g'_1 = 2g_1 + 3g_2$,
and a (5,3)-curve, $g'_2 = 3g_1 + 5g_2$,

is identical with the matrix P of the self-transformation of $\pi$. The intersection
matrix of A is

$$(\{g'_i, g'_j\}) = \begin{pmatrix} 12 & 19 \\ 19 & 30 \end{pmatrix}.$$

The double curve $d$ of $\mathcal{D}$ is a certain plane section, i.e.

$$2d = 2(g_1 + g_2) = 2\{(5g'_1 - 3g'_2) + (-3g'_1 + 2g'_2)\}$$



40 


T. G, Room 


The essential feature of the matrix K defined so that $KDK^{-1} = P$ is that both K
and $K^{-1}$ are composed of integers. Thus the curves $g'_i$ on A are proper curves, and the
generators (curves $g_i$) may be cut out (one at a time) by surfaces determined by these
curves.*
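The integrality of $K^{-1}$ and the relation $KDK^{-1} = P$ can be confirmed with exact rational arithmetic. The following is a minimal sketch; the helper names are hypothetical, and the generators are assumed to satisfy $\{g_1, g_2\} = 1$, $\{g_i, g_i\} = 0$, so that their intersection matrix coincides with D.

```python
from fractions import Fraction

K = [[2, 3], [3, 5]]        # change of base g' = K g
D = [[0, 1], [1, 0]]        # interchange of the two systems of generators
P = [[9, -5], [16, -9]]     # matrix of the self-transformation of the quartic


def matmul(a, b):
    """Product of two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)]
            for row in a]


def inverse_2x2(m):
    """Exact inverse of a 2x2 matrix with non-zero determinant."""
    det = Fraction(m[0][0] * m[1][1] - m[0][1] * m[1][0])
    return [[m[1][1] / det, -m[0][1] / det],
            [-m[1][0] / det, m[0][0] / det]]


K_inv = inverse_2x2(K)
k_inv_is_integral = all(x.denominator == 1 for row in K_inv for x in row)
conjugate = matmul(matmul(K, D), K_inv)

# Intersection matrix of the new base curves: K G K^T, with G = D here
# because generators of opposite systems meet once and those of the same
# system do not meet (an assumption stated in the lead-in).
K_transposed = [list(col) for col in zip(*K)]
new_intersections = matmul(matmul(K, D), K_transposed)

print(K_inv, conjugate, new_intersections)
```

Because the determinant of K is 1, the inverse has integer entries, which is the property the paragraph above singles out.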

An example of a self-transformation of a quartic surface with two base curves
which cannot be made to correspond to the transformation $\mathcal{D}$ is provided by the
quartic surface with an ordinary node N, which was discussed in § 7 (i). The matrix
corresponding to the involution determined by lines through N is

$$N = \begin{pmatrix} 3 & -4 \\ 2 & -3 \end{pmatrix}.$$

Let

$$H = \begin{pmatrix} p & q \\ r & s \end{pmatrix},$$

with

$$ps - qr = e = \pm 1,$$

be such that

$$HDH^{-1} = N.$$

Then

$$2p = 3r + s, \quad 2q = r + 3s, \quad ps - qr = e,$$

i.e.

$$s^2 - r^2 = 2e.$$

This equation has no solution in integers, so that no matrix H of the type required
exists.
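The absence of integer solutions follows from residues modulo 4: a square is congruent to 0 or 1, so a difference of two squares is never congruent to 2, whereas both +2 and -2 are. The short check below illustrates this argument; the bounded search is a sanity check only and is not itself a proof.

```python
# All residues that s^2 - r^2 can take modulo 4.
residues = sorted({(s * s - r * r) % 4 for s in range(4) for r in range(4)})

# Both 2e = +2 and 2e = -2 are congruent to 2 modulo 4.
has_solution_mod_4 = 2 in residues

# Direct search over a small box of integers, as a sanity check only.
small_solutions = [(r, s)
                   for r in range(-50, 51)
                   for s in range(-50, 51)
                   if s * s - r * r in (2, -2)]

print(residues, has_solution_mod_4, small_solutions)
```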


9. A TERNARY QUADRATIC FORM, AND INVOLUTIONS
ON CERTAIN QUARTIC SURFACES

A general attack on the problem of solving the set of diophantine equations

$$X\Gamma X' = \Gamma \qquad (1{\cdot}4)$$

for matrices of order 3, is not at present contemplated, but some indication of the
possible solutions in certain cases may be obtained by constructing quartic surfaces
with three base curves which have geometrically definable self-transformations.

One such family of quartic surfaces may be established by making use of the
following theorem:

9·1. If two curves are such that through any point of space, one and only one line can
be drawn to meet the curves in distinct points, then one of the curves is a straight line,
and the other is a rational curve of order m to which the line is (m - 1)-secant.

* A surface of order 10 through two curves $g'_2$ cuts out four generators $g_1$ on A; a surface
of order 10 through these four generators and three curves $g'_1$ cuts A residually in a single
generator $g_2$.

Thus the most general involutory transformation of a quartic surface that may
be determined by 'chords' of some reducible curve on the surface, is that of a surface
that contains a rational curve of order m and an (m - 1)-secant line of the curve.
Let $\pi$ be such a surface, with a self-transformation $\mathcal{R}$, and on it take as base:

$c_1$: a plane section,
$c_2$: the line,
$c_3$: the rational curve of order m to which $c_2$ is (m - 1)-secant,

for which the intersection matrix is

$$\Gamma = \begin{pmatrix} 4 & 1 & m \\ 1 & -2 & m-1 \\ m & m-1 & -2 \end{pmatrix}.$$

To any self-transformation of the surface corresponds a solution of the six
diophantine equations

$$4x_ix_j - 2y_iy_j - 2z_iz_j + (m-1)(y_iz_j + y_jz_i) + m(z_ix_j + z_jx_i) + (x_iy_j + x_jy_i) = \gamma_{ij}. \qquad (9{\cdot}1)$$

For a given curve $a$ let $\nu_i = \{a, c_i\}$;
then, by allowing $a$ to degenerate into an appropriate set of $\nu_1$ lines, we find that the
order $\nu'_1$ of $\mathcal{R}a$ is*

$$\nu'_1 = (2m+3)\nu_1 - (2m+1)\nu_2 - 3\nu_3.$$

From this, and (1·4), and $R^2 = I$ it follows that

$$R = \begin{pmatrix} 2m+2 & -2m-1 & -3 \\ 2m+2 & -2m-1 & -3 \\ 2m+2 & -2m-1 & -3 \end{pmatrix} + I.$$

The recurrence relation derived from the characteristic equation of R is

$$n_{r+3} - n_{r+2} - n_{r+1} + n_r = 0 \qquad (9{\cdot}2)$$

(whereas the 'true' recurrence relation is of course $n_{r+2} - n_r = 0$).
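The two defining properties of R, namely $R^2 = I$ and invariance of the intersection form, can be tested for a range of values of m. The sketch below takes R to be the matrix with three identical rows (2m+2, -2m-1, -3) plus the identity, and takes the intersection matrix of the base to have rows (4, 1, m), (1, -2, m-1), (m, m-1, -2); both are readings of the text above, and the function names are hypothetical.

```python
def matmul(a, b):
    """Product of two integer matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)]
            for row in a]


def transpose(a):
    return [list(col) for col in zip(*a)]


IDENTITY3 = [[1, 0, 0], [0, 1, 0], [0, 0, 1]]


def gamma(m):
    """Intersection matrix of plane section, line and rational m-ic."""
    return [[4, 1, m], [1, -2, m - 1], [m, m - 1, -2]]


def involution_matrix(m):
    """R = (three identical rows) + I."""
    row = [2 * m + 2, -2 * m - 1, -3]
    return [[row[j] + (1 if i == j else 0) for j in range(3)]
            for i in range(3)]


def check(m):
    r, g = involution_matrix(m), gamma(m)
    squares_to_identity = matmul(r, r) == IDENTITY3
    preserves_form = matmul(matmul(r, g), transpose(r)) == g
    return squares_to_identity and preserves_form


all_ok = all(check(m) for m in range(1, 21))
print(all_ok, involution_matrix(1), matmul(involution_matrix(2), gamma(2)))
```

Since $R^2 = I$, the cubic recurrence (9·2) is satisfied trivially by any sequence obeying $n_{r+2} = n_r$, which is the sense in which the latter is the 'true' relation.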


Next

$$(\{\mathcal{R}c_i, c_j\}) = R\Gamma = \begin{pmatrix} 3m+11 & 3m+8 & 4m+7 \\ 3m+8 & 3m+5 & 4m+6 \\ 4m+7 & 4m+6 & 3m+5 \end{pmatrix}.$$

The important figures are

$$\{\mathcal{R}c_3, c_1\} = 4m + 7, \quad \{\mathcal{R}c_3, c_2\} = 4m + 6.$$

Thus

9·2. The curve $\mathcal{R}c_3$ is a rational (4m + 7)-ic to which the line $c_2$ is (4m + 6)-secant,
and the surface consequently has a second self-transformation, $\mathcal{R}_1$, determined by $\mathcal{R}c_3$
and $c_2$, and thence a sequence of self-transformations each determined by $c_2$ and the
transform of the fundamental curve in the preceding member of the sequence.

Replacing m by 4m + 7 in the matrix R we find

$$\mathcal{R}_1\begin{pmatrix} c_1 \\ c_2 \\ \mathcal{R}c_3 \end{pmatrix} = \begin{pmatrix} 8m+17 & -8m-15 & -3 \\ 8m+16 & -8m-14 & -3 \\ 8m+16 & -8m-15 & -2 \end{pmatrix}\begin{pmatrix} c_1 \\ c_2 \\ \mathcal{R}c_3 \end{pmatrix}.$$


* Suppose that $\nu_1 > \nu_2 + \nu_3$; then let $a$ consist of $\nu_1 - \nu_2 - \nu_3$ lines $l_1$ not meeting $c_2$ or $c_3$,
$\nu_2$ lines $l_2$ meeting $c_2$ only, and $\nu_3$ lines $l_3$ meeting $c_3$ only. Let $\mu_i$ be the order of the
residual curve cut out by lines which meet a line $l_i$ and $c_2$ and $c_3$, then it may easily be
proved that $\mu_1 = 2m + 3$, $\mu_2 = 2$, $\mu_3 = 2m$, so that the order of the residual curve cut out by
lines meeting $a$, $c_2$, $c_3$ is $(2m+3)(\nu_1 - \nu_2 - \nu_3) + 2\nu_2 + 2m\nu_3$. The case $\nu_1 < \nu_2 + \nu_3$ may be dealt
with similarly by allowing some of the lines to meet both $c_2$ and $c_3$.



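But

$$\mathcal{R}c_3 = (2m+2)c_1 - (2m+1)c_2 - 2c_3,$$

so that

$$\mathcal{R}_1\begin{pmatrix} c_1 \\ c_2 \\ c_3 \end{pmatrix} = \begin{pmatrix} 2m+11 & -2m-12 & 6 \\ 2m+10 & -2m-11 & 6 \\ 0 & 0 & 1 \end{pmatrix}\begin{pmatrix} c_1 \\ c_2 \\ c_3 \end{pmatrix}.$$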

The result $\mathcal{R}_1c_3 = c_3$ is to be expected, since $\mathcal{R}c_3$ is the curve cut out by tangents
to $\pi$ along $c_3$ that meet $c_2$. Thus, in $\mathcal{R}_1$, $c_3$ is part of the curve cut out by the lines
which meet $c_2$ and $\mathcal{R}c_3$, and touch $\pi$ elsewhere. In effect, $c_3$ is part of the double
curve of $\mathcal{R}_1$.

The curve $\mathcal{R}_1\mathcal{R}c_3$ is of order 4(4m + 7) + 7 = 16m + 35, and to it $c_2$ is (16m + 34)-secant.
For the involution $\mathcal{R}_2$ determined by these we find

$$\mathcal{R}_2\begin{pmatrix} c_1 \\ c_2 \\ c_3 \end{pmatrix} = \begin{pmatrix} 20m+37 & -20m-32 & -12 \\ 20m+36 & -20m-31 & -12 \\ 10m+18 & -10m-16 & -5 \end{pmatrix}\begin{pmatrix} c_1 \\ c_2 \\ c_3 \end{pmatrix}.$$

For the matrices corresponding to $\mathcal{R}_1$, $\mathcal{R}_2$, we find

$$R_1R = RR_2 = \begin{pmatrix} 12m+21 & -12m-17 & -9 \\ 12m+20 & -12m-16 & -9 \\ 2m+2 & -2m-1 & -2 \end{pmatrix},$$


so that 





This relation is true of any three successive transformations in the sequence, thus 
9*3. If is a transformation of the sequence defined in theorem 9*2, then 

&nt = 

and all transformations of the sequence may be expressed as products of $ with powers 
of .¥* - 

We have, in fact, two sequences of transformations of n: 

c/>n ( _ which are not involutory, 

and $.9 f ' n (— &-*{%) which are involutory. 

From the characteristic equation of S we find the recurrence relation for numbers
associated with a sequence of curves

... $a$, $\mathcal{S}a$, $\mathcal{S}^2a$, ...

to be*

$$n_{r+3} - 3n_{r+2} + 3n_{r+1} - n_r = 0, \qquad (9{\cdot}3)$$

and in consequence

$$n_r = A + Br + Cr^2,$$

where the values of A, B, C are determined by the particular sequence under
consideration. The elements of the powers of S form such sequences, and we find in
this way that

$$S^r = r^2\begin{pmatrix} 9m+21 & -9m-21 & 0 \\ 9m+21 & -9m-21 & 0 \\ 3m+7 & -3m-7 & 0 \end{pmatrix} + r\begin{pmatrix} 3m-1 & -3m+4 & -9 \\ 3m-1 & -3m+4 & -9 \\ -m-5 & m+6 & -3 \end{pmatrix} + I.$$

* This is the same as the recurrence relation for one of the sequences of transformations
of a quartic surface on which there is a twisted cubic and a non-incident line. This surface
(see, for example, Room (unpublished) parts 2, 4 appendix II, and 5) is determinantal and
has two distinct sequences of self-transformations, derived from the Schur self-transformation,
and the involution determined by chords of the cubic. With a plane section, the
line, and the twisted cubic as base curves we find

$$\Gamma = \begin{pmatrix} 4 & 1 & 3 \\ 1 & -2 & 0 \\ 3 & 0 & -2 \end{pmatrix}, \quad T = \begin{pmatrix} 13 & 5 & -8 \\ 8 & 4 & -5 \\ 0 & 1 & 0 \end{pmatrix}, \quad U = \begin{pmatrix} 9 & 0 & -5 \\ 4 & -1 & -2 \\ 16 & 0 & -9 \end{pmatrix} \quad \text{and} \quad TU = \begin{pmatrix} 9 & -5 & -3 \\ 8 & -4 & -3 \\ 4 & -1 & -2 \end{pmatrix}.$$

The recurrence relation for TU is (9·3).



These and the matrices $RS^r$ provide some of the solutions to the diophantine
equations (9·1).
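The quadratic dependence of the powers of S on r can be confirmed by direct multiplication. The sketch below assumes that S is the product $R_1R$ of the matrices given earlier in this section and compares its powers with the expression $r^2A + rB + I$; the matrix names A and B and the helper functions are hypothetical labels introduced only for this check.

```python
def matmul(a, b):
    """Product of two integer matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)]
            for row in a]


IDENTITY3 = [[1, 0, 0], [0, 1, 0], [0, 0, 1]]


def R(m):
    row = [2 * m + 2, -2 * m - 1, -3]
    return [[row[j] + (1 if i == j else 0) for j in range(3)]
            for i in range(3)]


def R1(m):
    return [[2 * m + 11, -2 * m - 12, 6],
            [2 * m + 10, -2 * m - 11, 6],
            [0, 0, 1]]


def S(m):
    """Assumed definition: S = R1 R."""
    return matmul(R1(m), R(m))


def power(matrix, r):
    result = IDENTITY3
    for _ in range(r):
        result = matmul(result, matrix)
    return result


def closed_form(m, r):
    """r^2 A + r B + I with the two matrices printed in the text."""
    A = [[9 * m + 21, -9 * m - 21, 0],
         [9 * m + 21, -9 * m - 21, 0],
         [3 * m + 7, -3 * m - 7, 0]]
    B = [[3 * m - 1, -3 * m + 4, -9],
         [3 * m - 1, -3 * m + 4, -9],
         [-m - 5, m + 6, -3]]
    return [[r * r * A[i][j] + r * B[i][j] + (1 if i == j else 0)
             for j in range(3)] for i in range(3)]


powers_agree = all(power(S(m), r) == closed_form(m, r)
                   for m in range(1, 6) for r in range(0, 7))

# Each product R S^r should again be an involution.
products_are_involutions = all(
    matmul(matmul(R(m), power(S(m), r)), matmul(R(m), power(S(m), r)))
    == IDENTITY3
    for m in range(1, 6) for r in range(0, 5))

print(powers_agree, products_are_involutions)
```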

The transformations $\mathcal{S}^r$ and $\mathcal{R}\mathcal{S}^r$ are not necessarily all the self-transformations
of the surface as the following examples show:

m = 1. There are two separate sequences of transformations (and all their
products) obtained by making each of the lines in turn play the part of $c_3$.

m = 3. There is a further involutory transformation determined from chords of
the twisted cubic $c_3$ as in § 8.

m = 4. The curve $c_3$ is a rational quartic and $c_2$ is a trisecant of it so that $c_2 + c_3$
lies on a quadric. Thus $2c_1 - c_2 - c_3$ is a twisted cubic and determines a further
involution as in § 8.

m = 5. The curve $c_3$ is a rational quintic, and $c_2$ is its single quadrisecant line;
$c_2 + c_3$ is a sextic of genus three, and the quartic surface is thus determinantal, and
has the sequence of self-transformations corresponding to those discussed in § 6 (ii)
(see Room, unpublished, part 3).

The writer wishes to acknowledge the help derived from the many useful
suggestions and criticisms of Mr F. Chong of the University of Sydney.


References 

Hodge 1937 J. Lond. Math. Soc. 12, 58-63.

Jessop 1916 Quartic surfaces. Camb. Univ. Press.

Landau 1927 Vorlesungen über Zahlentheorie. Leipzig: S. Hirzel.

Room (to be published in Proc. Lond. Math. Soc.) 'Self-transformations of determinantal
quartic surfaces.' Parts 1 to 5.

Schur 1882 Math. Ann. 20, 254-290.



Solar radiation in the radio spectrum 
I. Radiation from the quiet sun 


By D. F. Martyn, D.Sc., Australian Council for Scientific and Industrial Research

(Communicated by Sir David Rivett , F. R.S.—Received 20 June 1947 
—Read 29 January 1948) 


The theory of the emission of thermal radiation from the solar envelope at radio-frequencies
is worked out in detail. The Lorentz theory of absorption is used in conjunction with
Kirchhoff's law to derive the effective temperature of the various regions of the solar disk over
the radio spectrum. A maximum effective temperature approaching $10^6$ °C is found in the
vicinity of 1 m. wave-length. Limb brightening occurs at centimetre wave-lengths. It is
shown that Gaunt's quantum mechanical expression for free-free emission yields results
almost identical with the classical treatment, provided Chapman and Cowling's expression
for the collision frequency in a fully ionized gas is used in the latter treatment. It is
suggested that it may be preferable to treat problems of solar and galactic radio noise by
classical methods, particularly when the refractive index of the medium departs appreciably
from unity.


1. Introduction 

The great advances in radio technique in the past few years have, theoretically,
made it possible to detect solar radiation on wave-lengths of the order of centimetres.
At such wave-lengths Planck's formula for black-body emission degenerates
to the Rayleigh-Jeans formula, and the emissivity of a black body becomes
proportional to $T/\lambda^2$ per unit frequency increment, T being the temperature of the
source, and $\lambda$ the wave-length of the radiation. The first successful observations of
such emission from the sun appear to have been made by Reber (1944) and Southworth
(1944). The latter deduced from his observations a value of T about 6000° C
at a wave-length of about 10 cm., agreeing with the well-known temperature found
in the visible region of the spectrum. Subsequently, however, Southworth (1946)
corrected his results and deduced a temperature of 20,000° C.
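The closeness of the Rayleigh-Jeans approximation at centimetre wave-lengths can be illustrated numerically. The sketch below uses modern SI values of the physical constants and an illustrative temperature of 6000 °K at a wave-length of 10 cm.; the function names are hypothetical and the numbers are chosen only to show the order of magnitude of the departure from Planck's formula.

```python
import math

H_PLANCK = 6.62607015e-34   # J s
K_BOLTZ = 1.380649e-23      # J / K
C_LIGHT = 2.99792458e8      # m / s


def planck(frequency, temperature):
    """Black-body emission per unit frequency increment (Planck)."""
    x = H_PLANCK * frequency / (K_BOLTZ * temperature)
    return 2.0 * H_PLANCK * frequency ** 3 / C_LIGHT ** 2 / math.expm1(x)


def rayleigh_jeans(wavelength, temperature):
    """Long-wave limit: proportional to T / wavelength^2."""
    return 2.0 * K_BOLTZ * temperature / wavelength ** 2


wavelength = 0.10       # metres (10 cm.), illustrative
temperature = 6000.0    # degrees absolute, illustrative

ratio = (planck(C_LIGHT / wavelength, temperature)
         / rayleigh_jeans(wavelength, temperature))
print(ratio)
```

The ratio differs from unity by roughly one part in $10^5$ for these values, so the two formulae are interchangeable for the purposes of the discussion.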

In the meantime, a number of investigators, notably Appleton (1945), and
Pawsey, Payne-Scott & McCready (1946), had reported strong but variable emission
from the sun at considerably longer wave-lengths, of the order of metres. Appleton,
and also Pawsey et al., suggested that such emissions, which appeared to correlate
with sunspots, could not well be thermal in character, since such an interpretation
would require the postulation of impossibly high values of T on the sun.

This paper (I) is concerned only with the discussion of that component of solar 
radiation which appears to be thermal in origin. It is hoped to discuss the more 
highly organized radiation which appears to be correlated with sunspots in 
part II. 


The recent investigations of Edlén (1942) and Alfvén (1941) have shown that
coronal matter is normally at a temperature approaching one million degrees. It is
obviously important, therefore, in interpreting the observations, to have a clear
picture of the region of the sun (photosphere, chromosphere or corona) from which
the observed radiation at any selected wave-length is emitted. To achieve this,
a reasonably accurate knowledge of the electron-pressure distribution in these
regions is necessary. Fortunately, such information is available from several sources.
Baumbach (1937) gives what should be reliable estimates of the electron densities
in the corona, since they are deduced directly from observations of light scattering
by these particles. Cillié & Menzel (1935) have made a theoretical investigation of
the electron pressures in the chromosphere. It was found in the course of the present
investigation that radiation in the radio spectrum does not originate in regions lower
than this.


2. Radiation from an electron-ion atmosphere

The problem of the depth from which solar radiation of given frequency issues
has already been encountered by solar physicists. Milne has introduced the useful
concept of optical depth $\tau$ given by $\tau = \int \kappa\,ds$, where $\kappa$ is the absorption coefficient
and $ds$ an element of path along the selected ray. He has shown (Milne 1930) that
the radiation observed from a slab of material comes from an average optical depth
$\tau = 1$ measured along the ray. This concept is adequate for conditions encountered
in the visible spectrum. It is found below, however, that over a large part of the
radio spectrum the refractive index $\mu$ of the medium is considerably less than unity.
Moreover, the electron-collision frequency $\nu$ is so small that it is possible at certain
wave-lengths for incoming radiation to penetrate the solar envelope to the level
$\mu = 0$ without serious absorption. In other words, at certain (radio) wave-lengths
the solar envelope is a reflector. Under such conditions it may be impossible for
incoming radiation to reach the optical depth unity with appreciable intensity, and
Milne's criterion cannot be applied.

It is a well-known fact in the theory of radio propagation in the ionosphere that
under these circumstances most of the absorption occurs near the level $\mu = 0$, where
the ray is strongly deviated (by angle $\pi$ for the presently assumed case of vertical
incidence). It will be sufficient then to replace Milne's criterion in such cases by the
criterion that the observed radiation comes from the level $\mu = 0$. In these cases,
however, the radiation emitted at wave-length $\lambda$ will no longer be the full radiation
corresponding to the temperature T at this level. Since the medium is a reflector
it will be necessary, in calculating the emission, to reduce the theoretical black-body
emission by a factor which is the absorptive power, in accordance with Kirchhoff's
law. Those familiar with the theory of radio propagation in the ionosphere will
readily recognize the distinction between the two cases described above. Milne's
criterion applies for 'absorption in the non-deviating region'. The other treatment
must be used for 'absorption in the deviating region'.





2·1. The determination of the effective levels of emission
2·1·1. Case (a). The atmosphere is a full absorber
For this class of wave-lengths, which lies at the short-wave end of the radio spectrum,
it is necessary, as explained above, to determine the levels at which $\tau = 1$.
Baumbach's results (1937), which are described below, reveal that the electron-density
gradient in the corona is exponential in form. Moreover, it is found that the
exponential coefficient is constant over large ranges of radial distance from the sun.
Since the corona is completely ionized and neutral, $\nu$ must also have an exponential
gradient, with approximately the same coefficient, so long as the temperature change
is small over the range under consideration.

Hence $N = N_0e^{-\beta r}$

and $\nu = \nu_0e^{-\beta r}$,

where N is the electron density at distance r from the sun's centre, and $N_0$, $\nu_0$, $\beta$ are
constants to be determined later from Baumbach's results.

Absorption of a ray in a medium of this type has been previously worked out
(Martyn 1935a). From the latter treatment* it follows immediately that

$$\tau = \frac{2\nu_0p^2\cos^3 i}{3c\beta\rho N_0}(\sin^3\theta - 3\sin\theta + 2),$$

where

$$\cos^2\theta = \rho N/p^2\cos^2 i,$$

$i$ being the angle of incidence of the ray, $p/2\pi$ the wave frequency, $c$ the velocity of
light, and $\rho = 4\pi e^2/m$, $e$, $m$ being the electronic charge and mass. Writing 'a' for
$3c\beta\rho N_0/2\nu_0p^2\cos^3 i$, the condition for $\tau = 1$ is

$$\sin^3\theta - 3\sin\theta + (2 - a) = 0. \qquad (1)$$

The solution of this cubic equation gives the value N' of N at which $\tau = 1$, when the
frequency and angle of incidence are specified. Over most of the range to be investigated
it turns out that $a \ll 1$. In these circumstances the solution of equation
(1) is $\cos^2\theta = 1{\cdot}15\sqrt{a}$,

and so

$$N' = 1{\cdot}15\,p\cos^{1/2} i\,\sqrt{(3c\beta N_0/2\rho\nu_0)}. \qquad (2)$$
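The small-a solution quoted for equation (1) can be compared with a direct numerical solution of the cubic. The sketch below solves $\sin^3\theta - 3\sin\theta + (2 - a) = 0$ by bisection for $\sin\theta$ between 0 and 1 and compares $\cos^2\theta$ with $2\sqrt{(a/3)}$, whose coefficient $2/\sqrt{3}$ is approximately 1·15. The function names and the sample value of a are assumptions made for illustration.

```python
import math


def sin_theta_at_unit_depth(a, iterations=200):
    """Root of s^3 - 3 s + (2 - a) = 0 with 0 <= s <= 1, for 0 < a < 2.

    The left-hand side falls steadily from 2 - a at s = 0 to -a at s = 1,
    so simple bisection is sufficient.
    """
    lo, hi = 0.0, 1.0
    for _ in range(iterations):
        mid = 0.5 * (lo + hi)
        if mid ** 3 - 3.0 * mid + 2.0 - a > 0.0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


def cos2_theta_exact(a):
    s = sin_theta_at_unit_depth(a)
    return 1.0 - s * s


def cos2_theta_small_a(a):
    """Leading-order solution for small a."""
    return 2.0 * math.sqrt(a / 3.0)


a = 1.0e-4   # illustrative value, well inside the small-a range
exact = cos2_theta_exact(a)
approx = cos2_theta_small_a(a)
relative_error = abs(approx - exact) / exact
print(exact, approx, relative_error)
```

For a of this size the two values agree to within a fraction of one per cent, which supports the use of the simpler expression (2) over most of the range.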


2·1·2. Case (b). The atmosphere is a partial reflector

In these cases, which apply for the longer wave-lengths and shallower angles of
incidence, the level at which an incoming ray is turned back has to be determined,
since this is the region where absorption chiefly occurs. This level is readily fixed
by an application of Snell's law, together with the well-known expression due to
Lorentz for the refractive index of an ionized gas, viz.

$$\mu^2 = 1 - \rho N/p^2.$$

* In the paper quoted, absorption was expressed in ratios of electric field strength. In the
present paper it is expressed in energy ratios, in accordance with optical convention.




At the turning point, by Snell's law,

$$\mu_1^2 = \sin^2 i = 1 - \rho N_1/p^2,$$

and so

$$N_1 = p^2\cos^2 i/\rho. \qquad (3)$$

For given $p$ and $i$ equation (3) determines the level from which the radiation is
principally emitted, when the atmosphere is a reflector.

2·2. The determination of the absorptive power of the medium

The absorptive power A of the medium is, by definition, $(1 - e^{-2\tau_1})$, where $\tau_1$ is
the optical depth of the turning point of an incoming ray. Clearly, for wave-lengths
such that the medium is a full absorber, A is unity.

When the medium has appreciable reflecting power it is necessary to know $\tau_1$, in
order to calculate the emission by Kirchhoff's law. It has previously been shown
(Martyn 1935a) that for a medium with exponential gradient

$$\tau_1 = 4\nu_0\cos^3 i\,p^2/3c\rho\beta N_0. \qquad (4)$$

2·3. The effect of a magnetic field on the emission levels
2·3·1. The absorption levels

When the presence of a magnetic field H is taken into account the atmosphere
becomes doubly refracting, and for each frequency there are two levels of effective
absorption. The appropriate theory for a uniformly ionized medium has been given
by Appleton (1932). The equations expressing the two modes of propagation (the
'ordinary' and 'extraordinary' rays) are cumbersome, but fortunately it has proved
possible to make adequate approximations for the purposes of this paper. In
particular, the fact that $\nu/p \ll 1$ over the region of the radio spectrum under examination
greatly simplifies the equations. Another simplification arises from the fact
that the gyro-frequency $\omega$ (= He/m) is found to be less than $p$ in the absorbing
regions of the solar envelope, if H be given the value appropriate to the sun's general
magnetic field.

Over all but a very limited range of values of $i$ the treatment given in (2·1·1) and
(2·1·2) remains valid for the ordinary ray. For the extraordinary ray, however, it
becomes necessary to replace 'a' in equation (1) by $a(1 - y)/(1 - |y_L|)$, where $y$ is
$\omega/p$, and $y_L$ is the component of $y$ along the direction of the ray, and to put

$$\cos^2\theta = \rho N/p^2(1 - y)\cos^2 i.$$

(The basis of these approximations is due to Booker (1935).) Over most of the range
to be investigated an approximation similar to equation (2) is valid for the extraordinary
ray. Thus

$$N'' = 1{\cdot}15\,(1 - y)^{3/2}(1 - y_L)^{-1/2}\,p\cos^{1/2} i\,\sqrt{(3c\beta N_0/2\rho\nu_0)}. \qquad (5)$$

This gives, for the extraordinary ray, the value of N for which $\tau = 1$.




When the atmosphere is a reflector a treatment similar to that in (2·1·2) gives

$$N_2 = p^2\cos^2 i\,(1 \pm y)/\rho, \qquad (6)$$

the minus sign applying when $p > \omega$, and the plus when $p < \omega$. A complication occurs
in these cases, as compared with the ordinary ray, N being no longer simply proportional
to $p$ or $p^2$. This complication is usually unimportant in equation (5),
since it turns out that for a wide range of the high frequencies here involved $y \ll 1$,
so long as H has the magnitude appropriate to the sun's general magnetic field. For
the lower frequencies involved in equation (6), however, $y$ is found to be of the order
of 0·5, and it becomes necessary to solve the equation rigorously as a quadratic in $p$,
bearing in mind that $\omega$ and N are functions of $r$, the distance from the sun's centre.

The solution of equation (6) is

$$2p = \mp\omega \pm \sqrt{(\omega^2 + 4\rho N_2/\cos^2 i)},$$

in which the plus and minus signs may be taken in any combination. Clearly,
however, the negative sign before the radical must be discarded as it can only lead
to negative solutions. This leaves two possible solutions for $p$. It turns out that for
the values of N and $\omega$ appropriate to the corona and chromosphere (§ 3 below) both
solutions give $p > \omega$ at all levels. This rules out the use of the minus sign in front
of $\omega$. The only* valid solution is then

$$2p = \omega + \sqrt{(\omega^2 + 4\rho N_2/\cos^2 i)}.$$

This result is particularly satisfactory, as the approximation involved in equation
(6) is rather crude when $y > 1$.

In practical computing a series of values of $r$ are selected, N and $\omega$ being known
for each value of $r$. Representative values of $i$ (20, 40, 60, 80°) are taken, and
equation (6) solved for $p$ for each pair of values of $r$ and $i$. By graphical interpolation
the values of $N_2$ are then determined for any pair of values of $p$ and $i$.
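The computing procedure described above amounts to solving a quadratic in $p$ for each pair of values of N and $i$. A minimal sketch follows, taking the constant $4\pi e^2/m$ in electrostatic units and supplying the gyro-frequency directly as an angular frequency; the electron density and gyro-frequency used are illustrative round numbers, not values read from figure 1, and the function names are hypothetical.

```python
import math

E_CHARGE = 4.80320e-10   # electronic charge, e.s.u.
E_MASS = 9.10938e-28     # electronic mass, g
RHO = 4.0 * math.pi * E_CHARGE ** 2 / E_MASS   # the constant 4 pi e^2 / m


def extraordinary_p(density, omega, incidence_deg):
    """Positive root with p > omega: 2p = omega + sqrt(omega^2 + 4 rho N / cos^2 i)."""
    cos2 = math.cos(math.radians(incidence_deg)) ** 2
    return 0.5 * (omega + math.sqrt(omega ** 2 + 4.0 * RHO * density / cos2))


def turning_density(p, omega, incidence_deg):
    """Equation (6) with the minus sign: N = p^2 cos^2 i (1 - omega/p) / rho."""
    cos2 = math.cos(math.radians(incidence_deg)) ** 2
    return p * p * cos2 * (1.0 - omega / p) / RHO


density = 1.0e8   # electrons per c.c., illustrative
omega = 1.0e8     # radians per sec., illustrative

table = {i: extraordinary_p(density, omega, i) for i in (20, 40, 60, 80)}
for angle, p in table.items():
    print(angle, p, turning_density(p, omega, angle))
```

Substituting each computed $p$ back into equation (6) recovers the density that was assumed, which is a convenient check on the choice of signs.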

2·3·2. The absorptive power of the medium

From the discussion in (2·3·1) it follows that the optical depth $\tau_2$ of the turning
point for the extraordinary ray is given by

$$\tau_2 = 4\nu_0\cos^3 i\,p^2(1 - y)/3c\rho\beta N_0(1 - y_L). \qquad (7)$$

2·4. The validity of the application of Kirchhoff's law to
a magnetically anisotropic medium

In proving his law, Kirchhoff made use of the reciprocity theory of Helmholtz.
It is well known that theorems of this type break down when applied to magnetically
anisotropic media. It seems desirable therefore to examine carefully the application
of Kirchhoff's law to the solar envelope in the manner propounded in the present
paper. The proper application of the law requires the specification of the plane of
polarization.

* This conclusion differs from that of Saha (1946), who considers that both signs are
permissible when $p > \omega$. Most workers, including the present author, appear to consider that
heavy absorption prevents the occurrence of the second extraordinary component in such cases.




Consider a beam of radiation plane polarized in a direction $x$ at right angles to
the direction of propagation, and let it be incident upon the medium with an amplitude
$E\sin pt$. Let $y$ be another direction at right angles to $x$ and to the direction of
the beam. The plane-polarized beam may be split up into two equal circularly
polarized components having opposite senses of rotation, thus:

(i) $E_x = E/2 \cdot \sin\omega t$, $E_y = E/2 \cdot \cos\omega t$,

(ii) $E_x = E/2 \cdot \sin\omega t$, $E_y = -E/2 \cdot \cos\omega t$.

The first of these components may be considered to be propagated through the
medium as the ordinary component, and the second as the extraordinary component.

When the medium varies sufficiently slowly in its properties (a condition which
certainly is satisfied in the solar envelope), Booker (1936) has shown that these
two components can preserve their identity, each emerging after refraction with a
characteristic polarization. For conditions where $\nu \ll p > \omega$ this limiting polarization
is very nearly circular for all but a very narrow range of angles between the
emerging ray and the magnetic field direction (Martyn 1935b). Assuming circular
polarization, the emerging intensities may be written as

(iii) $E_{x'} = \rho_1E/2 \cdot \sin\omega t$, $E_{y'} = \rho_1E/2 \cdot \cos\omega t$,

(iv) $E_{x'} = \rho_2E/2 \cdot \sin\omega t$, $E_{y'} = -\rho_2E/2 \cdot \cos\omega t$,

where $\rho_1$ and $\rho_2$ are the attenuations of the ordinary and extraordinary components
respectively. The total average energy in the emerging ray is therefore proportional
to

$$\{(\rho_1 + \rho_2)^2 + (\rho_1 - \rho_2)^2\}E^2/8 = (\rho_1^2 + \rho_2^2)E^2/4.$$

Since the average energy in the incident ray is similarly proportional to $E^2/2$ it
follows that the absorptive power is

$$A = \tfrac{1}{2}\{(1 - \rho_1^2) + (1 - \rho_2^2)\}.$$

Expressing $\rho_{1,2}$, which are ratios of field intensities, in terms of optical depths,
this becomes

$$A = \tfrac{1}{2}\{(1 - e^{-2\tau_1}) + (1 - e^{-2\tau_2})\}.$$

By Kirchhoff's law it can now be asserted that if the emissive power I of the medium
be resolved into the two plane-polarized components $I_x$ and $I_y$, then

$$I_x = \tfrac{1}{2}AB_f,$$

where $B_f$ is the black-body emission at frequency $f$. Now A is clearly independent of
the direction chosen for $x$, since it is a function only of the optical depths of the two
independent circularly polarized components. It follows that

$$I_x = I_y$$

and

$$I = \tfrac{1}{2}B_f(A_1 + A_2),$$

where $A_{1,2}$ are the absorptive powers of the medium for the two components.
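The algebra leading from the emerging field components to the mean absorptive power can be checked numerically. In the sketch below the field attenuation of each component is taken as $e^{-\tau}$, so that its energy ratio is $e^{-2\tau}$; energies are expressed in units of $E^2$, and the function names and sample optical depths are assumptions made for illustration.

```python
import math


def absorptive_power_from_fields(tau_1, tau_2):
    """Absorptive power from the emerging field components."""
    rho_1 = math.exp(-tau_1)   # field attenuation, ordinary component
    rho_2 = math.exp(-tau_2)   # field attenuation, extraordinary component
    # Mean energy in the emerging ray, in units of E^2.
    emerging = ((rho_1 + rho_2) ** 2 + (rho_1 - rho_2) ** 2) / 8.0
    incident = 0.5             # mean energy in the incident ray, units of E^2
    return 1.0 - emerging / incident


def absorptive_power_mean(tau_1, tau_2):
    """Half the sum of the absorptive powers of the two components."""
    a_1 = 1.0 - math.exp(-2.0 * tau_1)
    a_2 = 1.0 - math.exp(-2.0 * tau_2)
    return 0.5 * (a_1 + a_2)


samples = [(0.05, 0.2), (0.5, 1.5), (3.0, 0.1)]
pairs = [(absorptive_power_from_fields(t1, t2), absorptive_power_mean(t1, t2))
         for t1, t2 in samples]
print(pairs)
```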

It seems clear that, for the particular conditions specified above, which result in
the limiting polarizations being circular, Kirchhoff's law can be applied separately
to each component. The total emission will then be half the sum of the two unequal
circularly polarized components so calculated.

It is satisfactory to note that Eckersley (1940) has experimentally verified the
reciprocity theorem for conditions similar to those specified above. It seems likely,
however, that the conclusions reached above would become invalid in cases where
$p < \omega$ and the limiting polarizations of each component become markedly elliptical.


3. The physical conditions in the chromosphere and corona 

For the theoretical deduction of thermal radiation from the solar envelope it is
necessary to know the temperature distribution, the pressure distribution, the
degree of ionization at the various levels, and the proportions of the constituent
gases. Available knowledge regarding these quantities in the corona has been summarized
recently (Alfvén 1941). Edlén's identification (1942) of the coronal lines
as due to highly stripped metallic atoms leaves little doubt that the coronal matter
is at a high temperature. Alfvén has supplied additional lines of reasoning, notably
that of the large 'scale-height' of the electron atmosphere in the corona, which make
it reasonably certain that the temperature approaches one million degrees in this
region. Since the surface of the photosphere is at approximately 6000° C there must
be a fairly sharp rise of temperature on passing through the chromosphere and inner
corona. The observations of Redman (1942), who deduced kinetic temperatures in
the chromosphere of order 30,000° C from a study of spectral line profiles, now fall
into place. The temperature distribution used in the present paper is based on
Alfvén's work for the corona, and on Redman's observations for the chromosphere:
it is given in figure 1.

The distribution of electron density is known from the work of Baumbach (1937),
who has analyzed the coronal isophotes from a number of eclipses. Baumbach's
results are shown also in figure 1. They do not extend, of course, so far inwards as the
chromosphere, and here appeal is made to Cillié & Menzel's theoretical calculations
(1935). The two sets of results have been joined smoothly, the junction occurring
in the vicinity of $N = 4 \times 10^8$ electrons per c.c. It seems unlikely that Baumbach's
results will be much modified by subsequent investigations, so long as it be remembered
that they represent average conditions only. On the other hand, Cillié &
Menzel took no account of the probable rise of temperature in the chromosphere,
and their results may have to be modified subsequently. For the purposes of this
paper the quantitative uncertainty here is probably unimportant.

The constituents can safely be assumed as a Russell mixture, viz. 28 parts of H,*
1 part of O type atoms, and 1 part of metals. At the coronal temperature the hydrogen
must be fully ionized, O type six times ionized, and metals ten times ionized. The
contribution of each type of ion to the electron-collision frequency is shown below
to be proportional to $n_1e_1^2$, where $n_1$ is the number density of the ion, and $e_1$ its charge.
Hence

H contributes 28 × 1² = 28,
O contributes 1 × 6² = 36,
Metals contribute 1 × 10² = 100,
total 164.

It will be sufficient, therefore, to work out the electron-collision frequency with
metallic ions and to increase the value obtained by two-thirds in order to allow for
the presence of the other ions.

* Note added, 22 January 1948. Recent developments in solar physics suggest that the
proportion of H may be much greater than here assumed. If this proves to be the case
then the values of $\nu$ calculated below must be reduced by a factor between 1 and 5. This
would produce only small displacements in the curves of figures 3 and 4.
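The weighting of the ions in the assumed mixture is simple arithmetic, set out below for convenience. The sketch takes the proportions and degrees of ionization stated in the text; the variable names are hypothetical.

```python
# (parts in the mixture, number of electrons removed) for each constituent,
# as stated in the text for coronal temperatures.
MIXTURE = {
    "H": (28, 1),
    "O type": (1, 6),
    "metals": (1, 10),
}

# Contribution to the collision frequency: number of ions x (charge)^2.
contributions = {name: parts * charge ** 2
                 for name, (parts, charge) in MIXTURE.items()}
total = sum(contributions.values())

# Factor by which the metallic-ion value must be multiplied.
factor = total / contributions["metals"]

# Free electrons supplied per metallic ion in the mixture.
electrons_per_metallic_ion = sum(parts * charge
                                 for parts, charge in MIXTURE.values())

print(contributions, total, factor, electrons_per_metallic_ion)
```

The factor of 1·64 is what the text rounds to an increase of two-thirds.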


[Figure 1: curves plotted on a logarithmic ordinate scale (1 to 1000) against height above photosphere ((r - r_0) cm.), from 0 to 3 × 10^10.]

Figure 1. N, distribution of electron densities (number of electrons/c.c. × 10^7); ν, electron
collision frequency (collisions/sec.); T, temperature (°K × 10^-…); ω, gyro-frequency (radians/
sec. × 10^-…), in corona.



Alfvén has given values of $\nu$ deduced by assuming a collision cross-section of
$10^{-17}$ cm.². This does not take account of the fact, however, that the coronal matter is
highly ionized, so that there are strong attractive forces between the electrons and
heavy particles. The importance of this factor in the solar atmosphere has been
stressed by Cowling (1945). It is readily deduced from the latter's work (equation
(35), loc. cit.) taken in conjunction with that of Chapman & Cowling (1939) on this
type of gas, that

$$\nu = \frac{2\sqrt{(2\pi)}\,n_1e^2e_1^2}{3m^{1/2}(kT)^{3/2}}\,A_1(2), \qquad (8)$$

where

$$A_1(2) = \log_e\left\{1 + \left(\frac{4dkT}{ee_1}\right)^2\right\},$$

$k$ is Boltzmann's constant, and $d$ is the mean distance between pairs of neighbouring
ions and electrons. $A_1(2)$ is, therefore, a slowly varying function of temperature and
pressure; it has the numerical values 25 in the corona, and 11 in the chromosphere.



It is of interest to compare the value of $\nu$ calculated from equation (8) for a given
level in the corona, with that given by Alfvén for the same level. At the level
$r = 7{\cdot}71 \times 10^{10}$, figure 1 shows that $N = 1{\cdot}55 \times 10^8$ cm.$^{-3}$. The Russell mixture gives
44 electrons for one metallic ion, so that $n_1 = 3{\cdot}52 \times 10^6$ cm.$^{-3}$. Inserting the other
numerical values in equation (8), it is found that $\nu = 20{\cdot}4$ collisions per sec. This has
to be increased by two-thirds to $\nu = 34$ per sec. in order to allow for the other types
of ion present. This is to be compared with Alfvén's estimate, $\nu = 2{\cdot}2$ per sec. at
the same level. It is clear that neglect of the electrostatic forces is not permissible.
The distribution of $\nu$ in the corona and chromosphere deduced in the above manner
is shown in figure 1.
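Some of the figures in this comparison can be retraced with a few lines of arithmetic. The sketch below computes the metallic-ion density and the scaled collision frequency from the numbers quoted above, and estimates the logarithmic term on three assumptions that are not taken from the text: that it has the form $\log_e\{1 + (4dkT/ee_1)^2\}$, that $d$ may be taken as the inverse cube root of the electron density, and that T is one million degrees. Constants are modern values in c.g.s. and electrostatic units.

```python
import math

K_BOLTZ = 1.380649e-16   # erg per degree
E_CHARGE = 4.80320e-10   # electronic charge, e.s.u.

electron_density = 1.55e8     # per c.c., value quoted in the text
temperature = 1.0e6           # degrees, assumed for this estimate
ion_charge_number = 10        # metals ten times ionized

# 44 electrons for one metallic ion in the assumed mixture.
metallic_ion_density = electron_density / 44.0

# Mean distance between neighbouring particles: assumed to be N^(-1/3).
mean_distance = electron_density ** (-1.0 / 3.0)

# Logarithmic term, in the form assumed in the lead-in.
ratio = (4.0 * mean_distance * K_BOLTZ * temperature
         / (E_CHARGE * ion_charge_number * E_CHARGE))
log_term = math.log1p(ratio ** 2)

# Allowance for the other ions: increase the metallic value by two-thirds.
nu_metals = 20.4              # collisions per sec., value quoted in the text
nu_all_ions = nu_metals * (1.0 + 2.0 / 3.0)

print(metallic_ion_density, log_term, nu_all_ions)
```

On these assumptions the logarithmic term comes out close to the value of 25 quoted for the corona, which is some reassurance that the slowly varying factor has been read correctly.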

In calculating the absorption and locating the emission levels by equations (1),
(2), (4) and (7) it will be observed that the solar variables appear always in the
relationship $\beta N_0/\nu_0$. It is clear from equation (8) that $N_0/\nu_0$ will vary markedly with
T, so that this ratio will have very different values in the corona and chromosphere.
The scale height $1/\beta$ varies, however, in such a way as to keep $\beta N_0/\nu_0$ roughly constant.
Thus, the latter quantity is found to have the numerical values $2{\cdot}4 \times 10^{-4}$ in
the outer corona, $5{\cdot}6 \times 10^{-4}$ in the inner corona, and $8 \times 10^{-4}$ at the base of the
chromosphere. The small variation of $\beta N_0/\nu_0$ over such a wide range of conditions
is justification for the use of the simple formulae in equations (4) and (7) for computing
the absorption over regions where $\beta$ and $N_0/\nu_0$ individually are changing
appreciably.

There appears to be little doubt that the sun possesses a general magnetic field 
whose influence extends into the corona. The strongly suggestive appearance of the 
polar tufts in this connexion is now supported by measurement (Thiessen 1946). 
This investigator finds the polar value of H to be 50 gauss. The corresponding values 
of the gyro-frequency are shown in figure 1 . In calculating these it has been assumed 
that the distribution of H in the solar atmosphere is similar to that in the earth’s. 
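
As a rough illustration of the magnitudes involved, the gyro-frequency at the pole can be evaluated from the standard Gaussian-unit expression eH/mc. This is a sketch only: the physical constants and the function name are supplied here and are not taken from the paper, and the variation of H with height is not modelled.

```python
import math

# Physical constants in Gaussian (c.g.s.) units; values supplied for this sketch.
E_CHARGE = 4.803e-10     # electron charge, e.s.u.
M_ELECTRON = 9.109e-28   # electron mass, g
C_LIGHT = 2.998e10       # velocity of light, cm/sec


def gyro_frequency(h_gauss):
    """Angular gyro-frequency eH/(mc) in radians/sec for a field of h_gauss."""
    return E_CHARGE * h_gauss / (M_ELECTRON * C_LIGHT)


omega_pole = gyro_frequency(50.0)         # polar field of 50 gauss (Thiessen 1946)
f_pole = omega_pole / (2 * math.pi)       # corresponding frequency in cycles/sec

print(f"omega = {omega_pole:.3g} radians/sec")
print(f"f     = {f_pole:.3g} cycles/sec")
```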

4. Treatment of data 

For the purpose of calculating the solar emission the disk of the sun was divided 
into four or more annuli. These were chosen so that the radiation received at the 
earth emerged at mean angles of 20, 40, 60 and 80° respectively to the solar radius. 
This leaves a small central disk for which i is approximately zero. For each frequency 
the levels from which the radiation emanated were determined by the use of 
equation (1). In the lower range of frequencies, as explained above, τ < 1, and the 
appropriate levels were determined from equations (3) and (6). For high frequencies and 
small values of i (the inner annuli), it was usually possible to employ equations (2) 
and (5) instead of (1), thus greatly reducing the labour of computation. 

The levels determined for a range of the lower frequencies are shown in figure 2 
for the ‘ordinary’ and ‘extraordinary’ radiation respectively. At the largest value 
of i (80°) the results can only be rough, since towards the limb of the sun the 
approximation involved in treating the surfaces of the annuli as plane will break down. The 
condition μ sin i = constant, used for ray tracing, should here be replaced strictly 
by μr sin i = constant. It has not been thought necessary to treat this complication 
in the present paper. 



Figure 2. Coronal emission levels for solar radiation, ordinary (o) and 
extraordinary (e) at frequencies of 30, 75 and 200 Mcyc./sec. 

The complete procedure in computing the radiation is now as follows. At the 
given frequency the average level from which the radiation emanates is determined 
for each annulus as described above. This determines the value of T for each annulus. 
When the level has been determined by the condition τ = 1, then T is also the effective 
temperature T_e of this annulus, considered as a full radiator at the chosen frequency. 
For values of f and i which give τ < 1 at this level, the absorptive power (1 − e^{−2τ}) is 
computed from equation (1). In such cases, the effective temperature of the annulus 
considered as a full radiator becomes T(1 − e^{−2τ}) by Kirchhoff’s law. 
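
The last step of this procedure can be put in the form of a minimal sketch. The function name and the sample values of T and τ are hypothetical; only the expression T(1 − e^{−2τ}) is taken from the text.

```python
import math


def effective_temperature(temperature, tau):
    """Effective temperature of an annulus treated as a full radiator.

    temperature : kinetic temperature T at the emission level (degrees K)
    tau         : quantity tau of the text, so that the absorptive power
                  is 1 - exp(-2 * tau)
    """
    absorptive_power = 1.0 - math.exp(-2.0 * tau)
    return temperature * absorptive_power


# Hypothetical example: a coronal level at 10^6 degrees seen through
# progressively thinner layers.
for tau in (1.0, 0.5, 0.1):
    print(tau, round(effective_temperature(1.0e6, tau)))
```

For small τ the effective temperature falls nearly in proportion to τ, which is the behaviour responsible for the reduced emission where the absorbing layer is thin.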

Proceeding in this manner the values of T_e for a number of frequencies were 
computed for the ordinary radiation. The results are shown in figure 3 for various 
frequencies selected to illustrate all the typical distributions of T_e across the disk, 
within the radio spectrum. In figure 3, T_e is plotted, for each frequency, as a function 
of the distance from the centre of the solar disk. It should be noted that in this 
figure r_0, the solar radius, has not necessarily the value measured by optical methods. 
The precise definition of r_0 would require a higher degree of approximation, at the 
glancing angles from the solar limb, than is used in this paper. It is observed that 
the lower frequencies show ‘limb darkening’, although, of course, this occurs for 
a physical reason quite different from that which explains the same phenomenon in 
the visible region of the spectrum. At wave-lengths in the neighbourhood of 1 m. 
the disk is uniformly bright, but at wave-lengths only slightly shorter, pronounced 
‘limb brightening’ appears. This, of course, is due to the fact that the radiation in 
the middle of the disk is emanating from the chromosphere, which is at a much lower 
temperature than the corona. At the glancing angles from the limb the radiation 
still comes from the corona. 




It is now an easy matter to compute for each frequency the average value of T_e 
over the whole disk. This was done by assigning to each annulus a weighting factor 
proportional to the projected area of the annulus perpendicular to the line of sight. 
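
A sketch of such an area-weighted average is given below. It assumes that each annulus is specified by its inner and outer radii in units of r_0, so that its projected area is proportional to the difference of the squares of these radii; the boundaries and temperatures in the example are hypothetical and are not values from the paper.

```python
def disk_average(t_e_values, edges):
    """Area-weighted mean of T_e over concentric annuli of the projected disk.

    t_e_values : effective temperature assigned to each annulus
    edges      : annulus boundaries in units of r_0, from 0 (centre) to 1 (limb);
                 must contain one more entry than t_e_values
    """
    if len(edges) != len(t_e_values) + 1:
        raise ValueError("need one more edge than annuli")
    # Projected area of an annulus is proportional to r_outer^2 - r_inner^2.
    weights = [outer**2 - inner**2 for inner, outer in zip(edges[:-1], edges[1:])]
    weighted_sum = sum(w * t for w, t in zip(weights, t_e_values))
    return weighted_sum / sum(weights)


# Hypothetical limb-brightened distribution (illustrative numbers only).
edges = [0.0, 0.3, 0.6, 0.85, 1.0]
t_e = [2.0e4, 3.0e4, 8.0e4, 4.0e5]
print(round(disk_average(t_e, edges)))
```

Because the outer annuli carry the larger weights, a bright limb raises the disk average considerably more than an equally bright centre would.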

Figure 3. Variation of T_e across solar disk at various wave-lengths (r = r_0 at limb). 
Curves are shown for λ = 30 m., 10 m., 1.5 m., 1 m., 60 cm., 40 cm. and 20 cm.; the 
abscissa is r/r_0, from 0 to 1.0. 

Figure 4 shows the resulting distribution of T e for the range of frequencies under 
consideration. T e is seen to have its maximum value at a wave-length of about 1 m. 

The ‘extraordinary’ emission has been derived from the above computations of 
the ordinary emission by the following approximate treatment. 

Consider first the higher frequencies where equation (2) is applicable to the 
ordinary radiation and equation (5) to the extraordinary. Since the solar envelope 
is a full radiator for these cases the only difference in emission arises from the different 
temperatures at the different emission levels for each component. At high frequencies 
(λ < 10 cm.) the accurate solution of equation (6) shows that y < 1 at the turning 
point. Since the level τ = 1 is above this point, and ω decreases upward, it follows 
that y ≪ 1 at the level of emission. The ordinary and extraordinary radiations are 
therefore equal at such frequencies. For wave-lengths of 1 m. and upwards, accurate 
solution of equation (6) shows that y varies very little from the value 0.5. A weighted 
average value for y_L over the disk at λ = 1 m. is about 0.17. Hence (1 − y)* (1 − 
is 0.65. The extraordinary emission at frequency f is therefore equal to the ordinary 
emission at frequency 0.65f. For the lower frequencies (λ > 1 m.) the temperatures 
at the emission levels for the ordinary and extraordinary radiation are practically 
the same, but the emission differs because of the different absorption coefficients 
expressed in equations (4) and (7). Proceeding in a manner similar to that described 
in the preceding paragraph it can readily be shown that the extraordinary emission 
at frequency f is equal to the ordinary at frequency 0.77f. In this manner the 
extraordinary radiation has been derived from the previously computed ordinary 
radiation. It is shown by the dashed line in figure 4. 



Figure 4. Effective temperature of the sun considered as a full radiator, in the radio spectrum, 
for the ordinary and extraordinary components, plotted against wave-length. The total emission 
at wave-length λ is proportional to [T_e(ord.) + T_e(extraord.)] × λ^-2. Full line, ordinary 
radiation; dashed line, extraordinary radiation. 


5. Comparison of theoretical results with observations 

The value of T (20,000°) at 10 cm. measured by Southworth (1944) appears to 
lie on the theoretical curve of figure 4. At a wave-length of 1.5 m. Pawsey (1946) 
finds a background of noise which he interprets as thermal radiation corresponding 
to T = 10^6 degrees. Dicke & Beringer (1946) found T = 10^4 degrees at 1 cm. All 
these results agree satisfactorily with the theoretical curve in figure 4. Similar 
observations at wave-lengths near 50 cm. would be of considerable value in testing the 
theory. At the longer wave-lengths (> 1 m.) it may prove difficult to separate the 
thermal radiation from that associated with sunspots. 





Observations of the variation of brightness across the solar disk would provide 
a valuable test of the theory. Covington (1947) has studied the effect of the partial 
solar eclipse of 23 November 1946 on solar noise at a wave-length of 10.7 cm. When 
the radiation from an active spot group has been eliminated from his results it 
appears that the curve of diminution of noise during the eclipse is considerably 
shallower than would be expected theoretically if the noise emanated uniformly 
from the disk. There is also some evidence of an unexpected large drop in intensity 
near first contact. Both these results are to be anticipated if the above conclusions 
regarding limb brightening at such wave-lengths are correct.* 


6. Radiation deduced from Kirchhoff’s law compared with 
that from the theory of free-free transitions 

Reber (1940) interpreted galactic radio radiation as due to free-free transitions of 
electrons in the field of protons in interstellar space. This idea was further developed 
by Henyey & Keenan (1940), and has been applied to solar radio noise by 
Greenstein (1947). The latter has expressed doubts about the validity of the Kirchhoff 
treatment (Martyn 1946), which is used also in the present paper, and advances 
arguments tending to show that thermal solar radiation will not decrease at low 
frequencies as is indicated in figure 4. 

Since the problem is essentially a classical one (hf ≪ kT), the correspondence 
principle should apply, and both treatments should give almost identical results. 
The present section is devoted to testing this point, and to a consideration of the 
relative advantages of the two treatments for calculating emission. 

The classical absorption coefficient κ used in the present paper is due to Lorentz 
and is given by 

$$\kappa = \frac{\nu}{c\mu}\,\frac{Ne^2}{\pi m f^2}.$$ (9) 


This expression for κ has been used in the present paper to describe absorption 
of energy both in the deviating region (μ ≠ 1) and in the non-deviating region (μ = 1). 
Workers who have applied quantum mechanics to the problem do not appear to 
have yet considered media where N is large enough to cause μ to depart appreciably 
from unity, so, for comparison with their results, μ is put equal to 1. A neutral 
electron-proton gas is considered. In the above expression for κ, ν is now eliminated 
by the use of equation (8). This equation is valid only for binary encounters, and for 
an electron-proton gas it is readily verified that under such conditions (4dkT/e²)² ≫ 1, 
and d = 1/(2N)^{1/3}. Hence 

$$\kappa\ (\text{Lorentz, Chapman}) = \frac{4\sqrt{2}\,N^2e^6}{3\sqrt{\pi}\,c\,(mkT)^{3/2}f^2}\,\log_e\left\{\frac{4kT}{e^2(2N)^{1/3}}\right\}.$$ (10) 


* Results similar to Covington’s, for another eclipse, have just been reported by Sander 
(Nature, 12 April 1947 ), who considers that they show limb brightening. 




The above expression has been derived on purely classical lines from simple con¬ 
siderations of the absorption of a plane wave due to collisions (Lorentz), combined 
with Chapman & Cowling's first approximation to the diffusion coefficient in a fully 
ionized gas. In comparing it with that derived by quantum mechanical methods it 
is necessary to give precision to the definitions of intensity of radiation, etc., and to 
consider carefully the form of the radiation field. For this purpose, the definitions 
and terminology used by Milne (1930) are followed throughout. The equation of 
transfer of radiation can be written (Milne 1930, p. 161 ) as 

$$\frac{dI_f}{ds} = -n'\alpha_f I_f + n''\frac{q'}{q''}\alpha_f(\sigma + I_f),$$ (11) 

where α_f is the absorption coefficient of an electron-proton pair in the absorbing state, I_f is 
the intensity of radiation, ds is an element of path along the direction considered, 
n′, n″ are the numbers of ion pairs per c.c. in the absorbing and emitting states 
respectively, q′, q″ are the statistical weights of these two states, and σ = 2hf³/c². 

This equation is to be compared with that derived on the simple Kirchhoff 
theory, viz. 

$$\frac{dI_f}{ds} = -\kappa I_f + \kappa B_f.$$ (12) 

(This simply expresses the fact that the gain of intensity along the pencil of 
radiation is the difference between the intensities emitted and absorbed.) 

For comparison with the classical absorption coefficient it is clear that κ is the 
quantity to be determined. For when plane waves are considered I_f ≫ B_f (I_f becomes 
infinite along the direction of propagation of a plane wave, but this, of course, 
presents no difficulty in practice, since such a wave is physically unrealizable). 
Hence, for plane, or nearly plane, waves, the solution of equation (12) becomes 

$$I_f \propto e^{-\kappa s},$$ 

showing that κ is the quantity to be compared with the Lorentzian absorption 
coefficient. For conditions of local thermodynamic equilibrium equations (11) and 
(12) must be identical, so that 

$$\kappa = \alpha_f\,(n' - n''q'/q'').$$ 

But by Boltzmann’s equation 

$$n''q'/n'q'' = e^{-hf/kT},$$ 

so that 

$$\kappa = n'\alpha_f\,(1 - e^{-hf/kT}).$$ (13) 

Now n′α_f, when summed over all the (continuous) absorbing states, is the true 
absorption coefficient of the medium. At radio-frequencies, however, the quantity 
in round brackets in equation (13) is very small (about 10 ?), so that κ is very much 
smaller than the true absorption coefficient. The physical significance of this is 
clear. At these frequencies, the Einstein stimulated emission is much larger than the 
spontaneous emission. Moreover, as Dirac (1926) has pointed out, the stimulated 
emission must have the same direction as the stimulating field. Hence, at 
radio-frequencies, the effective absorption coefficient is very much less than the true 
absorption coefficient, owing to the presence of the large stimulated field. 
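
The smallness of the bracketed factor 1 − e^{−hf/kT} at radio-frequencies can be illustrated numerically. The sketch below is not part of the original treatment: the constants, the function name and the sample frequency and temperature are supplied here for illustration only.

```python
import math

H_PLANCK = 6.626e-27   # Planck's constant, erg sec
K_BOLTZ = 1.381e-16    # Boltzmann's constant, erg per degree


def stimulated_factor(frequency, temperature):
    """Return 1 - exp(-hf/kT), the factor reducing the true absorption
    coefficient to the effective one."""
    x = H_PLANCK * frequency / (K_BOLTZ * temperature)
    # expm1 keeps full precision when x is very small.
    return -math.expm1(-x)


# Illustrative case: 100 Mcyc./sec. in a corona assumed to be at 10^6 degrees.
factor = stimulated_factor(1.0e8, 1.0e6)
print(f"1 - exp(-hf/kT) = {factor:.3g}")
```

For hf ≪ kT the factor is practically equal to hf/kT itself, which is the approximation used in passing from the quantum expressions to the classical limit.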

The emission from the encounter of electrons and protons was first calculated 
by Kramers (1923) on classical lines. Cillié (1932) gave a quantum treatment. The 
latter’s expression for the total emission is 

$$E_f\,df = \frac{2^6\pi^2N^2e^6}{3(6\pi)^{1/2}m^{3/2}c^3(kT)^{1/2}}\,e^{-hf/kT}\,df\ \ \text{ergs/c.c./sec.}$$ 

This must be divided by 4π in order to obtain the intensity of emission. The 
later wave-mechanical treatment of Gaunt (1930) adds a correction factor g, given by 

$$g = \frac{\sqrt{3}}{\pi}\log_e(3kT/2hf).$$ 


There has been some controversy whether the above expression for E f includes 
the stimulated emission. Eddington (1923) has argued that it does, but Gaunt 
advances powerful reasons to the contrary. It will be found below that agreement 
between the classical and quantum treatments of the present problem is possible 
only if Gaunt’s view is correct. 

Assuming the latter viewpoint, the intensity of the unstimulated emission can 
be written as 


E f 


V3 2 fi 7riV* 
(6*)* 




( 14 ) 


But, by equation (11), 


( 15 ) 


Hence, by equations (13), (14) and (15), and remembering that hf/kT ≪ 1, 

$$\kappa\ (\text{Kramers, Gaunt}) = \frac{4\sqrt{2}\,N^2e^6}{3\sqrt{\pi}\,c\,(mkT)^{3/2}f^2}\,\log_e(3kT/2hf).$$ (16) 

The agreement between the classical and the quantum expressions for κ is clearly 
very close, as it should be. The absorption coefficients are identical save that the 
slowly varying function log_e{4kT/e²(2N)^{1/3}} arising from the kinetic theory of an 
electron-proton gas, is replaced by the slowly varying function log_e(3kT/2hf) 
which arises from the Gaunt correction factor. Over the range of gas pressures, and 
radio-frequencies which are concerned in observations of galactic and solar radio 
noise, the ratio of these functions differs little from unity. Both formulae must break 
down at high densities, when encounters can no longer be considered as binary. The 
Lorentz-Chapman formula must break down also at very low densities and high 
frequencies since the Lorentz theory applies only when there are large numbers of 
electrons in a cubic wave-length. Over the range of conditions discussed in the 
present paper either treatment can be used. There are many advantages, however, 
in the use of the macroscopic concepts such as refractive index and absorption 




Solar radiation in the radio spectrum 59 

coefficient which are employed in the present paper. Since conditions are en¬ 
countered where // 1, any theory based entirely on atomic concepts would have to 

consider the combined effect of the scattered field from the assemblage. It would 
seem to be unnecessary to proceed in this manner in dealing with a classical problem 
for which classical methods are adequate and relatively simple. 
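
The two slowly varying logarithmic functions compared in this section can be evaluated side by side. The following is a sketch under stated assumptions: the physical constants, the function names and the sample conditions (T = 10^6 degrees, N = 1.55 × 10^8 cm.^-3, f = 100 Mcyc./sec.) are chosen here for illustration, and the arguments of the logarithms are taken as they read in the expressions above.

```python
import math

K_BOLTZ = 1.381e-16    # Boltzmann's constant, erg per degree
H_PLANCK = 6.626e-27   # Planck's constant, erg sec
E_CHARGE = 4.803e-10   # electron charge, e.s.u.


def log_classical(temperature, n_electron):
    """log_e{4kT / e^2 (2N)^(1/3)}, from the kinetic theory of the gas."""
    mean_distance = (2.0 * n_electron) ** (-1.0 / 3.0)
    return math.log(4.0 * K_BOLTZ * temperature * mean_distance / E_CHARGE**2)


def log_quantum(temperature, frequency):
    """log_e(3kT / 2hf), from the Gaunt correction factor."""
    return math.log(3.0 * K_BOLTZ * temperature / (2.0 * H_PLANCK * frequency))


classical = log_classical(1.0e6, 1.55e8)
quantum = log_quantum(1.0e6, 1.0e8)
ratio = classical / quantum
print(f"classical = {classical:.2f}, quantum = {quantum:.2f}, ratio = {ratio:.2f}")
```

Since the remaining factors of the two absorption coefficients are the same, this ratio is also the ratio of the coefficients themselves for the conditions assumed.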

This work has been carried out as part of the research programme of the Council 
for Scientific and Industrial Research, at the Commonwealth Observatory, Canberra. 
It is a pleasure to acknowledge the many valuable suggestions and criticisms received 
from Dr R. v. d. R. Woolley and Dr C. W. Allen. The author also wishes to 
acknowledge the benefit from frequent discussion of solar noise problems with Dr J. L. 
Pawsey and the staff of the Radiophysics Laboratory of the Council. 


References 

Alfvén 1941 Ark. Mat. Astr. Fys. 27 A, Häfte 4. 
Appleton 1932 J. Instn Elect. Engrs, 71, 642. 
Appleton 1945 Nature, 156, 534. 
Baumbach 1937 Astr. Nach. 263, 121. 
Booker 1935 Proc. Roy. Soc. A, 150, 267. 
Booker 1936 Proc. Roy. Soc. A, 155, 235. 
Chapman & Cowling 1939 Mathematical theory of non-uniform gases, p. 177. Camb. Univ. Press. 
Cillié 1932 Mon. Not. R. Astr. Soc. 92, 820. 
Cillié & Menzel 1935 Harvard Coll. Obs. Circ. no. 410. 
Covington 1947 Nature, 159, 405. 
Cowling 1945 Proc. Roy. Soc. A, 183, 453. 
Dicke & Beringer 1946 Astrophys. J. 103, 375. 
Dirac 1926 See J. Woltjer, Versl. Akad. Amsterdam, 35, 888. 
Eckersley 1940 Nature, 145, 222. 
Eddington 1923 Mon. Not. R. Astr. Soc. 84, 104. 
Edlén 1942 Ark. Mat. Astr. Fys. 28 B, Häfte 1. 
Gaunt 1930 Phil. Trans. A, 229, 163. 
Greenstein 1947 Observatory, 67, 22. 
Henyey & Keenan 1940 Astrophys. J. 91, 625. 
Kramers 1923 Phil. Mag. 46, 836. 
Martyn 1935a Proc. Phys. Soc. 47, 328. 
Martyn 1935b Phil. Mag. 19, 376. 
Martyn 1946 Nature, 158, 632. 
Milne 1930 Handb. Astrophys. 3, pt. 1, 84. Berlin: Springer. 
Pawsey 1946 Nature, 158, 633. 
Pawsey, Payne-Scott & McCready 1946 Nature, 157, 158. 
Reber 1940 Proc. Inst. Rad. Eng. 28, 68. 
Reber 1944 Astrophys. J. 100, 279. 
Redman 1942 Mon. Not. R. Astr. Soc. 102, 140. 
Saha 1946 Nature, 158, 549. 
Southworth 1944 J. Franklin Inst. 239, 285. 
Southworth 1946 J. Franklin Inst. 241, 167. 
Thiessen 1946 Observatory, 66, 230. 



A possible mode of propagation of the ‘slow’ or tail 
component in atmospherics 

By A. L. Hales, The Bernard Price Institute for Geophysical Research, 
University of the Witwatersrand 

(Communicated by B. F. J. Schonland, F.R.S. Received 18 July 1947) 


In this paper a possible surface wave mode of propagation for the slow or tail component in 
an atmospheric is discussed. Surface wave solutions for the propagation of electromagnetic 
waves between infinite plane conductors are set up and it is shown that if one of the layers is 
of finite conductivity there is a solution which corresponds qualitatively with the observed 
velocities and periods of the slow component. 


1. Introduction 

Oscillograms of atmospherics show three distinct phases: (a) a leader process, 
(b) an ‘oscillatory’ component and (c) a ‘slow’ component (Appleton & Chapman 
1937; Watson-Watt, Herd & Lutkin 1937). In night atmospherics the oscillatory 
component takes the form of a series of pulses (Schonland, Elder, Hodges, Phillips 
& van Wyk 1940, figure 1, p. 185), and it has been shown (loc. cit.) that, as was 
originally suggested by Laby et al. (1937), these pulses represent reflexions from an 
ionospheric layer at a height of approximately 90 km. 

For daylight atmospherics the pulses merge and the atmospheric appears as a 
damped wave train of gradually increasing wave-length (see figure 1). Nevertheless 
the spacing of the waves conforms to the formula given by Laby for ionospheric 
reflexions from a layer at a height of about 60 km. (Schonland et al. 1940, p. 202). 



Figure 1. Typical daylight atmospheric. 

The slow component does not fit into this picture and Dr Schonland suggested 
to me in 1939 that it was possible that the slow component was propagated as a 
surface wave. It is the purpose of this paper to discuss this suggestion. 



2. Observational material 

The observational material used in the discussion of the oscillatory component 
in §4 consisted of oscillograms of daylight atmospherics recorded by Schonland 
and his associates. The distances were determined from direction finder (D.F.) 
observations at Johannesburg and Durban. Comparatively few records of the slow 
component were obtained, possibly because the amplifiers were not effective at 
the frequencies of 100 to 500 cycles characteristic of this component. For the 
slow component reference must be made to the paper by Watson-Watt et al. 
(1937). (References to the figures of this paper will be prefixed by W., e.g. W. 
figure 1.) For the discussion in § 6, a knowledge of the velocity-frequency relation is 
desirable. This is not directly available but W. figure 10 gives (a) the duration of the 
first half-cycle, and (b) the interval between the times of arrival of the oscillatory 
and slow components, plotted against distance. These two curves are reproduced 
as figure 2 of the present paper. In addition W. figure 7 , reproduced (with some 
additions) as figure 3, gives a scatter diagram of the interval between the times of 
arrival plotted against distance. 



Figure 2. a, Interval between oscillatory and slow components (msec.) and b, duration of 
first half-cycle of slow component (msec.) as functions of the distance (Watson-Watt et al. 
1937). 

These figures will be discussed in greater detail in § 6. For the present it may be 
noted that the length of the first half-cycle varies from about 1.5 to 6 msec., and that, 
assuming that the onset of the oscillatory component travels with the velocity of 
light, the velocity of the slow component, estimated from the observations at 
4000 km., ranges from c/1.1 to c/1.3. 






Figure 3. Scatter diagram of interval between the oscillatory and slow components 
(Watson-Watt et al. 1937). 


3. General theory 

Watson (1918) has given a complete solution of the problem of the propagation 
of electromagnetic waves between two media, either perfectly conducting, or with 
finite conductivity. This solution is more general than, and should include, those 
given below. Before this reference was found, however, the surface wave solutions 
given here had been set up and it is thought that they are more convenient than the 
general solution for the calculation of group velocities. 

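In Gaussian units the Maxwellian equations are 

$$\left(\frac{\epsilon}{c}\frac{\partial}{\partial t} + \frac{4\pi\sigma}{c}\right)\mathbf{E} = \operatorname{curl}\mathbf{H},$$ (3.1) 

$$\operatorname{div}\mathbf{E} = 0,$$ (3.2) 

$$-\frac{\mu}{c}\frac{\partial\mathbf{H}}{\partial t} = \operatorname{curl}\mathbf{E},$$ (3.3) 

$$\operatorname{div}\mathbf{H} = 0.$$ (3.4) 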


μ will be taken to be unity for the earth, atmosphere, and ionosphere. The effect 
of the curvature of the earth will be neglected and the direction of propagation 
taken as the axis of x, the z axis being vertical. The solutions sought are such that 
the amplitude decreases exponentially with distance from the interface in the 
earth and ionosphere, and is a sine or cosine function of z in the atmosphere. In 
general there are two independent solutions of this type, namely (a factor 
e^{iω(t − x/v)} has been omitted throughout): 


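(a) Vertical polarization 

$$E_{0x} = A_0e^{s_0z},\quad E_{0y} = 0,\quad E_{0z} = \frac{iA_0\omega}{s_0v}\,e^{s_0z},$$ (3.5) 

$$H_{0x} = 0,\quad H_{0y} = \frac{icA_0}{s_0\omega}\left(s_0^2 - \frac{\omega^2}{v^2}\right)e^{s_0z},\quad H_{0z} = 0,$$ (3.6) 

provided that 

$$p_0^2\omega^2/c^2 = \omega^2/v^2 - s_0^2;$$ (3.7) 

$$E_{1x} = A_1\sin s_1z + B_1\cos s_1z,\quad E_{1y} = 0,\quad E_{1z} = \frac{i\omega}{s_1v}\,(-A_1\cos s_1z + B_1\sin s_1z),$$ (3.8) 

$$H_{1x} = 0,\quad H_{1y} = \frac{ic}{s_1\omega}\,(A_1\cos s_1z - B_1\sin s_1z)\left(s_1^2 + \frac{\omega^2}{v^2}\right),\quad H_{1z} = 0,$$ (3.9) 

provided that 

$$p_1^2\omega^2/c^2 = \omega^2/v^2 + s_1^2;$$ (3.10) 

$$E_{2x} = A_2e^{-s_2z},\quad E_{2y} = 0,\quad E_{2z} = -\frac{iA_2\omega}{s_2v}\,e^{-s_2z},$$ (3.11) 

$$H_{2x} = 0,\quad H_{2y} = -\frac{icA_2}{s_2\omega}\left(s_2^2 - \frac{\omega^2}{v^2}\right)e^{-s_2z},\quad H_{2z} = 0,$$ (3.12) 

provided that 

$$p_2^2\omega^2/c^2 = \omega^2/v^2 - s_2^2,$$ (3.13) 

where p² has been written for ε − 4πiσ/ω, and the suffixes 0, 1 and 2 refer respectively 
to the earth, atmosphere and ionosphere. 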





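(b) Horizontal polarization 

$$E_{0x} = 0,\quad E_{0y} = C_0e^{s_0z},\quad E_{0z} = 0,$$ (3.14) 

$$H_{0x} = -\frac{icC_0s_0}{\omega}\,e^{s_0z},\quad H_{0y} = 0,\quad H_{0z} = \frac{c}{v}\,C_0e^{s_0z},$$ (3.15) 

subject to the condition (3.7); 

$$E_{1x} = 0,\quad E_{1y} = C_1\sin s_1z + D_1\cos s_1z,\quad E_{1z} = 0,$$ (3.16) 

$$H_{1x} = -\frac{ics_1}{\omega}\,(C_1\cos s_1z - D_1\sin s_1z),\quad H_{1y} = 0,\quad H_{1z} = \frac{c}{v}\,(C_1\sin s_1z + D_1\cos s_1z),$$ (3.17) 

subject to the condition (3.10); 

$$E_{2x} = 0,\quad E_{2y} = C_2e^{-s_2z},\quad E_{2z} = 0,$$ (3.18) 

$$H_{2x} = \frac{icC_2s_2}{\omega}\,e^{-s_2z},\quad H_{2y} = 0,\quad H_{2z} = \frac{c}{v}\,C_2e^{-s_2z},$$ (3.19) 

subject to the condition (3.13). 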

The boundary conditions are that the tangential components of electric and 
magnetic force are continuous. These conditions lead to the equations of consistency. 

(a) Vertical polarization 

$$\tan s_1h = \frac{p_1^2s_1\,(p_2^2s_0 + p_0^2s_2)}{p_0^2p_2^2s_1^2 - p_1^4s_0s_2},$$ (3.20) 

(b) Horizontal polarization 

$$\tan s_1h = \frac{s_1(s_0 + s_2)}{s_1^2 - s_0s_2},$$ (3.21) 

where h is the height of the ionosphere. 




Equations (3.20), (3.7), (3.10) and (3.13) can be solved to find v in terms of ω. 
In general the solutions must be carried out numerically and thus it is desirable to 
express the equations in forms which do not involve h explicitly. In addition p² 
for the atmosphere will be put equal to 1. 

Substituting 

$$s = \frac{\omega\alpha}{c},$$ (3.22) 

$$\gamma = \frac{c}{\omega h} = \frac{c\tau}{2\pi h},$$ (3.23) 

(3.7), (3.10) and (3.13) become 

$$\alpha_0^2 = 1 - p_0^2 - \alpha_1^2,$$ (3.24) 

$$\alpha_2^2 = 1 - p_2^2 - \alpha_1^2,$$ (3.25) 

$$\frac{c^2}{v^2} = 1 - \alpha_1^2,$$ (3.26) 

and (3.20), (3.21) 

$$\tan\frac{\alpha_1}{\gamma} = \frac{\alpha_1\,(p_2^2\alpha_0 + p_0^2\alpha_2)}{p_0^2p_2^2\alpha_1^2 - \alpha_0\alpha_2},$$ (3.27) 

$$\tan\frac{\alpha_1}{\gamma} = \frac{\alpha_1(\alpha_0 + \alpha_2)}{\alpha_1^2 - \alpha_0\alpha_2}.$$ (3.28) 


4. Earth and ionosphere perfectly conducting 

If the earth and the ionosphere are assumed to be perfectly conducting both (3.27) 
and (3.28) reduce to 

$$\sin\frac{\alpha_1}{\gamma} = 0,\quad\text{i.e.}\quad\frac{\alpha_1}{\gamma} = \pi n.$$ (4.1) 

From (3.26) 

$$\frac{c^2}{v^2} = 1 - n^2\pi^2\gamma^2.$$ (4.2) 

The group velocity, u, is then given by 

$$\frac{c}{u} = c\,\frac{d(\omega/v)}{d\omega} = \frac{1}{(1 - n^2\pi^2\gamma^2)^{1/2}}.$$ (4.3) 

A formula equivalent to (4.3) was derived by Haubert (1946). 

In figure 4, (c/u − 1) is plotted against γ for the fundamental mode, n = 1. 

The solution fails for nπγ > 1, i.e. τ > 2h/cn, 
or taking h = 75 km., n = 1, τ > 0.5 msec. This solution therefore does not cover 
the range of periods of the slow components. The periods are those of the oscillatory 
component and the calculated frequency-group velocity relationship was therefore 
compared with that observed in daylight atmospherics. 
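
Equations (3.23) and (4.3), together with the limiting period τ = 2h/cn, lend themselves to a short numerical sketch. The function names and the sample period are illustrative and are not taken from the paper; the velocity of light is taken as 2.998 × 10^5 km./sec.

```python
import math

C_KM_PER_SEC = 2.998e5   # velocity of light, km./sec.


def gamma_from_period(period_sec, height_km):
    """gamma = c * tau / (2 * pi * h), as in (3.23)."""
    return C_KM_PER_SEC * period_sec / (2.0 * math.pi * height_km)


def c_over_u(period_sec, height_km, n=1):
    """Ratio c/u for perfectly conducting boundaries, as in (4.3)."""
    x = n * math.pi * gamma_from_period(period_sec, height_km)
    if x >= 1.0:
        raise ValueError("no propagating solution: n*pi*gamma >= 1")
    return 1.0 / math.sqrt(1.0 - x * x)


def limiting_period(height_km, n=1):
    """Longest period for which the solution exists, tau = 2h/(c n)."""
    return 2.0 * height_km / (n * C_KM_PER_SEC)


print(f"limiting period for h = 75 km.: {limiting_period(75.0) * 1e3:.2f} msec.")
print(f"c/u at tau = 0.25 msec.: {c_over_u(2.5e-4, 75.0):.3f}")
```

The error raised beyond the limiting period corresponds to the failure of the solution noted above, which is why periods of several milliseconds cannot be reached with perfect conductors.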

For the comparison a group of six atmospherics occurring at 12.00 hr. on 10 June 
1939 was selected. The D.F. distance was 1200 km. 



Let t_0 be the time of arrival of the first disturbance from an atmospheric originating 
at distance s at time t = 0, and let t be the time of arrival of a wave of period τ 
travelling with group velocity u, then 

$$\frac{c}{u} = 1 + \frac{c}{s}\,(t - t_0),$$ (4.4) 

assuming that the first disturbance travelled with the velocity of light. 



Figure 4. Curves of c/u − 1 and a against γ. (1) c/u − 1, for perfect conductors. (2) c/u − 1, 
horizontal polarization, k = 0.04, mode corresponding to (1). (3) a, horizontal polarization, 
k = 0.04, mode corresponding to (1). (4) c/u − 1, vertical polarization, k = 0.04, mode 
corresponding to (1). (5) a, vertical polarization, k = 0.04, mode corresponding to (1). 



Measurements were made of the time interval between the first departure from 
the central line and each later crossing. Let this interval be t_n. It was assumed that 
the wave of period τ = t_{n+1} − t_{n−1} had arrived at time t_n, so that t_n represented 
the quantity t − t_0 of equation (4.4), and (4.4) may therefore be rewritten as 

$$1 + ct_n/s = c/u.$$ (4.5) 



Figure 5. Time of arrival against period for six daylight atmospherics. 


ct_n/s was then plotted against τ in figure 5. The curve of that figure is that obtained 
by calculation from (4.3) and (4.4), taking s = 1200 km., h = 75 km. The curve fits 
the observations reasonably well, but it should be remarked that the corresponding 
curve for s = 600, h = 57.5 km. fits just as well. 

As stated in § 1, Schonland and his associates had found that the spacing of the 
waves in daylight atmospherics conformed to the Laby formula for reflexions from 
an ionospheric layer at a height of about 60 km. This raises the question of the 
relationship between the two modes of approach. 

Consider a record consisting of a series of pulses as shown in figure 6. Laby’s 
formula (the notation has been changed slightly) is 

8 

R being the radius of the earth. 


r h 1o»W\* 

1+ ” + s T ! ‘ 


R 


1 , 


ilJjl 


-t%n~ 






J1 


n 


Figure 6 

Defining l and r as above it can be shown that (4-6) is equivalent to 


(4-6) 



/ dy l + (h/R) A* 

\ s) ]-(c*T 2 / 4**) V 

(4-7) 

or, in terms of y, 

( ciy 1 + (hjR) A* 

\ e) 1 — ttV a*’ 

(4-8) 

Using the formula for group velocity given in (4.3) and (4.4) 

$$\left(1 + \frac{ct_n}{s}\right)^2 = \frac{1}{1 - \pi^2\gamma^2}.$$ (4.9) 

(4.8) and (4.9) differ only by the small quantities h/R (which arises because of the 
curvature of the earth and was neglected in the present development) and h²/s². 
Both these terms are small and it is not practicable to discriminate between (4.8) 
and (4.9) on the present observations. The discussion has, however, shown that, in 
the case of perfect conductors, there is no solution corresponding even remotely to 
the slow component. 


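5. Layers of finite conductivity: solutions 
corresponding to those of § 4 

In general p² = ε − 4πiσ/ω. It will now be assumed that σ is sufficiently large so 
that 4πσ/ω ≫ ε, 

$$p^2 = -\frac{4\pi i\sigma}{\omega} = -\frac{4\pi i\sigma h\gamma}{c},$$ 

and 1/p² and higher powers may be neglected. With these assumptions (3.27) and 
(3.28) reduce respectively to 

$$\tan\frac{\alpha_1}{\gamma} = -\frac{k(1 - i)}{\alpha_1\sqrt{(2\gamma)}},$$ (5.1) 

$$\tan\frac{\alpha_1}{\gamma} = -\frac{k\alpha_1(1 - i)}{\sqrt{(2\gamma)}},$$ (5.2) 

where 

$$k = \left(\frac{c}{4\pi h}\right)^{1/2}\left(\sigma_0^{-1/2} + \sigma_2^{-1/2}\right).$$ (5.3) 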



Since a and v are now complex it is convenient to write 


1 

v 






so that 
where 


exp{ia)(t-~x/v)} = exp jj exp ^ , 


a = — (oh l 


Solutions of (3.27) and (3.28) corresponding to those already found for perfect 
conductors can be found numerically, using the perfect conductor solution R(α) = nπγ, 
I(α) = 0 as a starting point. u and a are plotted against γ in figure 4 for k = 0.04. 

The curves for horizontal polarization do not differ very greatly from those for 
perfect conductors. For vertical polarization the attenuation is much greater than 
for horizontal polarization and the curves begin at finite values of y. 


6. Layers of finite conductivity: the slow component 

It has been established that in the case of perfect conductors the only solutions 
relate to the oscillatory component. The corresponding solutions for finite con¬ 
ductivity were discussed in the preceding section. It remains to be seen whether 
there is a solution of (5.1) or (5.2) which has no counterpart in the perfect conductor 
case. 

Assuming that α_1/γ is small it is easily found from (5.1) (vertical polarization) 
that, to a first approximation, 

$$\alpha_1 = k^{1/2}\gamma^{1/4}e^{3\pi i/8}.$$ (6.1) 

There is no corresponding solution of (5.2) (horizontal polarization). Values of u 
and a calculated from the value of α_1 given in (6.1) are plotted against γ and τ 
(h = 60 km.) in figure 7. The periods are of the right order of magnitude for the slow 
component. 
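
The approximate root (6.1) can be combined with (3.26) to give the complex ratio c/v, as in the sketch below. This is an illustration under stated assumptions and not the computation behind figure 7: the function names and the sample values k = 0.04, τ = 3 msec. and h = 60 km. are chosen here, and only the phase velocity is evaluated, since the group velocity requires a further differentiation with respect to ω that is not attempted.

```python
import cmath
import math

C_KM_PER_SEC = 2.998e5   # velocity of light, km./sec.


def gamma_from_period(period_sec, height_km):
    """gamma = c * tau / (2 * pi * h), as in (3.23)."""
    return C_KM_PER_SEC * period_sec / (2.0 * math.pi * height_km)


def alpha_1(k, gamma):
    """First approximation (6.1): k^(1/2) * gamma^(1/4) * exp(3*pi*i/8)."""
    return math.sqrt(k) * gamma**0.25 * cmath.exp(3j * math.pi / 8.0)


def c_over_v(k, gamma):
    """Complex c/v from (3.26), c^2/v^2 = 1 - alpha_1^2.

    cmath.sqrt returns the root with non-negative real part, which is
    the one corresponding to propagation in the direction of increasing x.
    """
    a1 = alpha_1(k, gamma)
    return cmath.sqrt(1.0 - a1 * a1)


g = gamma_from_period(3.0e-3, 60.0)   # hypothetical period of 3 msec.
ratio = c_over_v(0.04, g)
print(f"gamma = {g:.3f}")
print(f"real part of c/v      = {ratio.real:.4f}")
print(f"imaginary part of c/v = {ratio.imag:.4f}")
```

A real part greater than unity means a phase velocity below that of light, and the negative imaginary part corresponds, with the time factor e^{iω(t − x/v)}, to a wave attenuated in the direction of propagation.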

Watson-Watt et al. remark (1937, p. 280) that near the source the slow component 
starts before the oscillatory component, there being considerable variation in the 
interval between the starting times. In figure 2 the mean interval is shown as 0.5 msec., 
and this interval has been used in deriving the velocities of table 1. Since the present 
discussion is concerned only with the mode of propagation and velocities, the signi¬ 
ficance of the difference in starting times with respect to the physical conditions 
governing the development of the slow and oscillatory components at the source 
will not be discussed here. 

The figures in table 1 were derived from figure 2. The table gives (1) the distance,
(2) the time interval between the oscillatory and slow components, (3) the value of
$c/u - 1$ calculated on the assumption that the first onset of the oscillatory component
travelled with velocity $c$, (4) the quasi-period of the slow component, (5) the value of
$\gamma$ calculated from (4) with $h$ = 60 km., (6) the values of $k$ found by interpolation, or
extrapolation, from the curves of figure 7, (7) the values of $\sigma_2$ calculated from (5.3)
($\sigma_0$ has been taken as infinite). Where values of $k$ and $\sigma_2$ have not been given it is
because the values of $c/u - 1$ lie so far outside the range of figure 7 that the
extrapolation would be unreliable. It is clear, however, that these points correspond
to considerably lower values of $\sigma_2$.
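Columns (3) and (5) of table 1 can be recomputed from the other columns. The following is a minimal sketch, assuming a round value of 300 km per msec for the velocity of light and reading the header of column (5) as $\gamma = c\tau/2\pi h$; the function names are illustrative and not from the paper.

```python
import math

C_KM_PER_MSEC = 300.0  # assumed round value for the velocity of light, km per msec


def velocity_excess(distance_km, interval_msec):
    """Return c/u - 1 for a slow component lagging a signal that travels at c."""
    # d/u - d/c = interval, hence c/u - 1 = c * interval / d
    return C_KM_PER_MSEC * interval_msec / distance_km


def gamma_from_period(period_msec, h_km=60.0):
    """Return gamma = c * tau / (2 * pi * h) for quasi-period tau and height h."""
    return C_KM_PER_MSEC * period_msec / (2.0 * math.pi * h_km)


# (distance km, interval msec, quasi-period msec) for three rows of table 1
sample_rows = [(200, 0.34, 3.8), (2000, 1.60, 7.9), (5000, 2.01, 9.9)]
for d, dt, tau in sample_rows:
    print(d, round(velocity_excess(d, dt), 2), round(gamma_from_period(tau), 1))
```

For these rows the recomputed values agree with the tabulated ones to within rounding in the last figure.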



Figure 7. Curves of ($c/u - 1$) (dash) and $a$ (dash dot) against
$\gamma$ and $\tau$ ($h$ = 60 km.) (based on 6.1).

The bias of the observations to larger values of k at short distances is consistent 
with the proposed mode of propagation since, as k increases, so also does the attenua¬ 
tion, and therefore it is to be expected that points corresponding to large values of k 
should only be found at limited distances from the source. 

The same type of conclusion can be drawn from figure 3. The curves in that figure
are drawn for $c/u - 1$ = 0.075, 0.125 and 0.25 with arbitrary zeros. It will be seen
that the most usual value of $c/u - 1$ for the larger distances is in the neighbourhood
of 0.125 and that a large proportion of the observations can be included between
curves for $c/u - 1$ = 0.075 and 0.25. Taking $\tau$ = 10 msec., corresponding to distances
of 2000 to 5000 km., $\sigma_2$ lies between $1.7 \times 10^3$ and $18 \times 10^3$ e.s.u. Once again a
considerable number of points at the shorter distances correspond to larger values of
$c/u$, larger values of $k$, and smaller values of $\sigma_2$.


TABLE 1

            interval between
            slow and oscillatory              quasi-     γ = cτ/2πh             σ₂
distance    components                        period     (h = 60 km.)           (e.s.u.
(km.)       (msec.)               c/u − 1     (msec.)                   k       × 10⁻³)

  200        0.34                  0.50         3.8        3.0          -        -
  300        0.41                  0.41         2.8        2.3          -        -
  400        0.69                  0.52         3.4        2.7          -        -
  500        0.81                  0.48         3.2        2.5          -        -
  600        0.95                  0.47         3.6        2.9          -        -
  700        1.09                  0.47         5.5        4.4          -        -
  800        0.72                  0.27         6.8        5.4          0.6      1
  900        0.91                  0.30         4.1        3.3          -        -
 1000        1.41                  0.42         7.0        5.6          -        -
 1500        1.50                  0.31        10.5        8.3          0.6      1
 2000        1.60                  0.24         7.9        6.3          0.53     1.4
 2500        1.60                  0.19         8.7        6.9          0.40     2.5
 3000        1.78                  0.18        11.2        8.9          0.33     3.7
 5000        2.01                  0.12         9.9        7.9          0.24     6.9


The order of magnitude found for $\sigma_2$ is much lower than that given by Appleton
(1935) for the ionosphere. This is consistent, as Appleton's discussion referred to
layers effective at much higher frequencies than those under discussion here. The
values of $\sigma_2$ depend on the choice of $h$. This choice was to some extent arbitrary since
the only relevant information is that the reflexion formula applied to daylight
atmospherics gives an effective height of 60 km. Any permissible change in $h$ (say
to 90 km.) would not, however, affect the order of magnitude for $\sigma_2$.
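The dependence of $\sigma_2$ on $k$ and $h$ can be illustrated numerically. Equation (5.3) is not legible in this copy, so the sketch below ASSUMES that with $\sigma_0$ infinite it reduces to $k^2 = c/(4\pi h \sigma_2)$. That form is inferred here only because it reproduces the last two columns of table 1 (read in units of $10^3$ e.s.u.); it is not quoted from the paper, and the function name is hypothetical.

```python
import math

C_CM_PER_SEC = 3.0e10  # velocity of light in cm/sec (e.s.u. conductivity is in 1/sec)


def sigma2_from_k(k, h_km=60.0):
    """Upper-layer conductivity in e.s.u., ASSUMING k**2 = c / (4*pi*h*sigma2)."""
    h_cm = h_km * 1.0e5
    return C_CM_PER_SEC / (4.0 * math.pi * h_cm * k * k)


# k values from the last four rows of table 1
for k in (0.53, 0.40, 0.33, 0.24):
    print(k, round(sigma2_from_k(k) / 1.0e3, 1))

# Effect of raising h from 60 km to 90 km at a fixed k
ratio = sigma2_from_k(0.40, h_km=90.0) / sigma2_from_k(0.40, h_km=60.0)
print(round(ratio, 3))
```

Under this assumption a change of $h$ from 60 to 90 km scales $\sigma_2$ by two-thirds at fixed $k$, which leaves the order of magnitude unchanged. The sketch ignores the accompanying change in $\gamma$, and hence in the value of $k$ read from figure 7.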

Another point which may be noted in favour of the proposed explanation is that 
owing to the rapid increase in the value of $a$ below $\gamma = 1$ there is no question of the
higher frequencies of the oscillatory component being propagated in this mode. 

Qualitatively therefore the proposed mode of propagation does fit the observed 
facts. 

My thanks are due to Dr B. F. J. Schonland, F.R.S., for suggesting this problem 
and for much helpful discussion. 


References 

Appleton, E. V. 1935 Reports on progress in physics, 2, 129.

Appleton, E. V. & Chapman, F. W. 1937 Proc. Roy. Soc. A, 158, 1.

Haubert, A. 1946 Thesis presented to the Faculty of Sciences, University of Poitiers.

Laby, T. H., Nicholls, F. G., Nickson, A. F. & Webster, H. C. 1937 Nature, 139, 837.

Schonland, B. F. J., Elder, J. S., Hodges, D. B., Phillips, W. E. & van Wyk, J. W. 1940
Proc. Roy. Soc. A, 176, 180.

Watson, G. N. 1918 Proc. Roy. Soc. A, 95, 546.

Watson-Watt, R. A., Herd, F. F. & Lutkin, F. 1937 Proc. Roy. Soc. A, 162, 267.




Magnetization curves for ferromagnetic single crystals 
By H. Lawton and K. H. Stewart, Cavendish Laboratory
(Communicated by Sir Lawrence Bragg , F.R.S.—Received 23 July 1947) 

Magnetization curves for iron single crystals above the ‘knee’ are derived on the basis of
domain theory for the case where the specimen is finite and the field is applied in an arbitrary
direction with respect to the crystal axes. The shape of the specimen is important for the
magnetization process and it is shown that in many cases the demagnetizing field must be such
as to make the field actually acting in the crystal have a direction of symmetry (e.g. [111] or
[110]) whatever the direction of the applied field. The cases of an oblate spheroid with its
equatorial plane a (100) plane of the crystal and of a long rod with arbitrary orientation are
considered in detail. In these cases simple expressions for the magnetization curves are
obtained. There is good agreement with the experimental results of various authors for both
parallel and normal components of magnetization. A method for correcting for the effect of
internal strains is indicated.

1. Introduction* 

It is well known that the experimentally observed magnetization curves for ferromagnetic
single crystals, where the field is applied along one of the principal directions
of the crystal, can be satisfactorily accounted for by the domain theory as
developed by Akulov (1931) and others (Becker & Döring 1939, chapter 9). Becker
& Döring have, however, pointed out (p. 121) that it is difficult to extend the theory
to cover arbitrary directions of the applied field and Takagi (1939) and von Engel
& Wills (1947), considering experimental results by Honda & Kaya (1926), have
concluded that the simple theory needs modification in order to explain observed
facts. The difficulties arise when the direction of magnetization is not the same as
that of the applied field, since in this case the demagnetizing field, inevitable in
a finite crystal, may modify not only the magnitude but also the direction of the
field actually acting in the interior of the crystal, and the problem of finding the
equilibrium configuration of domains is at first sight very complicated. The present
investigation nevertheless shows that for the shapes of specimen of greatest practical
interest, a strict application of simple domain theory leads to reasonably simple
results which agree well with experiment. By a suitable averaging process, the
magnetization curves for single crystals with different orientations can be combined
to give curves for polycrystalline specimens, but this development will not be dealt
with in this paper.

2. Principles of domain theory for iron crystals

According to domain theory the intensity of magnetization in a ferromagnetic
is everywhere equal to the saturation intensity $I_s$ but the direction of the magnetization
varies from place to place in such a way as to make the free energy of the system

* After this paper was completed, a paper by Néel (1944), dealing with the same subject,
was brought to our notice. It is hoped that the present paper may still be of interest as its
treatment differs from Néel's in various ways. A summary of Néel's paper in relation to the
present one is given in the Appendix.



a minimum. A region in which the magnetization is constant in magnitude and 
direction is called a domain. The free energy is the sum of three terms, crystal energy, 
magnetic potential energy and strain energy; the last of these is zero in an ideal 
crystal and the other two are given, per unit volume, by the expressions 

$E_K = K_1(\omega_1^2\omega_2^2 + \omega_2^2\omega_3^2 + \omega_3^2\omega_1^2) + K_2\,\omega_1^2\omega_2^2\omega_3^2$,

the first two terms of the power series in the $\omega$'s for a cubic crystal, and

$E_H = -I_s H\cos\chi$

respectively, where $K_1$ and $K_2$ are ‘anisotropy constants’, $\omega_1$, $\omega_2$, $\omega_3$ the direction
cosines of the magnetization with respect to the crystal axes, $H$ the magnetic field
and $\chi$ the angle between $H$ and $I_s$. When $K_1$ is positive, as in iron, it follows that
in zero field the six cube edge directions ([100] directions)* are directions of minimum
energy of magnetization. As soon as a small field is applied, the [100] direction
nearest to the field provides the deepest energy minimum and the domain magnetization
will take up this direction. If the field is increased, the magnetization will turn
from the [100] direction towards the field so as to keep the free energy a minimum,
and this turning will increase the component of magnetization in the field direction.
The free energy is

$E = -I_s H\cos\chi + K_1(\omega_1^2\omega_2^2 + \omega_2^2\omega_3^2 + \omega_3^2\omega_1^2) + K_2\,\omega_1^2\omega_2^2\omega_3^2$ (1)

per unit volume and from the condition that this should be a minimum, the following
relations giving the magnetization when a field $H$ is applied along the principal
directions of the crystal are obtained (Akulov 1929, 1931; Gans & Czerlinski 1932).
With $I_H/I_s = \eta$:

$H$ along [100]: $\eta = 1$ for all $H > 0$, (2)

$H$ along [110]: $H I_s = 4K_1\,\eta\left(\eta^2 - \tfrac{1}{2}\right)$, (3)

$H$ along [111]: $H I_s = \tfrac{1}{3}K_1\left[\eta(7\eta^2 - 3) + \sqrt{2}\,\sqrt{1-\eta^2}\,(4\eta^2 - 1)\right] + \tfrac{1}{18}K_2\left[\eta(1 - 16\eta^2 + 23\eta^4) - \sqrt{2}\,\sqrt{1-\eta^2}\,(1 - 9\eta^2 + 10\eta^4)\right]$. (4)

In these three simple cases the magnetization is in the same direction as the 
applied field and the calculations agree well with experiment. In other cases, as 
already pointed out, the calculation is complicated by demagnetizing effects which 
depend on the shape of the specimen. We shall consider two shapes only, that of an 
oblate spheroid and that of a long rod. These are the shapes for which most experimental
data are available.
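The relations for the [110] and [111] directions are easiest to evaluate by choosing the reduced magnetization $\eta$ and computing the field that produces it. The sketch below uses the forms $H I_s = 2K_1\eta(2\eta^2 - 1)$ for [110] and the two-constant expression for [111] that follows from minimizing the free energy (1). The saturation value $I_s$ = 1700 gauss is an assumed round figure, not a value quoted in the text, and the function names are illustrative.

```python
import math


def field_110(eta, K1, Is):
    """Field H giving reduced magnetization eta along [110] (1/sqrt(2) <= eta <= 1)."""
    return 2.0 * K1 * eta * (2.0 * eta ** 2 - 1.0) / Is


def field_111(eta, K1, Is, K2=0.0):
    """Field H giving reduced magnetization eta along [111] (1/sqrt(3) <= eta <= 1)."""
    root = math.sqrt(2.0) * math.sqrt(max(0.0, 1.0 - eta ** 2))
    k1_part = (K1 / 3.0) * (eta * (7.0 * eta ** 2 - 3.0) + root * (4.0 * eta ** 2 - 1.0))
    k2_part = (K2 / 18.0) * (
        eta * (1.0 - 16.0 * eta ** 2 + 23.0 * eta ** 4)
        - root * (1.0 - 9.0 * eta ** 2 + 10.0 * eta ** 4)
    )
    return (k1_part + k2_part) / Is


K1 = 4.2e5   # erg/cm^3, the value quoted later in the text for iron
IS = 1700.0  # gauss; an assumed round value, not quoted in the text

for eta in (0.75, 0.85, 0.95, 1.0):
    print(eta, round(field_110(eta, K1, IS), 1), round(field_111(eta, K1, IS), 1))
```

Both functions return zero field at the knee ($\eta = 1/\sqrt{2}$ and $\eta = 1/\sqrt{3}$ respectively) and the familiar saturation fields $2K_1/I_s$ and $(4K_1/3 + 4K_2/9)/I_s$ at $\eta = 1$.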

* In the text of this paper, [100] is taken to represent any one cube edge, not a particular
one. In other words, no attempt has been made to name each cube edge separately, e.g.
[010], [00$\bar{1}$], etc. The same applies to [110] and [111]. In the diagrams, for clarity, the separate
directions have, however, been specified in the usual manner.





The case of the oblate spheroid where the equatorial plane is a (100) plane of the
crystal and the magnetic field is applied in this plane is particularly simple since 
the problem becomes a two-dimensional one. This orientation for the oblate spheroid 
will be considered first and is the only one that will be dealt with in detail. 


3. Oblate spheroid 

Let the demagnetizing coefficient (the same for all directions in the plane of the
spheroid) be $N$. Then if $I$ is the mean magnetization of the crystal, the contribution
of magnetic potential energy to the free energy of the crystal is given not simply by
$-\mathbf{I}\cdot\mathbf{H}_e$, the potential energy per unit volume of $I$ in the applied field $H_e$, but by
$-\mathbf{I}\cdot\mathbf{H}_e + \tfrac{1}{2}NI^2$, the extra term representing the energy in the demagnetizing field.
This additional term in the free energy may considerably affect the process of
magnetization. In a system with no self-demagnetization the free energy, given by
(1), is a minimum when the domain magnetization $I_s$ has the same direction at all
points; when the term $\tfrac{1}{2}NI^2$ is added to (1), the minimum of $E$ may be achieved by
an arrangement which splits the material into domains having different directions
of $I_s$ so that $I$ is reduced and $\tfrac{1}{2}NI^2$ decreases by more than the other energy terms
increase. In writing the demagnetization energy as $\tfrac{1}{2}NI^2$ it is assumed that the
demagnetizing field $-NI$ is uniform throughout the crystal. This assumption is
justified if the domains into which the crystal is divided are small compared with
the crystal dimensions and if there are no ‘free poles’ of $I$ in the body of the crystal.
The small size of domains is shown by many experiments and it can be shown that
the presence of any appreciable density of ‘free poles’ would lead to energy densities
far higher than those involved in the arrangements we shall consider.

It can be seen, by considering the effect of a change in the direction of magnetization
of a small region of the crystal,* that $E$ can be a minimum only if at every point
in the material the direction of $I_s$ is such as to make the energy given by (1) a minimum,
where $H$ is taken to be the resultant field in the crystal, the vector sum of the

* If a small fraction $f$ of the crystal has its magnetization ($I_s$) at an angle $\alpha$ to the mean
magnetization, $I_1$, of the rest of the crystal, then

$\tfrac{1}{2}NI^2 = \tfrac{1}{2}N\{(1-f)^2 I_1^2 + 2(1-f)\,I_1\, f I_s\cos\alpha\}$

and the rate of change of free energy per unit volume of the whole crystal when the fraction
$f$ changes its direction of magnetization, the magnetization of the rest of the crystal remaining
constant, is


d d 

hHe-jr (cos + (cos a). 


neglecting quantities of the order of $fI_s/(1-f)I_1$ in comparison with unity. The crystal can
only be in equilibrium if $dE/d\alpha = 0$ for all possible changes. The equation shows that this
condition is the same as the condition that the energy given by (1) should be a minimum for
every part of the crystal, where the field $H$ of (1) is taken to be the resultant field, the vector
sum of $H_e$ and the demagnetizing field $-NI$.

It has been assumed that the change in $\alpha$ has been made without creating any ‘free poles’
of $I$ in the crystal. More detailed consideration shows that it is always possible to make such
a change.




applied field $H_e$ and the demagnetizing field $-NI$. We can therefore say that the
whole crystal can be in a state of minimum energy only when each of its domains
is in a state of minimum energy, calculated as the sum of crystal energy and energy
in the resultant field. When $H$ is zero there are four directions of minimum energy,
the zeros of crystal energy (in the (100) plane), and the magnetization in each
domain may have any of these directions. The effect of a field $H$ is to change the
positions of these minima and to make some deeper than others. Since wall movements
between domains with different directions of magnetization are supposed to
be able to occur freely, the equilibrium state, giving minimum free energy for the
whole crystal, will be one where all domain magnetizations have the direction of
the deepest energy minimum. If, however, all domains are magnetized in the same
direction then the demagnetizing field will be $-NI_s$ in the opposite direction and
for small values of $H_e$ the resultant field will have such a direction that the deepest
energy minimum is not the one in whose direction the domains are magnetized, and
consequently the arrangement is not one of minimum energy. This is the case foreseen
above when the energy term $\tfrac{1}{2}NI^2$ leads to a minimum energy configuration in
which the direction of magnetization is not the same at all points. The only arrangement
where the energy of the crystal can be a minimum with different domains
magnetized in different directions is one where there are two or more ‘deepest
minima’ of energy for the domains. This can happen when the resultant field is
zero, giving four equal minima, or when it has a direction symmetrically between
two of the minima of the crystal energy, i.e. a [110] direction, giving two equal
minima.

We can say that in the region of small $H_e$ any arrangement where all domains have
one particular direction of magnetization is unstable because the demagnetizing
field is such as to make some other of the directions of minimum energy more
favourable. The only stable arrangement is one where domains are distributed
between more than one energy minimum and such a distribution is only possible
if the resultant field $\mathbf{H}_e - N\mathbf{I}$ is symmetrical between two or more crystal energy
minima. This condition of symmetry for $H$ determines the distribution of domains
between the energy minima since the mean magnetization $I$, and hence $-NI$ which
determines the direction of $H$, depends on this distribution. As will be seen later,
the distribution of domains between energy minima must change as $H_e$ changes in
order to preserve the symmetry of $H$; as $H_e$ increases the number of domains in the
minimum whose direction is nearest to that of $H_e$ will increase while the numbers
in other minima decrease. Any distribution other than that required to make $H$
symmetrical will be one for which the free energy of the crystal is not a minimum,
since changes in the direction of magnetization of some part of the crystal would
decrease the energy.

The first part of the magnetization process is the turning of domain magnetizations
from one [100] direction to another, a process which occurs for infinitesimal
values of $H$; this is the region of symmetry given by $H = 0$ with four possible directions
of magnetization. The components of the magnetization, $I_p$ parallel to $H_e$ and $I_n$
perpendicular to it, can be deduced very simply, for, since $H = 0$, its components
$H_e - NI_p$ and $NI_n$ must each be equal to zero, giving

$I_p = \dfrac{H_e}{N}$ (5)

and $I_n = 0$. (6)


As $H_e$ is increased, therefore, $I_p$ will increase through the turning of domains from
other [100] directions into the two [100] directions nearest to $H_e$, but the distribution
of domains between these two will always be such that $I_n = 0$. This process will be
complete when all the domains are distributed between the two directions. If $H_e$ is
increased further, there is no possible arrangement of domains among [100] directions
that can make $H = 0$, so that $H$ will have a finite value and, as discussed above,
its direction must lie symmetrically between two [100] directions (i.e. in a [110]
direction). When $H$ has a finite value the energy minima are, by equation (1), turned
from the [100] directions towards the direction of $H$ and the magnetization is thus
increased. It can be seen that as $H_e$ is increased, the distribution of domains between
the minima that is required to make $H$ have a [110] direction will change, the
proportion of domains in the direction nearer to $H_e$ increasing. In this region, which
we shall call the ‘two-vector’ region, we can calculate the magnetization as follows.

The field $H$ has a direction symmetrically between two [100] directions, i.e. a
[110] direction of the crystal. The magnitude of the component of magnetization
in this direction will be independent of the distribution of domains between the
[100] directions; it is in fact the same as the magnetization $I_{[110]}(H)$ that would be
produced by a field $H$ applied to an infinite crystal in a [110] direction, and is given
by equation (3). We can therefore derive the value of the field $H$ as the sum of the
component of $H_e$ in the [110] direction and the component of the demagnetizing field
in the same direction,

$H = H_e\cos(45^\circ - \lambda) - N I_{[110]}(H)$,

where $\lambda$ is the angle between $H_e$ and the nearest [100] direction, giving

$H_e = \sec(45^\circ - \lambda)\,\{H + N I_{[110]}(H)\}$. (7)

Knowing $H_e$ in terms of $H$, we can calculate $I_p$ and $I_n$ from the condition that the
demagnetizing field must make $H$ lie in a [110] direction:

$H\cos(45^\circ - \lambda) = H_e - NI_p$,

$H\sin(45^\circ - \lambda) = NI_n$,

giving

$I_p = \dfrac{H_e - H\cos(45^\circ - \lambda)}{N}$, (8)

$I_n = \dfrac{H}{N}\sin(45^\circ - \lambda)$. (9)


Equations (7), (8) and (9), together with equation (3) giving $I_{[110]}(H)$, determine
the magnetization. Figure 1 shows the values of $I_p$ and $I_n$ calculated in this way for
$\lambda = 20^\circ$, $N = 0.184$, $K_1 = 4.2 \times 10^5$ erg./cm.$^3$, conditions corresponding to an
experiment by Honda & Kaya (1926), whose experimental results are also shown
in the figure. The parts $Ox$ of the curves correspond to equations (5) and (6) for
the region where $H = 0$, the parts $xy$ to equations (7), (8) and (9). The ‘knee’, $x$,
between the two regions occurs where $H = 0$ in equations (7), (8) and (9), giving

$I_n = 0, \qquad I_p = \dfrac{H_e}{N} = \dfrac{I_s}{\cos\lambda + \sin\lambda}$. (10)
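The ‘two-vector’ portion of the curve can be traced by taking the reduced [110] magnetization $\eta$ as a parameter, which avoids inverting equation (3). The sketch below is illustrative only: it takes the [110] relation in the form $H I_s = 2K_1\eta(2\eta^2 - 1)$, uses an assumed $I_s$ = 1700 gauss (not quoted in the text), and reckons $I_n$ as positive towards the nearer [100] direction. The function name is hypothetical.

```python
import math


def two_vector_point(eta, lam_deg, N, K1, Is):
    """Return (H_e, I_p, I_n) in the two-vector region for reduced [110] magnetization eta."""
    d = math.radians(45.0 - lam_deg)                    # angle between H_e and [110]
    H = 2.0 * K1 * eta * (2.0 * eta ** 2 - 1.0) / Is    # internal field, from eq. (3)
    I110 = Is * eta                                     # magnetization component along [110]
    He = (H + N * I110) / math.cos(d)                   # eq. (7)
    Ip = (He - H * math.cos(d)) / N                     # eq. (8)
    In = H * math.sin(d) / N                            # eq. (9)
    return He, Ip, In


LAM, N, K1 = 20.0, 0.184, 4.2e5   # conditions quoted for figure 1
IS = 1700.0                       # gauss; assumed round value

for eta in (1.0 / math.sqrt(2.0), 0.72, 0.74, 0.76):
    He, Ip, In = two_vector_point(eta, LAM, N, K1, IS)
    print(round(eta, 3), round(He, 1), round(Ip, 1), round(In, 1))
```

At $\eta = 1/\sqrt{2}$ the internal field vanishes and the point coincides with the knee, $I_p = I_s/(\cos\lambda + \sin\lambda)$, $I_n = 0$. The sketch does not test where the ‘single-vector’ arrangement takes over, so large values of $\eta$ may lie beyond the range in which the two-vector solution applies.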


Equations (7), (8) and (9) apply so long as $H$ remains in the [110] direction but, as
was seen above, this condition becomes unnecessary for large values of $H_e$ (depending
on $N$ and $\lambda$) because a stable arrangement with all domains in one direction, a
‘single-vector’ arrangement, becomes possible. For this region we can calculate
the magnetization as follows (Honda & Okubo 1916).



Figure 1. Calculated magnetization curves for oblate spheroid, with $\lambda = 20^\circ$. Experimental
points from Honda & Kaya (1926). Magnetization, $I$, in gauss, applied field, $H_e$, in
oersteds.


We will suppose that the domains, all lying in the same energy minimum, have
turned through an angle $\beta$ from the [100] direction. The free energy of the domains
is

$E = E_K(\beta) - I_s H_e\cos(\lambda - \beta) + \tfrac{1}{2}NI_s^2$,

and the condition for equilibrium is

$\dfrac{dE}{d\beta} = \dfrac{d}{d\beta}E_K(\beta) - I_s H_e\sin(\lambda - \beta) = 0$, (11)

where $E_K(\beta)$ is the crystal energy which, in this case, simplifies to

$E_K(\beta) = K_1\sin^2\beta\cos^2\beta$.

Thus

$\dfrac{d}{d\beta}E_K(\beta) = \dfrac{K_1}{2}\sin 4\beta$,

so that we have

$H_e = \dfrac{K_1\sin 4\beta}{2I_s\sin(\lambda - \beta)}$. (12)



Figure 2. Calculated magnetization curves for oblate spheroid with $\lambda = 30^\circ$ and $\lambda = 40^\circ$.
Experimental points from Honda & Kaya (1926). Parallel component of magnetization,
$I_p$, in gauss, applied field, $H_e$, in oersteds.

$I_p$ and $I_n$ are given by

$I_p = I_s\cos(\lambda - \beta)$, (13)

$I_n = I_s\sin(\lambda - \beta)$. (14)

Equations (12), (13) and (14) determine the magnetization for the ‘single-vector’
region, and give the sections $y$–$z$ of figure 1. The intersection $y$ between the curves
for the ‘two-vector’ and the ‘single-vector’ regions clearly determines the point
at which $H$ just leaves the [110] direction, and all domains lie in one direction rather
than distributed between two. It can be verified by more detailed calculation that
the number of domains in one direction tends to zero as we approach $y$ along the
‘two-vector’ curve and the direction of $H$ tends to [110] as we approach $y$ along
the ‘single-vector’ curve, as would be expected.
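The ‘single-vector’ branch can be evaluated in the same parametric way, here with the rotation angle $\beta$ as the parameter. This is a minimal sketch of equations (12) to (14) as read above; the value $I_s$ = 1700 gauss is an assumption, and the function name is illustrative.

```python
import math


def single_vector_point(beta_deg, lam_deg, K1, Is):
    """Return (H_e, I_p, I_n) when all domains have turned beta from [100] (0 < beta < lambda)."""
    beta = math.radians(beta_deg)
    lam = math.radians(lam_deg)
    He = K1 * math.sin(4.0 * beta) / (2.0 * Is * math.sin(lam - beta))  # eq. (12)
    Ip = Is * math.cos(lam - beta)                                      # eq. (13)
    In = Is * math.sin(lam - beta)                                      # eq. (14)
    return He, Ip, In


K1 = 4.2e5   # erg/cm^3, as quoted for figure 1
IS = 1700.0  # gauss; assumed round value

for beta_deg in (5.0, 10.0, 15.0, 18.0):
    He, Ip, In = single_vector_point(beta_deg, 20.0, K1, IS)
    print(beta_deg, round(He, 1), round(Ip, 1), round(In, 1))
```

As a consistency check, setting $\lambda = 45^\circ$ makes the applied field lie along [110], and (12) then reduces to the [110] relation $H I_s = 2K_1\eta(2\eta^2 - 1)$ with $\eta = \cos(45^\circ - \beta)$. Note that the demagnetizing coefficient does not enter this branch, since the field $-NI_s$ is antiparallel to the magnetization and exerts no torque on it.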

Magnetization curves calculated in the same way for $\lambda = 30^\circ$ and $\lambda = 40^\circ$, using
the same values for $N$ and $K_1$, are shown in figures 2 and 3 together with the experimental
points obtained by Honda & Kaya. In all cases it will be seen that the
agreement with experiment is quite good; in fact, bearing in mind uncertainties in
the experimental results, such as rotational hysteresis and corrections for air flux,
and possible errors in the value of $N$ and $K_1$, it may be said that there is no significant
difference between theory and experiment. The experimental $I_p$ curves are a little
lower and more rounded than the theoretical ones; similar differences are well known
to exist between experimental and theoretical curves for magnetization in the
principal directions of the crystal (Becker & Döring 1939, p. 118) and are usually
attributed to internal strains. Gans & Czerlinski (1932) have proposed a method for
correcting curves for strain energy by use of the experimental curves for magnetization
in a [100] direction, but the validity of their method seems doubtful. With our
method of calculation, allowance can be made for strains by using experimental
curves obtained in the [110] direction instead of the theoretical equation (3) to give
$I_{[110]}(H)$. Using Honda & Kaya's results for $I_{[110]}(H)$, the curves of figures 1, 2 and 3
are modified slightly and agree rather better with experimental values as shown by
the dotted lines of figures 2 and 3. (Curves corrected in this way are shown for
$\lambda = 30^\circ$ only.)

Figure 3. Calculated magnetization curves for oblate spheroid with $\lambda = 30^\circ$ and $\lambda = 40^\circ$.
Experimental points from Honda & Kaya (1926). Normal component of magnetization,
$I_n$, in gauss, applied field, $H_e$, in oersteds.

The upper parts of the $I_n$ curves ($y$–$z$ in figure 1) have been obtained before and
are known to agree with experiment for large $H_e$, i.e. at points well to the right of
figure 1 (Honda & Okubo 1916; Webster 1925; Gans 1933).

The observed values of $I_n$ to the left of the transition point $y$ do not appear to
have been explained previously* without modification of the simple domain theory.
Takagi (1939) and von Engel & Wills (1947) developed a statistical theory in order
to explain the curves; the theories involved the introduction of an arbitrary energy
parameter $E_0$ which had to be found by comparison with experiment; $E_0$ was found
to be much larger than the thermal energy and no satisfactory interpretation of
$E_0$ has been given. It now appears that the $I_n$ curves can be explained by a strict
application of the simple form of the domain theory, good agreement with experiment
being obtained using only values of constants that are already determined by
independent experiments.

A further check of the main idea of our theory can be made by determining the
direction of $H$ in the experiments of Honda & Kaya. The theory predicts that for
external fields $H_e$ up to several hundred gauss the resultant field $H$ should lie in a
[110] direction. The resultant field can be calculated from the experimental values
of $H_e$, $I_p$, $I_n$ and $N$ and is in fact found to lie close to a [110] direction in the appropriate
range of $H_e$. The test is not very exact since the direction of $H$ is sensitive to
errors in the experimental data for $I_p$ and $N$. The fact that $H$ is frequently found to
lie near a [110] direction was commented on by von Engel & Wills (p. 481) but was
found to conflict with, rather than to support, their theory.

4. Long rod 

The other shape of specimen we shall consider is a long rod with the external field
applied in the direction of its length. The rod is taken to be so long that there is no
demagnetizing effect along its length, while the demagnetizing coefficient in all
directions at right angles is of the order of $2\pi$. The orientation of the rod will be
defined by $\alpha_1$, $\alpha_2$, $\alpha_3$, the direction cosines of its axis with respect to the crystal axes.

It will be shown that it is possible to calculate the component of magnetization
parallel to the rod ($I_p$) to a very close approximation without knowing the exact
value of the demagnetizing coefficient $N$ perpendicular to the rod. It is not possible
to calculate the normal component of magnetization without knowing $N$, but
the value of $I_n$ is always small and no measurements of $I_n$ for long rods appear to
have been made, so our calculations will be confined to $I_p$.

As before, the first part of the magnetization process is the turning of domains
into the easy directions nearest to the field and will be completed in an infinitesimal
field $H$, which in this case corresponds to an infinitesimal external field $H_e$, since


* But see footnote, p. 72.




there is no demagnetization in the direction of $H_e$. The distribution of domains when
this process is complete must, however, be such as to make $I_n$ zero since a finite $I_n$
would produce a finite demagnetizing field $NI_n$ perpendicular to the rod and lead
to instability of domain distribution of the type discussed in the oblate spheroid
case. It can be shown (Becker & Döring 1939, p. 119) that such a distribution,
with every domain in one or other of the three directions nearest the field, yet giving
$I_n = 0$, produces a parallel component of magnetization

$I_p = \dfrac{I_s}{\alpha_1 + \alpha_2 + \alpha_3}$, (15)

and this is, therefore, the value of $I$ when the processes occurring in infinitesimal
fields are complete. The first part of the magnetization curve ($Ox$, figure 4) is thus
a vertical rise in zero field to the value $I_s/(\alpha_1 + \alpha_2 + \alpha_3)$. Figure 4 is calculated for an
iron single crystal rod (no. 3) measured by Sizoo (1929).



Figure 4. Calculated magnetization curve for long rod with orientation corresponding to
Sizoo's crystal 3 (1929), for which the experimental values, corrected for longitudinal
demagnetizing coefficient ($N$ = 0.00344), are shown as ○. Magnetization, $I$, in gauss, applied field,
$H_e$, in oersteds.


When $H_e$ is increased from its infinitesimal value, $H$ will also increase to a finite
value and the same sort of argument as was used in the case of the oblate spheroid
shows that, in general, the distribution of domains among the three easy directions
nearest to $H_e$ must be such as to make $H$ have a [111] direction, symmetrically
between three [100] directions. Any other direction of $H$ would cause all the domains
to turn to one particular direction rather than to distribute themselves, and such a
‘single-vector’ arrangement can, as before, be shown to have such a large demagnetizing
energy that it is not a configuration of minimum free energy. The process of
magnetization in this region just above the knee $x$, the ‘three-vector’ region, is
illustrated by figure 5 which represents various vectors as points on the surface of
a unit sphere. At the knee the domains are distributed among the three [100]
directions, $A$, $B$ and $C$, which are nearest to the applied field $H_e$. Above the knee the
resultant field $H$, increasing as $H_e$ increases, is in the [111] direction and the energy
minima for domains move symmetrically towards $H$ along lines $AA'$, $BB'$, $CC'$.
The proportions of domains in the $A'$, $B'$ and $C'$ directions will change as $H_e$ increases
so as to provide a demagnetizing field that always has the magnitude and direction
required to make $H$ lie in the [111] direction. We can calculate the magnetization
curve for this region ($x$–$y$ in figure 4) as follows.




The components of H are H c parallel to the length of the rod and a demagnetizing 
field at right angles. We have therefore 

H oos \jr = H e , 

where y is the angle between H e and the direction of H, i.e. the [ 111 J direction so that 


a 1 + a a +a s 

•*** — jt~■ 


giving 


H 




Otj *+“ Gt$ + 


( 16 ) 




83 


Magnetization curves for ferromagnetic single crystals 


Using this value of H , we can find the intensity of magnetization I[ lU )(H) in the [111] 
direction by using equation ( 4 ). The component of I in this direction will be correctly 
given by equation ( 4 ) whatever the distribution of domains among the three possible 
directions, but the distribution may, in fact, be such that I has a considerable com¬ 
ponent in some other direction. The exact distribution required to make H have 
a [111] direction cannot be found without knowing the value of N but, as will be 
shown later, in most cases the direction of I required to produce the necessary demag¬ 
netizing field is extremely close to the axis of the rod (the H r direction). We can there¬ 
fore find I p to a close approximation, by assuming that the total magnetization, /, 
has the direction (a v a 8 , ctg), while we know that its component in the [111] direction 
is )• 


This gives 


j _ j _ 

p cos \jr ~ a x + a 2 + a 3 * 


( 17 ) 


Equations (16) and (17), together with equation (4) for I_[111](H), give the relation
between I_p and H_e; this is the portion x-y of figure 4. For the calculation we have
taken K_2 = ½K_1, which is true to the accuracy with which K_2 is known for pure iron
and for 3.85 % silicon iron (Becker & Döring 1939, p. 123; Williams 1937).
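
As a rough numerical illustration of equations (16) and (17), the sketch below evaluates H and I_p for a given rod orientation. The function names, the sample orientation and the linear stand-in for I_[111](H) are assumptions made for this example only; equation (4), which supplies the true [111] curve, is not reproduced in this section.

```python
import math


def normalise(v):
    """Scale a 3-vector to unit length so that it holds direction cosines."""
    n = math.sqrt(sum(x * x for x in v))
    return tuple(x / n for x in v)


def three_vector_point(alpha, H_e, I_111):
    """Return (H, I_p) in the 'three-vector' region.

    alpha : direction cosines (a1, a2, a3) of the rod axis
    H_e   : applied field along the rod, oersteds
    I_111 : callable giving the [111] magnetization for a resultant field H
            (a stand-in for equation (4); supplied by the caller)
    """
    cos_psi = sum(alpha) / math.sqrt(3.0)  # angle between rod axis and [111]
    H = H_e / cos_psi                      # equation (16)
    I_p = I_111(H) / cos_psi               # equation (17)
    return H, I_p


# Hypothetical orientation and a purely illustrative linear [111] curve.
alpha = normalise((0.5, 0.6, 0.7))
H, I_p = three_vector_point(alpha, 100.0, lambda field: 3.0 * field)
print(round(H, 2), round(I_p, 2))
```

For a rod cut exactly along [111] the factor cos ψ is unity, so H and I_p reduce to H_e and I_[111](H_e), which is a convenient sanity check on any implementation.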

At this stage we must consider the error caused by assuming that I had the
direction of the rod. Actually I will be inclined at a small angle ε such that the
demagnetizing field NI sin ε, compounded with the field H_e, produces a resultant
(H) in the [111] direction. We have, therefore,

$$\frac{N I \sin\epsilon}{H_e} = \tan\psi, \tag{18}$$

and for the specimen of figure 4, taking N = 2π, I = 1500, H_e = 160, ψ = 20.75°,
we get ε ≈ 0.4°. The fractional error in I_p caused by assuming I to have the direction
of the rod is {sec ψ − sec (ψ − ε)}/sec ψ which, in this case, is about ¼ % and may be
neglected. The error is of the same order of magnitude in most of the other cases
considered.
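
The size of this error can be checked numerically. The sketch below solves equation (18) for ε and evaluates the fractional error in I_p; it assumes that the field quoted as 160 is the applied field H_e, and the function name is ours.

```python
import math


def tilt_error(N, I, H_e, psi_deg):
    """Return (epsilon in degrees, fractional error in I_p).

    Solves N*I*sin(eps)/H_e = tan(psi), equation (18), for eps and then
    evaluates {sec(psi) - sec(psi - eps)}/sec(psi).
    """
    psi = math.radians(psi_deg)
    eps = math.asin(H_e * math.tan(psi) / (N * I))
    # {sec psi - sec(psi - eps)}/sec psi  ==  1 - cos(psi)/cos(psi - eps)
    frac = 1.0 - math.cos(psi) / math.cos(psi - eps)
    return math.degrees(eps), frac


eps_deg, frac = tilt_error(N=2.0 * math.pi, I=1500.0, H_e=160.0, psi_deg=20.75)
print(f"epsilon = {eps_deg:.2f} deg, fractional error = {100.0 * frac:.2f} %")
```

With the values quoted for the specimen of figure 4 this gives a tilt of a few tenths of a degree and an error of a fraction of one per cent, which supports neglecting the correction.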

The ‘three-vector’ region just considered will not extend to saturation, for as the
domains turn towards the [111] direction in increasing field, a stage is reached when
two of the domain directions (A' and B' in figure 5) become coplanar with H_e. At
this stage, even if all domains are distributed between the A' and B' directions only,
there will be no demagnetizing field perpendicular to the DA'H_eB'E plane which
would cause instability by making the C' direction more favourable than the A'
and B' directions. In fields higher than that required to make the A' and B' directions
coplanar with H_e, we have therefore a ‘two-vector’ region, the domains being
distributed between two directions A' and B' only, which move towards each other
in the DH_eE plane as H_e is increased. H need no longer be symmetrically placed
between the three vectors A', B' and C', but only between the two, A' and B', so that
it is not confined to the [111] direction but only to the (110) plane (CF in figure 5).

The transition from the ‘three-vector’ to the ‘two-vector’ region will, in fact,
occur just before the domain directions A' and B' become coplanar with H_e because
a ‘two-vector’ arrangement will be stable even when there is a demagnetizing field
perpendicular to the DH_eE plane, provided that this field is insufficient to make the
resultant field H lie nearer to C' than to A' or B'. The actual positions of the energy
minima at the transition will therefore be at a small angle below the line DH_eE
in figure 5, but it can be seen that this angle is of the same order as the angle ε of
equation (18), and the error in the subsequent calculations, if we assume the angle
to be zero, will be negligibly small. At the transition point, H will be in the [111]
direction and as H_e increases, the domain magnetizations will move towards the
point F in figure 5, remaining just below the line DH_eE, but approaching it more and
more closely, while H will move down from [111] towards F, remaining in the (110)
plane, CF.

With this approximation, that the domain magnetizations are accurately in the
plane DH_eE instead of just below it, we can calculate the ‘two-vector’ part of the
magnetization curve (y-z in figure 4). We have first to calculate the angle φ
between the (110) plane (point F) and the directions of minimum energy A'' or B''
in a given field H_e, then to calculate the distribution of domains between the
symmetrical energy minima A'' and B'' and hence find I_p.

Let θ be the angle between the plane DH_eE, which contains H_e and is symmetrical
with respect to the [100] directions A and B, and the (100) plane DABE. Let μ be
the angle between H_e and the (110) plane CF which lies between A and B. For a
given specimen these angles θ and μ (see figure 5) are determined by the orientation
(α_1, α_2, α_3) according to the equations

$$\tan\theta = \frac{\sqrt{2}\,\alpha_3}{\alpha_1 + \alpha_2}, \tag{19}$$

$$\cos\mu = (\alpha_1 + \alpha_2)\,\frac{\cos\theta}{\sqrt{2}} + \alpha_3 \sin\theta. \tag{20}$$
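
The two angles can be evaluated directly from the direction cosines of the rod. In the sketch below cos μ follows equation (20); the expression used for θ, tan θ = √2 α_3/(α_1 + α_2), is our own geometrical reading of the definition (the tilt of the plane through H_e and the [1-10] line DE out of the plane containing A and B) and should be treated as an assumption, as should the function name.

```python
import math


def plane_angles(alpha):
    """Return (theta, mu) in degrees for rod direction cosines (a1, a2, a3)."""
    a1, a2, a3 = alpha
    # Tilt of the plane containing H_e and DE out of the plane of A and B
    # (assumed geometrical relation, see the text above).
    theta = math.atan2(math.sqrt(2.0) * a3, a1 + a2)
    # Equation (20): angle between H_e and the (110) plane CF.
    cos_mu = (a1 + a2) * math.cos(theta) / math.sqrt(2.0) + a3 * math.sin(theta)
    cos_mu = max(-1.0, min(1.0, cos_mu))  # guard against rounding just above 1
    return math.degrees(theta), math.degrees(math.acos(cos_mu))


s = 1.0 / math.sqrt(3.0)
print(plane_angles((s, s, s)))        # rod along [111]
print(plane_angles((1.0, 0.0, 0.0)))  # rod along [100]
```

A rod along [111] should give θ close to 35° 16' with μ = 0, and a rod along [100] should give θ = 0 with μ = 45°, which provides two simple checks.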


In equation (1) for the free energy, when the magnetization is in direction A'' or B'',
χ is the angle between H and direction A'' or B'' and ω_1, ω_2, ω_3 are the direction cosines
of A'' or B'' given in terms of θ and φ by

$$\sqrt{2}\,\omega_1 = \cos\theta\cos\phi \pm \sin\phi,$$
$$\sqrt{2}\,\omega_2 = \cos\theta\cos\phi \mp \sin\phi,$$
$$\omega_3 = \sin\theta\cos\phi.$$

Since H_e is the component of H in the direction of the rod axis, so that H cos μ cos γ = H_e,
and it can be seen from figure 5 that

$$\cos\chi = \cos\phi\cos\gamma,$$

where γ is the angle between H and the DH_eE plane, we have

$$H\cos\chi = \frac{H_e\cos\phi}{\cos\mu}.$$




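The equation for the equilibrium domain direction therefore becomes

$$\frac{I_s H_e \sin\phi}{\cos\mu} = -\frac{K_1}{2}\,\sin 2\phi\,\{3\cos^2\theta - 1 + \cos^2\phi\,(4\sin^4\theta - 1 - 2\cos^2\theta - \cos^4\theta)\}$$

$$\qquad\qquad + \frac{K_2}{4}\,\sin^2\theta\,\sin 2\phi\,\{\cos^2\phi\,(1 + \cos^2\theta) - 1\}\{3\cos^2\phi\,(1 + \cos^2\theta) - 1\}. \tag{21}$$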

This equation enables I_s H_e/(K_1 cos μ) to be calculated as a function of θ and φ and the
numerical results for various values of θ and φ are given in table 1. The calculations
have been simplified by assuming K_2 = ½K_1 (see above). Values calculated with
K_2 = 0 are also given in the table and it can be seen that the results depend appreciably
on the value of K_2 so that the uncertainties in K_2 will cause some uncertainty
in the calculated curves.


Table 1. Values of I_s H_e/(K_1 cos μ) in terms of θ and φ,
taking K_2 = ½K_1 and (figures in brackets) K_2 = 0

                 φ = 0°     10°       20°       30°       40°

θ = 0°            2.00      1.85      1.44      0.87      0.27
                 (2.00)    (1.85)    (1.44)    (0.87)    (0.27)

    10°           2.00      1.85      1.44      0.88      0.28
                 (1.97)    (1.82)    (1.42)    (0.87)    (0.28)

    20°           1.97      1.82      1.42      0.88       -
                 (1.84)    (1.71)    (1.34)    (0.82)      -

    30°           1.76      1.63      1.28       -         -
                 (1.56)    (1.40)    (1.10)      -         -

    35° 16'       1.55       -         -         -         -
                 (1.33)      -         -         -         -
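
Entries of this kind can be checked independently by differentiating the anisotropy energy numerically. The sketch below assumes that equation (1) has the usual cubic form K_1(ω_1²ω_2² + ω_2²ω_3² + ω_3²ω_1²) + K_2 ω_1²ω_2²ω_3² plus the field term (equation (1) itself lies outside this section), takes the ratio K_2/K_1 = 0.5 as a default assumption, and balances the torques within the DH_eE plane. All names are ours.

```python
import math


def anisotropy_energy(theta, phi, k2_over_k1):
    """Cubic anisotropy energy divided by K_1 for direction A'' (radians)."""
    w1 = (math.cos(theta) * math.cos(phi) + math.sin(phi)) / math.sqrt(2.0)
    w2 = (math.cos(theta) * math.cos(phi) - math.sin(phi)) / math.sqrt(2.0)
    w3 = math.sin(theta) * math.cos(phi)
    p, q, r = w1 * w1, w2 * w2, w3 * w3
    return p * q + q * r + r * p + k2_over_k1 * p * q * r


def reduced_field(theta_deg, phi_deg, k2_over_k1=0.5, h=1e-5):
    """I_s*H_e/(K_1*cos(mu)) from I_s*H_e*sin(phi)/cos(mu) = -dE/dphi."""
    theta = math.radians(theta_deg)
    # phi = 0 is handled as a limit by stepping slightly off the symmetry plane.
    phi = max(math.radians(phi_deg), 1e-4)
    dE = (anisotropy_energy(theta, phi + h, k2_over_k1)
          - anisotropy_energy(theta, phi - h, k2_over_k1)) / (2.0 * h)
    return -dE / math.sin(phi)


for theta_deg in (0, 10, 20, 30):
    row = [reduced_field(theta_deg, phi_deg) for phi_deg in (0, 10, 20)]
    print(theta_deg, [round(v, 2) for v in row])
```

A central difference is used instead of a closed-form derivative so that the check does not depend on any particular algebraic form of equation (21).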



For a given φ the component of I in direction F will be I_s cos φ, whatever the
distribution of domains between A'' and B''. As before, we can say that the direction
of I will be very close to the axis of the specimen so that we have as a good
approximation for I_p

$$I_p = \frac{I_s\cos\phi}{\cos\mu}. \tag{22}$$


θ and μ are fixed by the orientation of the specimen (equations (19) and (20)) so
that for a given specimen the values of I_p and H_e corresponding to a set of values
of φ can be obtained from equation (22) and table 1 (using graphical interpolation
where necessary). This method gives the relation between I_p and H_e plotted as
section y-z of figure 4.
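
The procedure can be sketched for the simplest case, a rod lying in the plane of A and B (θ = 0), using the θ = 0° row of table 1. The value of μ is hypothetical, K_1 = 4.2 × 10^5 erg/c.c. and I_s = 1720 gauss are taken here as representative figures for iron, and the function name is ours.

```python
import math

# Row theta = 0 of table 1: phi in degrees -> I_s*H_e/(K_1*cos(mu)).
TABLE_THETA_0 = {0: 2.00, 10: 1.85, 20: 1.44, 30: 0.87, 40: 0.27}


def two_vector_points(mu_deg, K_1, I_s, row=TABLE_THETA_0):
    """Return a list of (H_e, I_p) pairs for the 'two-vector' region."""
    mu = math.radians(mu_deg)
    points = []
    for phi_deg, reduced in sorted(row.items()):
        if phi_deg < mu_deg:
            continue  # equation (22) would give I_p above I_s here
        I_p = I_s * math.cos(math.radians(phi_deg)) / math.cos(mu)  # eq. (22)
        H_e = reduced * K_1 * math.cos(mu) / I_s                    # from table 1
        points.append((H_e, I_p))
    return points


for H_e, I_p in two_vector_points(mu_deg=20.0, K_1=4.2e5, I_s=1720.0):
    print(f"H_e = {H_e:6.1f} Oe   I_p = {I_p:7.1f} gauss")
```

For a general orientation θ is not zero and the appropriate row must be obtained by interpolating between the tabulated rows, as the text indicates.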

This curve for the ‘two-vector’ region intersects that for the ‘three-vector’
region in point y. It can be verified by more detailed calculation that the number
of domains in the C' direction diminishes to zero as we approach y along the
‘three-vector’ curve and that the direction of H approaches [111] as we approach y along
the ‘two-vector’ curve, indicating that the intersection at y does genuinely define
the transition from the ‘three-vector’ to the ‘two-vector’ region, to the accuracy
of the approximation discussed above.

The ‘two-vector’ region will not extend to saturation, for as the domain directions
turn towards each other along DH_eE the A'' domains will approach the direction
of the specimen axis (H_e) and a point will be reached at which the demagnetizing
field, even if all domains have the A'' direction, will be insufficient to make H lie
nearer to B'' than to A'', so that a ‘single-vector’ arrangement of domains becomes
possible without instability. This second transition point is defined by

$$\frac{N I_s \sin(\phi - \mu)}{H_e} = \tan\mu, \tag{23}$$


and therefore depends on the value of N, but in most cases this transition occurs at
values of I_p very near saturation, so that a single-vector region is unimportant in
magnetization curves as usually plotted. For the specimen of figure 4, for example,
the value of I_p at the second transition is 0.997 I_s if we assume N = 2π.
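
To see how close to saturation the second transition lies, the condition N I_s sin(φ − μ)/H_e = tan μ can be solved numerically together with the equilibrium condition for φ. The sketch below is illustrative only: it assumes the usual cubic anisotropy energy for equation (1), a ratio K_2/K_1 = 0.5, hypothetical angles θ = 10° and μ = 15° (not those of the specimen of figure 4), and representative iron values K_1 = 4.2 × 10^5 erg/c.c., I_s = 1720 gauss.

```python
import math


def reduced_field(theta, phi, k2_over_k1, h=1e-6):
    """I_s*H_e/(K_1*cos(mu)) from the torque balance (angles in radians)."""
    def energy(p):
        c, s = math.cos(theta) * math.cos(p), math.sin(p)
        a = (c + s) ** 2 / 2.0                         # omega_1 squared
        b = (c - s) ** 2 / 2.0                         # omega_2 squared
        g = (math.sin(theta) * math.cos(p)) ** 2       # omega_3 squared
        return a * b + b * g + g * a + k2_over_k1 * a * b * g
    return -(energy(phi + h) - energy(phi - h)) / (2.0 * h * math.sin(phi))


def second_transition(theta_deg, mu_deg, K_1, I_s, N, k2_over_k1=0.5):
    """Return (phi in degrees, I_p/I_s) at the two- to single-vector transition."""
    theta, mu = math.radians(theta_deg), math.radians(mu_deg)

    def residual(phi):
        H_e = reduced_field(theta, phi, k2_over_k1) * K_1 * math.cos(mu) / I_s
        return N * I_s * math.sin(phi - mu) - H_e * math.tan(mu)

    lo, hi = mu, mu + math.radians(10.0)  # residual < 0 at lo, > 0 at hi
    for _ in range(60):                   # plain bisection
        mid = 0.5 * (lo + hi)
        if residual(mid) > 0.0:
            hi = mid
        else:
            lo = mid
    phi = 0.5 * (lo + hi)
    return math.degrees(phi), math.cos(phi) / math.cos(mu)


phi_deg, ratio = second_transition(10.0, 15.0, 4.2e5, 1720.0, 2.0 * math.pi)
print(f"phi = {phi_deg:.2f} deg, I_p/I_s = {ratio:.4f}")
```

Because N I_s is much larger than H_e in such cases, the root lies only a fraction of a degree beyond φ = μ, so that I_p is within a fraction of one per cent of I_s.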



Figure 6. Calculated magnetization curves for long rods (iron) with orientations
corresponding to Kaya’s (1933) crystals 3 and 18. Points taken from Kaya’s experimental curves
are also shown. Magnetization, I, in gauss, applied field, H_e, in oersteds.


The calculation of the magnetization curve in the ‘single-vector’ region is a
rather complicated three-dimensional problem, since there are no restrictions of
symmetry on the direction of H. Becker & Döring (1939) have given a treatment of
this last part of the curve which applies, however, only to fields so high that
demagnetizing effects can be ignored.

Theoretical I_p, H_e curves calculated by the above methods for various crystals
are shown in figures 4, 6 and 7, together with points found experimentally. Figures
4 and 6 refer to crystals of iron used by Sizoo (1929) and Kaya (1933) respectively.
The values used for the theoretical curves were K_1 = 4.2 × 10^5 erg/c.c.,
I_s = 1720 gauss. Figure 7 refers to two single crystal specimens of 3.15 % silicon-iron,
about 3 × 0.1 × 0.03 cm.³, for which we ourselves have measured magnetization
curves, using a solenoid to provide the field H_e and a small search coil and ballistic
galvanometer to measure I. The values used in the calculations were K_1 = 2.8 × 10^5
erg/c.c., K_2 = ½K_1, I_s = 1605 gauss. The orientations of all specimens, in terms of
(α_1, α_2, α_3), are shown in the figures. The orientations of our specimens were found
by the method of optical reflexions from etch pits, giving an accuracy of 2 to 3°.



Figure 7. Calculated magnetization curves for long rods (iron-silicon), with orientations
corresponding to our specimens C4 and C7. Experimental points (corrected for longitudinal
demagnetizing effect) are also shown. Magnetization, I, in gauss, applied field, H_e, in oersteds.


It will be seen that the agreement between theory and experiment is not perfect.
In our own measurements, the uncertainties of experiment, particularly the correction
for air flux in the search coil, were rather large but probably not quite sufficient
to account for the whole of the deviation from theory. The differences can probably
be attributed to imperfections in the crystals, leading to an appreciable strain energy
term which should appear in equation (1). Corrections for strains can be made using
an experimental [111] curve in a similar manner to that described for the oblate
spheroid. The only [111] curves that are available are, however, ones such as those
of Honda & Kaya (1926) or Williams (1937) which were measured on specimens
from quite different batches of material, where the strains may differ considerably
from those in the long rods whose magnetization curves are to be corrected. The
curves cannot therefore be corrected at all accurately without more data.


Appendix 

Néel’s paper is in three parts; the first covers much of the same ground as the
present paper while the other two develop the details of domain structure required
by the considerations of the first part and show that these afford a satisfactory
explanation of many observations on Bitter patterns. In the first part Néel points
out that the number of ‘phases’ (different directions) of magnetization that can
exist in a crystal depends on the direction of the resultant field. The magnetization
curve is interpreted as being made up of 6-, 3-, 2- and 1-phase regions successively,
corresponding to H = 0, H in a [111] direction, H in a [110] direction and arbitrary
H respectively. This leads to equations for the magnetization curves of oblate
spheroids and long rods which are identical with those we have obtained except that
we have treated the 2-phase region of the long rod more exactly than Néel. We have
also dealt rather more fully than Néel with the action of the demagnetizing field
in ensuring a symmetrical direction of H and with the errors introduced by various
approximations in the theory.

We should like to thank Dr D. Shoenberg for his encouragement and advice and 
Professor E. C. Stoner, F.R.S., for valuable discussion. The work forms part of a
programme of research sponsored by the Electrical Research Association to whom 
we are indebted for permission to publish. 


References 


Akulov, N. S. 1929 Z. Phys. 57, 249.
Akulov, N. S. 1931 Z. Phys. 69, 78.
Becker, R. & Döring, W. 1939 Ferromagnetismus. Berlin: Julius Springer.
von Engel, A. & Wills, M. S. 1947 Proc. Roy. Soc. A, 188, 464.
Gans, R. 1933 Article in Magnetismus (Leipzig lectures), edited by P. Debye, pp. 91 ff. Leipzig: S. Hirzel.
Gans, R. & Czerlinski, E. 1932 Schr. Königsb. gelehrt. Ges. naturw. Kl. Bd. 9, 1.
Honda, K. & Kaya, S. 1926 Sci. Rep. Tôhoku Univ. 15, 721.
Honda, K. & Okubo, J. 1916 Sci. Rep. Tôhoku Univ. 5, 153.
Kaya, S. 1933 Z. Phys. 84, 705.
Néel, L. 1944 J. Phys. Sér. 8, 5, 241.
Sizoo, G. J. 1929 Z. Phys. 56, 649.
Takagi, M. 1939 Sci. Rep. Tôhoku Univ. 28, 26.
Webster, W. L. 1925 Proc. Roy. Soc. A, 107, 496.
Williams, H. J. 1937 Phys. Rev. 52, 747.



On the inhomogeneity of plastic deformation in the 
crystals of an aggregate 

By W. Boas and M. E. Hargreaves 

Council for Scientific and Industrial Research, Section of Tribophysics, Melbourne
(Communicated by Sir David Rivett, F.R.S. – Received 11 August 1947)

[Plates 1 and 2]

The variation of plastic deformation in aluminium specimens consisting of large crystals has
been determined by measuring elongation and hardness at various points after tensile
deformation. The deformation varied from grain to grain, and also within each grain the
deformation near the boundary was greater or smaller than at the centre according to whether
the neighbour was more or less deformed, i.e. there is not necessarily inhibition of slip near
grain boundaries. These results were supported by metallographic and X-ray observations.
Their importance with respect to the calculation of the stress-strain curve of aggregates
from those of single crystals is discussed. It is suggested that a mechanism other than slip
operates near the grain boundaries during deformation, and even within the crystals during
large extensions.

Introduction 

Any theory of plastic crystal aggregates requires a knowledge of the behaviour of 
the crystals during deformation and of their interaction. Whereas the behaviour 
of single crystals under stress is well known, the effect of their interaction is not 
yet quite understood. Obviously, a crystal embedded in other crystals deforms 
differently from a crystal which is free from any constraint by neighbours. Such 
differences have been studied, particularly in aggregates of large-grain size, and 
the effect of the boundaries has been stated to consist in inhibition of slip. 

Carpenter & Elam (1921) observed that during tensile deformation of coarse-grained
aluminium the crystals exerted a supporting effect on each other. The
deformation at the boundary was much less than in the centre of the crystals, and
this effect was still noticeable at a distance of 0.25 in. from the boundary. Moreover,
they noted that ‘a crystal which narrowed on pulling in contact with one which
remained broad tended to make the latter narrow also. In its turn, the crystal
which remained broad, tended to prevent the other one from narrowing.’

In subsequent investigations only the fact that the boundaries restrict plastic
deformation has been considered. Aston (1926) found that the change of crystal
orientation during deformation is less near a grain boundary than at the centre of 
a crystal. His conclusion that the extent of deformation is correspondingly smaller 
near the boundary does not seem to be justified, since deformation due to multiple 
slip gives rise to a smaller change in orientation than the same deformation due to 
single slip. The diffuseness of the X-ray interferences usually increased as the 
boundary was approached indicating a greater amount of bending of the lattice 
planes. 



Yamaguchi (1927) determined the local elongations from marks scratched on the 
surface of aluminium specimens. He found that ‘the elongation at the boundary 
is always smaller than that of the adjacent crystals’, although some of his evidence 
contradicts this statement. 

The conclusion reached from all these investigations was that less deformation
occurs near a grain boundary than in the centre of a grain. This slip interference
producing an increased resistance to shear has been cited as an explanation of the
higher hardness observed for aggregates compared to single crystals. The only
quantitative investigation of this strengthening effect was carried out by Chalmers
(1937). He measured the elastic limit of bicrystalline specimens of tin and found it
to increase approximately linearly with the angle between the tetragonal axes of
the two crystals. The area of the boundary did not influence the extent of strengthening,
and ‘the critical shear stress is raised by the proximity of a boundary, and
the amount of slip that takes place is decreased’.

Chalmers’s experiments clearly show that the change in orientation at the boundary
is the important factor for the strengthening effect. The change in orientation
also gives rise to different deformations within a crystal at different distances from 
the boundary. Barrett & Levenson (1940) have drawn attention to this inhomogeneity 
of the deformation. They have shown the existence of deformation bands which 
indicate that various parts of a crystal suffer different deformations. 

The present investigation was undertaken to obtain quantitative data on the 
extent of the inhomogeneity and to ascertain whether the deformation is always 
restricted near the grain boundaries. 

Experimental procedure
(a) Preparation of specimens 

The material used was commercial aluminium of 99.15 % purity, the main
impurities being Fe 0.5 % and Si 0.28 %. Tensile specimens having gauge length
6 x 1 x £in. were prepared with a grain size of 1 to 3 cm. These were first annealed 
at 200° C and back-reflexion Laue diagrams made to determine the orientation of 
the various grains relative to the specimen axis. One face of each specimen was 
mechanically polished and the specimens annealed again at 200 ° C. The final surface 
preparation was by electrolytic polishing, leaving a strain-free surface. 

On this surface the grain boundaries could be seen microscopically as fine lines, 
but to facilitate their location the specimen was very lightly etched. Inclusions of 
impurities were present, but they were not preferentially distributed at grain 
boundaries. 

(b) Hardness measurements

Three types of indentation hardness measurement were attempted:

(i) Knoop diamond, load 25 g.

(ii) Vickers diamond, load 200 g.

(iii) Vickers diamond, load 1 kg.




It was found that the first two of these types were not useful after deformation
of the specimens, as surface rumpling caused a wide scattering of the results which
were not reproducible. Measurements with the Vickers diamond and 1 kg. load were
much less sensitive to this effect and were later used exclusively. The probable
error of these hardness measurements was calculated as less than 0.3 hardness unit,
i.e. approximately 1.5 %.

Hardness before deformation was measured by making a straight line traverse 
of hardness indentations parallel to the axis of subsequent tensile deformation. The 
least distance between any two impressions was approximately five times the 
length of the diagonal of an impression. 

The distance apart of the impressions was measured with a cathetometer to
±0.001 cm., illumination being arranged so that one face only of the pyramidal
impression reflected the light source. This gave a convenient and accurate reference 
line which was independent of the size of an impression. 

(c) Measurement of local deformation and hardness 

The specimens were then deformed in tension to 5 % elongation. The known distances
between hardness impressions made before deformation were now used as
gauge lengths and remeasured after deformation. The elongations of individual
grains and of various areas within particular grains were calculated from the
differences between distances measured before and after deformation. The error in
each measurement of length was ±0.001 cm., and hence in determining elongation
on a gauge length of 2 mm. the error in the value for percentage elongation was
±1.0, and correspondingly larger for smaller gauge lengths. The extent to which
elongation measurements could be localized was limited by this factor.
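
The quoted figure of ±1.0 on a 2 mm. gauge length can be reproduced with a short calculation. The sketch below assumes that the two length readings (before and after deformation) err by ±0.001 cm. in opposite senses, i.e. a worst-case rather than a root-mean-square combination; the function names are ours.

```python
def elongation_percent(length_before_cm, length_after_cm):
    """Percentage elongation of one gauge length."""
    return 100.0 * (length_after_cm - length_before_cm) / length_before_cm


def elongation_error_percent(gauge_cm, reading_error_cm=0.001):
    """Worst-case error in percentage elongation from two length readings."""
    return 100.0 * 2.0 * reading_error_cm / gauge_cm


for gauge_cm in (0.5, 0.2, 0.1):
    print(f"gauge {gauge_cm} cm: +/- {elongation_error_percent(gauge_cm):.1f} % elongation")
```

The error is inversely proportional to the gauge length, which is why the elongation measurements cannot be localized much more closely than about 2 mm. when the overall elongation is only 5 %.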

In order to determine the hardness after deformation a traverse of indentations 
was made parallel to the one made before deformation. Finally, Laue diagrams were 
made at several points suggested by the results obtained in the elongation and 
hardness measurements.

One specimen was then further deformed in tension to 17 % total elongation and 
the local deformation measured as before. 

Results 

Typical of the results obtained are those for two specimens shown in figures 1 
and 2. Hardness after deformation and percentage elongation are plotted against 
distance on the specimen. The values for elongation are plotted in the centre of the 
gauge length on which they were measured, and where possible a smooth curve is 
drawn. The heavy vertical lines represent the positions of grain boundaries; the 
grain numbers refer to the diagrams showing the surface configuration of the grains 
in the specimen. Also tabulated for each grain on the graph are the mean hardness 
value before deformation and the mean elongation. Hardness before deformation 
was constant in any grain within the limits of experimental error. The type of slip 
bands observed in various areas is indicated schematically in figure 1a.




It is obvious from the curves that the overall elongation of 5 % given to the 
specimen produces widely different elongations in the individual grains. Moreover, 
the deformation is not uniform within one grain. As a boundary is approached the 
deformation becomes larger than at the centre if the average deformation of the 
neighbour is larger, and in the neighbour the deformation is less near this boundary 
than at its centre. This feature is even more apparent in the specimen which was
extended by another 12 % (figure 2). The elongations which are so different in the
central portions of the grains become similar near the boundaries. 

It can also be seen that the curves for hardness after deformation have the same
trend as the elongation curves. Since hardness before deformation was constant 
within any grain, the curves for hardness after deformation indicate the distribution 
of work hardening and therefore of deformation within one grain. 

The hardness measurements thus complement the observations made above with
respect to the effect of the neighbour and give a better indication of the local
deformation than the elongation measurements. The latter are average values over their
respective gauge lengths, and the extreme variations thus obtained are therefore
smaller than the actual variations. However, the degree of work hardening cannot
be used to compare the degree of deformation of various grains as the rate of work
hardening varies with the orientation of the grain.

Figure 1. Hardness and local elongation of the grains of an aggregate after 5 % elongation.
(a) Specimen 19. Slip occurring at various points in the grains is shown schematically.
(b) Specimen 5.

Figure 2. Local elongation of the grains of an aggregate after 17 % elongation. Specimen 19.

These results are further supported by the evidence from microscopic observation 
of slip lines. The density of slip lines varies within a grain in a manner consistent 
with the elongation and hardness values for the grain. Where these measurements 
indicate more deformation near the boundary of a grain relative to the centre, the 
slip lines become more closely spaced as the boundary is approached and sometimes
double slip appears. Similarly, when approaching a boundary where less than 
average deformation is indicated slip lines are wider spaced, and sometimes where 
double slip occurs throughout the grain it is replaced by single slip. These features 
are illustrated by figures 3 and 4, plates 1 and 2. Finally, the inhomogeneity of 
deformation is confirmed by observation of the spots in the Laue diagrams made at 
various points in the grains. The spots are more diffuse in diagrams resulting from 
areas where high deformation is indicated by the other measurements. 

Conclusions and discussion 

From the experimental evidence the following conclusions can be drawn: 

(1) During plastic deformation of an aggregate there are wide differences in the 
amount of deformation and work hardening from grain to grain. The overall extent 
of deformation and the average hardness of the aggregate do not indicate the 
behaviour of individual grains. 

(2) Deformation is also inhomogeneous within each grain and corresponding 
hardness variations due to work hardening occur. 

(3) The inhomogeneity of deformation is due to interaction of neighbouring 
grains and may extend up to 1.5 cm. from the grain boundary. The direction of this
deformation variation is such that the deformation approaches that of the neighbour, 
i.e. the deformation tends to be continuous across the boundary. 

The question now arises as to what determines the average deformation of a grain.
No correlation could be established with the hardness before deformation. Although
an indentation is produced by the same general process as elongation, i.e. by slip,
the particular slip systems operating and the stress conditions are quite different,
and hence the lack of correlation is not surprising. One would rather expect the
original orientation of the crystal to be important in determining the average
deformation of a grain. For purely tensile deformation the ratio of shear stress along
the slip system to the external stress is given by the function sin χ cos λ, where χ
and λ are the angles between the direction of tension and the relevant slip plane and
direction respectively. If that system acts along which the shear stress is a maximum,
the corresponding value of this function should express the susceptibility of
a grain to deformation as determined by its orientation, for not only is the shear
stress acting on the system proportional to this function but also the elongation
for a given amount of shear. In figure 5 the maximum values of sin χ cos λ for the
grains represented by figure 1 have been plotted against the average elongations
of the grains. Also plotted from the results of Karnop & Sachs (1927) are the elongations
of single crystals of appropriate orientations under the same tensile stress as
was used to deform the aggregates in the present experiments.
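
The maximum value of sin χ cos λ for a given tensile axis can be found by enumerating the slip systems. The sketch below assumes the twelve {111}<110> systems usual for a face-centred cubic metal such as aluminium (the slip systems are not listed in the text) and takes the tensile axis as a vector in crystal coordinates; all names are ours.

```python
import math
from itertools import product


def unit(v):
    """Return v scaled to unit length."""
    n = math.sqrt(sum(x * x for x in v))
    return tuple(x / n for x in v)


def dot(u, v):
    return sum(a * b for a, b in zip(u, v))


def fcc_slip_systems():
    """The twelve (plane normal, slip direction) pairs of type {111}<110>."""
    planes = [(1, 1, 1), (-1, 1, 1), (1, -1, 1), (1, 1, -1)]
    directions = [(1, 1, 0), (1, -1, 0), (1, 0, 1),
                  (1, 0, -1), (0, 1, 1), (0, 1, -1)]
    # A slip direction must lie in its slip plane.
    return [(n, d) for n, d in product(planes, directions) if dot(n, d) == 0]


def max_orientation_factor(tensile_axis):
    """Largest sin(chi)*cos(lambda) over all slip systems."""
    t = unit(tensile_axis)
    best = 0.0
    for n, d in fcc_slip_systems():
        sin_chi = abs(dot(t, unit(n)))   # chi: angle between axis and slip plane
        cos_lam = abs(dot(t, unit(d)))   # lambda: angle between axis and slip direction
        best = max(best, sin_chi * cos_lam)
    return best


for axis in ((0, 0, 1), (1, 1, 1), (1, 2, 3)):
    print(axis, round(max_orientation_factor(axis), 3))
```

The function can never exceed 0.5, and for face-centred cubic crystals its maximum over the slip systems lies between about 0.27 and 0.5, which is consistent with the range of abscissae used in figure 5.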



Figure 5. Elongations of crystals under the same external stress as a function of their
orientation. × Average elongation of crystals in aggregates as observed in these experiments
after 5 % overall elongation. ○ Elongation of free single crystals observed by Karnop &
Sachs. Abscissa: sin χ cos λ, from 0.36 to 0.48.

Two features are evident:

(a) While grains of high average elongation are grains for which the value of the
function sin χ cos λ is high, the converse is not true.

(b) The points for grains in the aggregates lie about the curve for single crystals
under the same stress.

These observations lead to the conclusion that the distribution of stress in the
aggregate is not uniform, some grains receiving more and some less than the externally
applied stress. Thus while the product sin χ cos λ determines the susceptibility
of a grain to deformation under a pure tensile stress, the geometry of the aggregate
influences the actual stress on the grain and thus the extent of its deformation.

The observations are also of interest with respect to the calculation of the stress-strain
curve of polycrystalline materials from the shear hardening curve of the
single crystals. The only self-consistent calculation has been carried out by Taylor
(1938) for aluminium, and gave good agreement with the observed tensile curve.
However, as Barrett & Levenson (1940) have pointed out, the change of orientation
of approximately one-third of the crystals in the aggregate is not that predicted by
Taylor’s theory. This lack of agreement must be due to the assumptions made. The
main assumption on which the theory is based is that the material deforms uniformly
irrespective of the presence of grain boundaries and differently orientated grains.
The present investigation has shown that not only are there differences in deformation
from grain to grain, but also within any grain, thus contradicting Taylor’s
assumption.

Before a new attempt to calculate the stress-strain curve can be made, the conditions
prevailing at the grain boundaries have to be established. Certainly the slip
is not homogeneous within a crystal, and it seems even to be doubtful whether slip
is sufficient to explain the observed deformation. According to geometrical considerations
any arbitrary deformation of a crystal, for instance, the deformation
of a crystal in an aggregate, requires the occurrence of slip along at least five systems.
Experimentally only single and double slip have been observed in our aggregates.
Further, Barrett has shown that the change in orientation is not always such as
would result if five-fold slip took place. Since contact at the grain boundaries is
maintained, slip cannot be the only mechanism of deformation operating in the
crystals of an aggregate.

It appears that the deformation tends to be continuous at the boundaries, and 
if the deformation is entirely due to slip this implies quite different stresses on the 
two sides of the boundary where crystals of different orientations adjoin. Because 
of the principle of action and reaction the stress normal to the boundary and the 
shear stresses in the boundary must be continuous. Hence the mechanism of 
deformation must be such as to allow the deformation to be continuous, although 
the stresses are the same on the two sides of the boundary. In addition to the slip 
observed there must be another process for which the stress-strain relationship is 
different. The closer the approach to the boundary the greater is the relative
proportion of the second process, although under favourable conditions more slip 
takes place near the boundaries than at the centre. 

If five-fold slip were to operate at the beginning of plastic deformation the 
critical shear stress would have to be obtained on all five slip systems. The external 
stress necessary would be much higher than that which gives rise to the critical 
shear stress on the most favourable system for slip. It is suggested that non-slip 
deformation takes place at an external stress smaller than that necessary to initiate 
slip on the less favourably orientated of the five systems. No suggestion as to the 
nature of the non-slip deformation can yet be made from the experimental evidence. 

Although in the present experiments only comparatively small deformations were 
used, it is considered that at higher deformations the non-slip mechanism will 
become increasingly important. The inhomogeneity observed for small deformations 
will penetrate further into the crystal as deformation proceeds, and at higher 
deformations the extent of the non-slip process will become greater. With smaller 
grain size the effect will be more pronounced. It has already been suggested by 
Chalmers (1940) that a type of plasticity similar to amorphous flow becomes
preponderant at high deformations.

The results of this investigation were obtained from measurements on the surfaces
of specimens. A crystal in the interior of a polycrystalline specimen will be still more
restricted in its deformation, since it is not allowed the freedom in deformation at
the free surface (observed as surface rumpling) and has a larger grain-boundary area
relative to its volume. For such a crystal the non-slip mechanism will therefore play
a greater role than for a surface crystal. The continuity of deformation may then
even extend throughout the crystals, and it is conceivable that the strain will become
uniform in a small-grained specimen in agreement with Taylor's assumption. This
uniformity, however, would be the result of the non-slip process.

[Plate 1: figure 3. Plate 2: figure 4. Boas and Hargreaves, Proc. Roy. Soc. A, volume 193.]


This work forms part of a general programme, carried out by the Section of 
Tribophysics, Council for Scientific and Industrial Research, Australia. We wish to 
thank Professor E. J. Hartung for laboratory facilities in the Chemistry School, 
University of Melbourne, Professor H. K. Worner for the use of equipment in the
Rosenhain Memorial Laboratory, and the staff of the Munitions Supply Laboratories,
Maribyrnong, for carrying out spectrographic analyses and help in the use of
equipment. 


References 

Aston, R. L. 1926 Proc. Camb. Phil. Soc. 23, 549-560.
Barrett, C. S. & Levenson, L. H. 1940 Trans. Amer. Inst. Min. (Metall.) Engrs. 137, 112-126.
Carpenter, H. C. H. & Elam, C. F. 1921 Proc. Roy. Soc. A, 100, 329-353.
Chalmers, B. 1937 Proc. Roy. Soc. A, 162, 120-127.
Chalmers, B. 1940 Proc. Phys. Soc. 52, 127-131.
Karnop, R. & Sachs, G. 1927 Z. Phys. 41, 116-139.
Taylor, G. I. 1938 J. Inst. Met. 62, 307-324.
Yamaguchi, K. 1927 Inst. Phys. Chem. Res., Japan, 6, 271-278.


Description of Plates 1 and 2 
Plate 1

Figure 3. Variation of slip line density in grain 4 of specimen 5 after 5% elongation. Compare
with figure 1b. a, boundary area grains 3 and 4 (× 250). b, centre of grain 4 (× 250).
c, boundary area grains 4 and 5 (× 250).

Plate 2

Figure 4. Slip lines in grain-boundary areas of specimen 19 after 5% elongation.
Compare with figure 1a.

a, boundary between grains 2 and 3. No double slip in grain 2 near the boundary (× 23).

b, boundary between grains 1 and 2. Mainly single slip and surface rumpling in grain 1.
Double slip in grain 2 (× 6).

c, d, portion of the area shown in b, at higher magnification (× 15). The same area is shown
under different illumination in (c) and (d).

c, note the second slip system near the boundary in grain 1 in addition to the principal seen in b.
d, the extent of the second slip system in grain 1 is more apparent than in (c). The slip
lines of this second system are almost parallel to those of one of the systems in grain 2.





An investigation of radio-frequency radiation from the sun 

By M. Ryle and D. D. Vonberg 
The Cavendish Laboratory, University of Cambridge

(Communicated by Sir Lawrence Bragg, F.R.S. — Received 27 August 1947
— Read 29 January 1948)


It has been known for some time that the sun emits radio-frequency radiation whose intensity
greatly exceeds the value expected from a black-body at 6000° K. In the present paper,
experiments are described in which measurements have been made of the solar radiation at
frequencies of 175 and 80 Mcyc./sec.

Measurement of the small powers which can be abstracted from practical aerial systems
requires special types of receiving equipment if absolute measurements are to be recorded 
automatically over long periods of time. An apparatus has been developed in which the 
output power of a local source of random ‘noise’ is automatically and continuously adjusted 
so as to be equal to the aerial power; in this way the receiver is used only as an indicator 
of balance, and errors due to variation of its gain or internal noise are eliminated. 

A special type of aerial has been devised which enables the solar radiation to be recorded 
separately from the galactic radiation, and so enables continuous observation of the sun 
to be made with aerials of comparatively low directivity.

The results obtained on these two frequencies show that the sun normally emits radiation
whose intensity corresponds to a surface temperature of the order of $10^6$ °K. Large
fluctuations in the intensity occur, however, and during the passage of large sunspots, equivalent
temperatures as high as $10^8$ to $10^9$ °K have been observed. In addition to these day-to-day
variations the radiation is subject to sudden brief increases of intensity lasting only for a 
few seconds. 

Measurements of the diameter of the source, by a method analogous to Michelson’s 
stellar interferometer, have shown that during periods of very great intensity the radiation 
originates in an area of the sun of the same order of size as a sunspot. This result means that
equivalent temperatures of $10^9$ to $10^{10}$ °K must exist.

Measurements of the polarization of the radiation have shown that during periods of 
increased activity the radiation is mainly circularly polarized. 

The present account covers the experimental methods and the results obtained up to the 
present time. It is hoped to consider these results theoretically in a future paper. 


1. Introduction

This paper describes some measurements which have been made on electromagnetic 
radiation emitted from the sun at radio-frequencies. When the work was started
(October 1945), radiation from the undisturbed sun had been measured at
frequencies of the order of 10,000 Mcyc./sec. (wave-length 3 cm.) by using an aerial of
large equivalent aperture, having a narrow angle of reception (Southworth 1945;
Sander 1947). It was also known that during the passage of large sunspots across
the solar disk, radiation could be detected without the use of highly directive aerials,
on frequencies in the neighbourhood of 60 Mcyc./sec. (Appleton 1945; Hey 1946).
The intensity under these disturbed conditions appeared to decrease as the frequency 
was increased; observation at frequencies less than 20 to 30 Mcyc./sec. was restricted 
by the effect of the terrestrial ionosphere. 

If the radiation was received and amplified in a high-gain amplifier it appeared,
in the telephones and on the screen of a cathode-ray oscilloscope, to have the
characteristics of the electrical ‘noise’ which is generated in a resistor. It was
therefore natural to assume that the field of the wave was varying in a random
manner, as if it were emitted from a source at a high temperature.

The intensity of the radiation from the undisturbed sun measured at a frequency
of 10,000 Mcyc./sec. was approximately that to be expected from a ‘black-body’
source subtending an angle of ½° at a temperature of 20,000° K. At this frequency,
no increase of intensity at times of solar disturbance had been reported.
Measurement of solar radiation on 60 Mcyc./sec. had only been reported when the sun was
strongly disturbed; the failure to detect radiation at other times was presumably
partly because the intensity was so small as to be masked by the electrical ‘noise’
in the amplifier, and partly because the general radiation from the galaxy masked
any effect of solar radiation with the aerials of low directivity which were used.

In the experiments described in this paper, the aim was to make a more sensitive
measuring apparatus, so that the solar radiation on a frequency of 175 Mcyc./sec.
(wave-length 1.71 m.) could be measured continuously. This frequency was chosen
because the correspondingly shorter wave-length provided the opportunity of
constructing aerials having better angular resolving power, and yet it was not expected
to be too high for the observation of the increased radiation found on lower
frequencies during times of solar disturbance. Later, observations were also made at
a frequency of 80 Mcyc./sec., and it is hoped, in the future, to extend the
measurements to other frequencies.

Since, in any theoretical interpretation of the measurements, it is of fundamental
importance to consider the temperature of the source, it is convenient to express
the intensity of the radiation when it reaches the earth in terms of the equivalent
temperature of a ‘black-body’ source situated at the distance of the sun, and
subtending a known angle at the earth. In what follows, the intensity will frequently be
expressed in terms of the equivalent temperature of a circular source of this kind,
subtending the same angle as the visible solar disk (i.e. ½°). It is by no means implied,
as will be obvious later, that the actual source of the radiation subtends this angle,
but this method of describing the results proves to be useful when theoretical
deductions are to be made. The power measured is that which can be abstracted from an
aerial of given directivity, and the method of converting the result of this
measurement into an ‘equivalent temperature of a ½° source’ will be discussed in § 2.

The powers which are received in the comparatively non-directive aerials which 
have been used are quite small compared with the thermal power in the first circuit 
of the amplifier. It is therefore necessary to devise special circuits for the accurate 
measurement of the power from the aerial, and to consider in detail the limiting 
accuracy of such measurements. The fundamental principles which govern the 
operation of the system are discussed in § 3. The practical design of the apparatus is
described in Appendix 1, and the limiting accuracy attainable with such equipment 
is derived. It is shown that an aerial power which is less than 1 % of that arising in 
the amplifier input circuit can be detected under the conditions encountered in 
practice. 



The problem of distinguishing between the radiation received from the sun and 
from the galaxy is discussed in § 4, and an aerial system which enables the solar 
radiation to be recorded separately is described. 

The remainder of this paper is concerned with the experimental results obtained 
with apparatus working at frequencies of 175 and 80 Mcyc./sec.

During the period covered by these experiments, it is natural that similar
observations should have been made by other workers investigating radiation from the sun
on radio frequencies. Particular reference is made to the results obtained by the 
following workers: 

(a) Pawsey, Payne-Scott & McCready (1946); Lovell & Banwell (1946); Dicke 
& Beringer (1946). 

(b) McCready, Pawsey & Payne-Scott (1947) and Covington (1947); on the area
of the source of increased solar radiation. 

(c) Appleton & Hey (1946a) and Martyn (1946); on the polarization at times of 
great intensity. 

The present paper is restricted to an account of our experiments and the results 
obtained. It is hoped to consider the interpretation of these results in a future paper. 

2. The expression of random e.m.f.’s in terms of temperatures 

In much of the work on this subject it has been usual to express the results in 
terms of the mean power flux at the receiver. As already explained, it is thought 
more convenient to express the received power flux in terms of the temperature of 
a hypothetical source which subtends at the earth an angle equal to that of the solar 
disk. In this section it is shown how this temperature can be deduced from measure¬ 
ments in which the radiation received on a moderately directive aerial is compared 
with the power originating in a local source of electrical ‘noise’.

It is first recalled that, according to Nyquist (1928), the available power which
may be abstracted from a resistance at a temperature $T$ is given by the expression
$kT\,\delta f$, where $k$ is Boltzmann’s constant and $\delta f$ the frequency band over which the
energy is integrated. Similarly, the power which may be abstracted from an aerial
mounted in a black-body enclosure at a temperature $T$ is also given by $kT\,\delta f$ (Burgess
1941). If such an aerial is connected by a matched transmission line to a resistance,
equilibrium will be attained when the resistance is at the same temperature as the
enclosure. By adjusting the temperature of the resistance until no power flows, it
is thus theoretically possible to determine the temperature of the enclosure.

Simple thermodynamic arguments indicate that the state of affairs is not altered 
if the aerial is directive; equilibrium is still maintained when the temperature of 
the resistance equals that of the enclosure. If, however, instead of receiving radiation 
from an enclosure, the directive aerial is irradiated from a black body at a
temperature $T_S$, which subtends a solid angle $\omega$, then in order to maintain equilibrium
between the resistance and the aerial, the temperature of the resistance must be
adjusted to a new value $T_A$ which depends on the directivity of the aerial. If
$f(\theta, \phi)$ represents the receptivity of the aerial per unit solid angle in the direction $(\theta, \phi)$
compared with the receptivity in the direction of the source (and assumed to be
constant over the area of the source), then $T_A$ is given by

$$T_A = T_S\,\frac{\omega}{\Omega}, \quad \text{where} \quad \Omega = \iint f(\theta, \phi)\, d\theta\, d\phi.$$

(The quantity $4\pi/\Omega$ corresponds to the power gain of the aerial in the direction of
the source, compared with that of an isotropic radiator.)

The quantity $T_A$ will be known as the ‘equivalent aerial temperature’, and
represents the temperature of the attached resistance when equilibrium is established.
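The two relations of this section, the available power $kT\,\delta f$ and the equivalent aerial temperature $T_A = T_S\,\omega/\Omega$, can be illustrated numerically. The following is only a sketch: the function names, the small-angle formula for the solid angle of a circular disk, and the example values (a source at $10^6$ °K and an aerial solid angle of 1 steradian) are assumptions made for illustration and are not taken from the paper.

```python
import math

BOLTZMANN = 1.380649e-23  # Boltzmann's constant k, joules per kelvin


def available_noise_power(temperature_k, bandwidth_hz):
    """Available power k*T*df (watts) from a resistance at temperature T."""
    return BOLTZMANN * temperature_k * bandwidth_hz


def disk_solid_angle(angular_diameter_deg):
    """Solid angle (steradians) of a small circular disk (small-angle assumption)."""
    radius_rad = math.radians(angular_diameter_deg) / 2.0
    return math.pi * radius_rad ** 2


def aerial_temperature(source_temp_k, source_solid_angle, aerial_solid_angle):
    """Equivalent aerial temperature T_A = T_S * omega / Omega."""
    return source_temp_k * source_solid_angle / aerial_solid_angle


# Hypothetical example: half-degree source at 1e6 K, aerial with Omega = 1 sr.
omega = disk_solid_angle(0.5)
t_a = aerial_temperature(1.0e6, omega, 1.0)
print(f"omega = {omega:.3e} sr, T_A = {t_a:.1f} K")
print(f"available power in 1 Mcyc./sec. = {available_noise_power(t_a, 1.0e6):.3e} W")
```

With these assumed figures the aerial temperature is only some tens of degrees, which shows why the measurement of such small powers needs the special methods described in the next section.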

3. The measurement of small random e.m.f.’s

In order to measure the small powers available from the aerial system, it is
necessary to use an amplifier whose input circuit itself gives rise to thermal e.m.f.’s
corresponding to an effective temperature which will be denoted by $T_R$. In the
absence of an input from the aerial, the deflexion of the output indicator varies in
a random manner about a mean value which corresponds to $T_R$. The instantaneous
deflexion may have any magnitude, but the mean value obtained from a large number
($n$) of instantaneous readings departs from the true mean by an amount proportional
to $1/\sqrt{n}$. If the time constant of the circuits preceding the output indicator is $t_i$,
then it is possible to obtain $1/t_i$ independent readings per second. If the time
constant of the output indicator itself is increased to $t_o$, the output indicator will record
an average over the time $t_o$, so that it will record effectively a number $n$ ($\approx t_o/t_i$)
of independent readings. Then, by the previous analysis, it can be seen that the
reading will fluctuate about the true mean value by an amount proportional to
$1/\sqrt{n} = \sqrt{t_i/t_o}$. It is often more convenient to consider the band widths $\delta f_i$ ($\approx 1/t_i$) and
$\delta f_o$ ($\approx 1/t_o$) of the circuits, and the fluctuations about the mean may then be expressed
as proportional to $\sqrt{\delta f_o/\delta f_i}$. A similar analysis has been made by Dicke (1946).

If now a small random e.m.f., corresponding to an equivalent aerial temperature
$T_A$, is injected, the output deflexion will vary in a random way about a new mean
value corresponding to $T_R + T_A$. The increase in the observed mean value can be
detected if the fluctuations are less than this increase. It can therefore be seen that,
apart from other considerations, the smallest value of $T_A$ which can be detected can
be reduced indefinitely by making the ratio of the band widths $\delta f_o/\delta f_i$ small enough.
There are, however, serious practical limitations to this ratio. In the first place the
input band width $\delta f_i$ must be kept small compared with the centre frequency $f_0$,
because one aim of the experiments is to investigate the variation of the intensity
with frequency.* In the second place the time constant of the output circuit cannot
be made too large, because it is desired to record rapid changes in the intensity of
the radiation. In practice the maximum value of $\delta f_i/\delta f_o$ is about $10^6$, so that it should
be possible to detect values of $T_A$ which are of the order $10^{-3}\,T_R$.

* It will also be shown in § 7 that a measurement of the diameter of the source requires
a reasonably small band width.



In the frequency range 50 to 3000 Mcyc./sec. the lowest attainable values of $T_R$
are of the order of 1000 to 6000° K. The limit of detection in this range is therefore
of the order of a few degrees.

If an aerial system having an effective solid angle $\Omega$ is used for observing a source
at a temperature $T_S$ subtending a solid angle $\omega$, the equivalent aerial temperature
has been shown to be $T_A = T_S\,\omega/\Omega$. For a source subtending an angle of ½°, and with
aerials of reasonable size, $\omega/\Omega$ lies between $10^{-4}$ and $10^{-5}$ for the lower frequencies,
and $10^{-2}$ and $10^{-3}$ for the higher frequencies. The smallest measurable equivalent
temperature of a source subtending an angle of ½° is therefore of the order of $10^4$ to
$10^5$ °K at 50 Mcyc./sec. and $10^3$ °K at 3000 Mcyc./sec.

The practical achievement of such a sensitivity involves problems which are 
discussed in Appendix 1. 
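The chain of estimates in this section can be followed with a short calculation. This is a minimal sketch, assuming that the constant of proportionality in the fluctuation law is unity (the text states only a proportionality); the function names and the receiver temperature of 3000° K chosen for the example are illustrative assumptions.

```python
import math


def min_detectable_aerial_temp(receiver_temp_k, input_bandwidth, output_bandwidth):
    """Smallest detectable T_A, taken as T_R * sqrt(df_o / df_i).

    The proportionality constant is assumed to be 1 for illustration.
    """
    return receiver_temp_k * math.sqrt(output_bandwidth / input_bandwidth)


def min_detectable_source_temp(min_aerial_temp_k, solid_angle_ratio):
    """Invert T_A = T_S * omega / Omega to get the smallest measurable T_S."""
    return min_aerial_temp_k / solid_angle_ratio


# Band-width ratio df_i / df_o of about 1e6, as quoted in the text.
t_a_min = min_detectable_aerial_temp(3000.0, 1.0e6, 1.0)
print(f"smallest T_A: {t_a_min:.1f} K")  # a few degrees

# omega / Omega between 1e-4 and 1e-5 at the lower frequencies.
for ratio in (1.0e-4, 1.0e-5):
    print(f"omega/Omega = {ratio:.0e}: smallest T_S = "
          f"{min_detectable_source_temp(t_a_min, ratio):.1e} K")
```

The results fall in the range of $10^4$ to $10^5$ °K quoted above for the lower frequencies.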

4. The separation of solar from galactic radiation 

Radiation from the galaxy causes a simple aerial at a frequency of 175 Mcyc./sec. 
to have an equivalent temperature which never falls below about 200° K, whilst 
the corresponding figure for 80 Mcyc./sec. is about 1000° K. If accurate measurements 
of solar radiation are to be made, it is therefore necessary to use a sufficiently
directive aerial system to provide, from the sun, an e.m.f. comparable with that produced
by the galaxy. An aerial system of this kind would be inconveniently large at these 
wave-lengths, and would have the further great disadvantage that continuous 
rotation would be necessary for observation of solar radiation throughout the day. 



Figure 1. Interference pattern produced by two aerials separated by ten wave-lengths. 

An alternative system has therefore been devised which allows continuous
observation of the solar contribution separately from that due to the galaxy, without
the need for a highly directive aerial. In this system two fixed aerials are used,
separated by about ten wave-lengths, and situated on a line running east and west.
Such an arrangement produces an interference pattern having minima separated
by about 6°, as shown in polar co-ordinates in figure 1. When a source such as the
sun, which subtends a small angle, moves across such an interference pattern, the
power received varies in a periodic manner from a minimum value, which is virtually
zero, to a value twice that which would be received by a single aerial. On the other
hand, a source, such as the galaxy, which subtends an angle large compared with the
separation of the interference minima, gives rise to a nearly constant power. The
solar contribution then appears on the recorded trace as a periodically varying
signal, whose amplitude may be determined accurately even when it is smaller than
the galactic component.

Practical details of the aerials used in this system are given in Appendix 2 (a), 
and a typical record obtained during a period of increased solar activity is shown 
in figure 2. 
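The behaviour of the spaced-aerial system for a point source can be sketched with the elementary two-element interference formula. The expression used here, a received power of $2\cos^2(\pi D \sin\theta)$ times that of a single aerial, with $D$ the spacing in wave-lengths and $\theta$ measured from the normal to the base-line, is the standard idealized result and is an assumption of this sketch; the directivity of the individual aerials is ignored and the function names are illustrative.

```python
import math


def fringe_power(theta_deg, spacing_wavelengths, single_aerial_power=1.0):
    """Power from a point source at angle theta from the normal to the base-line.

    Idealized two-aerial pattern: 2 * P_single * cos^2(pi * D * sin(theta)).
    """
    phase = math.pi * spacing_wavelengths * math.sin(math.radians(theta_deg))
    return 2.0 * single_aerial_power * math.cos(phase) ** 2


def minima_separation_deg(spacing_wavelengths):
    """Angular separation of adjacent minima near the normal (about 1/D radians)."""
    return math.degrees(1.0 / spacing_wavelengths)


print(f"minima separation for 10 wave-lengths: {minima_separation_deg(10):.2f} deg")
print(f"power on the axis: {fringe_power(0.0, 10):.2f} x single aerial")
first_minimum = math.degrees(math.asin(0.05))  # sin(theta) = 1 / (2 D)
print(f"power at first minimum ({first_minimum:.2f} deg): {fringe_power(first_minimum, 10):.2e}")
```

For a spacing of ten wave-lengths the minima are a little under 6° apart, in agreement with the figure quoted above, and the power swings between zero and twice the single-aerial value.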

Figure 2. Record of solar radiation obtained with spaced aerial system.
25 January 1947. (a) 80 Mcyc./sec.; (b) 175 Mcyc./sec. [Time axis: 11.00 to 12.00 hr G.M.T.]

5. Operation of the complete system

Details of the apparatus which has been designed for measuring the small e.m.f.’s 
available from the aerial system, are given in Appendix 1. The e.m.f. from the aerial 
is compared with that from a local source of noise power, and it is arranged that the 
latter is continuously and automatically varied to maintain a balance. A diode 
operating under temperature-limited conditions is used as the local source of noise
power, and the output of such a generator is directly proportional to the mean anode
current (Schottky 1922). The anode current may therefore be used as a measure of
the power available at the termination of the aerial feeder and, after correction for
the feeder losses (see Appendix 2 (a)), as a measure of the equivalent aerial
temperature.

The anode current of the diode is therefore used to operate a pen recorder, which
then gives a continuous record of the equivalent aerial temperature. By observing
the amplitude of the fluctuating component of this trace, as the spaced aerial system
moves with the earth’s rotation, it is possible to determine that part of the equivalent
aerial temperature which is due to solar radiation.

It has been shown in § 2 that if the effective solid angle ($\Omega$) of the aerial system is
known, it is possible to convert the observed values of equivalent aerial temperature
into the equivalent temperatures of a black-body source subtending an angle of ½°.
The method of estimating $\Omega$ is discussed in Appendix 2 (a), and with this knowledge
the readings of the recorder chart can be converted into ‘equivalent temperatures
of a ½° source’.

This, in outline, is the system which has been used at frequencies of 175 and
80 Mcyc./sec. in the experiments to be described in the remainder of this paper.

6. The temporal variation of intensity 

It is convenient to divide the discussion of the results into (a) short-lived ‘ bursts ’ 
of radiation, (b) sudden large increases of intensity, observed during periods of 
increased solar activity, and (c) the day-to-day variation of mean intensity observed
over a period of several months. 

(a) ‘Bursts’ of radiation

During times of increased solar activity the radiation is characterized by the 
presence of brief periods of increased intensity. These ‘bursts’, as observed on an 
apparatus having an output time constant of about half a second, usually last for 
1 to 20 sec., and may have an intensity of 2 or 3 times the mean intensity at 
175 Mcyc./sec. or 5 to 10 times the mean intensity at 80 Mcyc./sec. Examples of
this type of burst can be seen in figure 2. 

An examination of large numbers of such records has shown that there is apparently 
no correlation between the times of individual bursts of this type on the two 
frequencies. 

(b) Sudden increases of the radiation 

At times there are sudden large increases of intensity, which may attain values 
10 times (for 175 Mcyc./sec.) and 20 to 100 times (for 80 Mcyc./sec.) the mean
intensity and may last for many minutes. Whilst it is probable that no real distinction
can be drawn between these disturbances and the small bursts described above,
it is usual for the larger increases to be observed on both frequencies although the
time of commencement may differ by a few minutes. It is also common for these
sudden increases to be associated with solar flares and the ‘catastrophic’ type of
ionosphere disturbance which produces radio fade-outs on short waves and
enhancements on very long waves. A typical example of a sudden increase is shown in
figure 3.

(c) Day-to-day variations of mean intensity 

The equivalent temperatures of a ½° source (see § 2), averaged over the four midday
hours of each day, are plotted in figure 4 for the period covered by our observations. 
It is first of interest to consider the ratio of the intensities at the two frequencies 




Figure 4. Day-to-day variation of equivalent ½° source temperature. Curve (i) 80 Mcyc./sec.
($T_{80}$); (ii) 175 Mcyc./sec. ($T_{175}$); and (iii) ratio of 80 and 175 Mcyc./sec. ($T_{80}/T_{175}$).
[Time axis: Dec. to Jul. 1947.]

some extent related, although on some occasions the intensities may show indepen¬ 
dent variations. 

It can be seen from figure 4 that the long-term average of the ratio $T_{80}/T_{175}$ is of
the order of 4, which is in accord with previous suggestions that the equivalent
temperature of the source is proportional to the square of the wave-length. There
is, however, a marked variation about this value, which must represent a true change
of the ratio of the intensities and cannot be the result of selective effects of the
ionosphere (as originally suggested by Appleton & Hey (1946b)), since any such effect
would be inappreciable at these frequencies.

Figure 5. Curve showing periodic variation of $T_{80}/T_{175}$.

There is a suggestion, in figure 4, of a 27-day recurrence tendency in the ratio
$T_{80}/T_{175}$, such as might be associated with the period of rotation of the sun. In order
to show up this tendency more clearly, the ratio curve in figure 4 has been plotted
in a different manner in figure 5. The apparent periodicities of 28, 27 and 26.8 days
noticeable on the diagram might be attributed to the presence of active regions at
different solar latitudes, since it is known that the rotation period depends on the
latitude. Since the main departures from the mean value shown in figure 5 have
a duration of the order of 2 to 3 days, any explanation would involve the assumption
of directed emission of the radiation with an angular distribution which was different
at the two frequencies. It has already been suggested that the results shown in
figure 4 lead to the conclusion that the day-to-day variations at the two
frequencies are to some extent related, and if the recurrence tendency of figure 5 is
substantiated by further measurements, the results will seem to suggest that the
intensity of the radiation at the earth is determined jointly by:

(i) the general excitation conditions, which affect both frequencies in a similar
manner, and are apparently of a random nature, and

(ii) the angular distribution of the radiation emitted by the sun, which may be
different at the two frequencies.

7. Measurement of the diameter of the source

In July 1946 the solar radiation became, for a short time, unusually intense, and
the opportunity was taken to measure the diameter of the source at a frequency of
175 Mcyc./sec. The method adopted was an extension of the spaced-aerial system
used to eliminate galactic radiation. By increasing the spacing between the aerials,
the angular separation between the minima of the interference pattern can be
progressively decreased, and when this angle becomes comparable with the angular
diameter of the source, the received e.m.f. no longer falls to zero at the minima.
By measuring the ratio of maximum to minimum e.m.f. obtained with a series of
different aerial spacings, the effective diameter of the source can be determined.
This method is analogous to Michelson’s method for determining stellar diameters,
the spaced aerials taking the place of the spaced mirrors, and the observed values
of maximum to minimum corresponding to the ‘visibility’ of the interference fringes
in the optical case.

The experiments were carried out using two arrays (each of eight half-wave
dipoles in front of a reflecting screen) mounted on an east-west line, with the axes
of maximum gain aligned on the sun at noon. Observations were made with spacings
of 25, 90 and 140 wave-lengths; a typical record obtained with a spacing of 140
wave-lengths is shown in figure 6.



With a fixed spacing between the aerials, a small change of frequency will cause
a shift of the interference pattern at angles away from the axis of the system (south).
Owing to the finite width of the frequency spectrum which is accepted by the
amplifier, the maximum to minimum ratio observed when the sun is at an appreciable
angle from south will therefore be less than the true value corresponding to the
diameter of the source. Calculation shows that for a spacing of 140 wave-lengths
and a band width of 2 Mcyc./sec., observations must be made within 40 min. of noon.
This effect is analogous to the production of coloured fringes away from the axis of
an optical interferometer.
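The size of this band-width effect can be estimated roughly as follows. The sketch rests on several assumptions that are not in the text: the sun's angle from south is taken as 15° per hour of time (declination and latitude are ignored), the pass band is treated as rectangular, and the reduction of fringe contrast is taken as $|\sin x/x|$ with $x = \pi$ times the shift of the pattern, in fringes, across the band. The function names are illustrative.

```python
import math


def fringe_shift_across_band(spacing_wavelengths, offset_deg, bandwidth, centre_frequency):
    """Shift of the pattern (in fringes) between the two edges of the pass band."""
    path_difference = spacing_wavelengths * math.sin(math.radians(offset_deg))
    return path_difference * bandwidth / centre_frequency


def contrast_reduction(shift_in_fringes):
    """Factor by which fringe contrast falls for a rectangular band (assumed)."""
    x = math.pi * shift_in_fringes
    return 1.0 if x == 0 else abs(math.sin(x) / x)


# 40 min. of time from noon, taken as 10 degrees from south (assumption).
offset_deg = 40.0 / 60.0 * 15.0
shift = fringe_shift_across_band(140, offset_deg, 2.0, 175.0)
print(f"shift across the band: {shift:.3f} of a fringe")
print(f"contrast reduced to about {contrast_reduction(shift):.2f} of its true value")
```

On these assumptions the pattern shifts by roughly a quarter of a fringe across the band at 40 min. from noon, so that the limit quoted in the text corresponds to a modest loss of contrast.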



Figure 6. Record obtained with aerial spacing of 140 wave-lengths. 26 July 1946.
[Time axis: 11.30 to 12.00 G.M.T.]

After subtracting the galactic contribution (obtained from records using a small
aerial spacing) from records of the type shown in figure 6, the midday value of the
maximum to minimum ratio corresponded to a source of diameter 10 min. of arc.
The effects of impedance mismatch and unbalance in the aerial system are
discussed in Appendix 2 (b), in which it is shown that such effects can only give rise to
an overestimate of the diameter of the source. It is therefore concluded that the
source of this intense radiation subtended an angle, in the east-west plane, not
greater than 10 min. of arc.

This result indicated that the effective temperature of the source was considerably
greater than that estimated for a source subtending an angle of ½°, and for radiation
at a frequency of 175 Mcyc./sec. was at least $2 \times 10^9$ °K.
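The relation between source width, fringe visibility and effective temperature may be illustrated as follows. The paper does not state the brightness distribution assumed in reducing the records; the uniform strip used here (visibility $|\sin x/x|$ with $x = \pi D w$, $D$ the spacing in wave-lengths and $w$ the angular width in radians) is an assumption of this sketch, as are the treatment of the maximum to minimum ratio as a power ratio, the scaling of temperature inversely with the area of a circular source, and the function names.

```python
import math


def strip_visibility(spacing_wavelengths, width_arcmin):
    """Fringe visibility for a uniform strip source of the given angular width."""
    x = math.pi * spacing_wavelengths * math.radians(width_arcmin / 60.0)
    return 1.0 if x == 0 else abs(math.sin(x) / x)


def max_to_min_ratio(visibility):
    """Ratio of maximum to minimum power, (1 + V) / (1 - V)."""
    if visibility >= 1.0:
        return math.inf  # unresolved source: the minima fall to zero
    return (1.0 + visibility) / (1.0 - visibility)


def rescaled_temperature(temp_half_degree_k, diameter_arcmin):
    """Temperature of a circular source of smaller diameter giving the same flux
    as a half-degree (30 min. of arc) source at temp_half_degree_k."""
    return temp_half_degree_k * (30.0 / diameter_arcmin) ** 2


for spacing in (25, 90, 140):
    v = strip_visibility(spacing, 10.0)
    print(f"spacing {spacing:3d} wave-lengths: visibility {v:.3f}, "
          f"max/min {max_to_min_ratio(v):.1f}")

print(f"temperature factor for a 10 min. source: {rescaled_temperature(1.0, 10.0):.0f}")
```

The sketch shows why the larger spacings are needed: at 25 wave-lengths a source 10 min. of arc wide is barely resolved, whereas at 140 wave-lengths the minima are clearly filled in. A source of this diameter must also be about nine times hotter than a ½° source giving the same received power.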

8. The polarization of the radiation 
(a) Measurement of polarization 

Experiments have been made to determine the polarization of solar radiation at
a frequency of 175 Mcyc./sec. Whilst the complete analysis of a mixture of polarized
components requires a rather complicated experiment, preliminary measurements
with an aerial system capable of rotation about an axis normal to the wave front
showed the absence of plane polarized components. In addition, it has been found
that on most occasions when the intensity is large, the polarization is completely
circular. Measurements have therefore been restricted to a determination of the
intensities of the two circularly polarized components.

Two systems have been used; the first is a development of the spaced aerial system
and is suitable for the accurate measurement of the degree of polarization even
when the intensity is smaller than that due to the galaxy; the second is not suitable
for quantitative measurements except when the solar radiation is large, but is
rapid in operation and therefore provides information on the polarization of
individual ‘bursts’ of radiation.



In the first method two simple linear aerials were used, spaced 10 wave-lengths
apart and with their axes of polarization perpendicular to each other. Figure 7
shows how the arrangement would appear when viewed from the direction of the
sun at midday. If the radiation incident on this system were randomly polarized,
as from a source of pure thermal ‘noise’, the radiation received on the two aerials
would be incoherent, so that the resultant power would not vary as the path difference 
from the source to the two aerials was altered. There would therefore be no periodic 
variation of the resultant power as the sun moved past the aerial system. The 
existence of a fixed phase relationship between the horizontal and the vertical 
components of the field in the incident wave would result in a periodic variation of 
the resultant power as the relative phase at the two aerials changed. The direction of 
the sun relative to the line between the centres of the two aerials, at the moment 
when the power reached a maximum enabled a distinction to be made between the 
incidence of right- and left-handed polarized radiation. By comparing the amplitude 
of the variation observed on the record with that obtained with two similar aerials 
having the same polarization (as in § 4), it was also possible to determine the extent 
to which the radiation was polarized. 

In the second method, two aerial systems with perpendicular planes of polarization 
were mounted with their axes coincident, but displaced by a quarter of a wave-length 
in a direction normal to the incident wave front. If the outputs of these two 
aerials were combined, the incidence of a circularly polarized wave of one sense then 
gave an output power double that of a single aerial, whilst the incidence of a wave 



110 M. Ryle and D. D. Vonberg 

of the opposite polarity caused complete cancellation, and no output power was 
obtained. By combining the outputs of the two aerials in such a way that an extra 
half wave-length was introduced in one path, the behaviour could be reversed, and 
the system could be made sensitive to the opposite polarization. For a randomly 
polarized wave, equal power outputs were obtained in the two arrangements. By 
comparing the powers obtained with the two methods of connexion it was therefore 
possible to determine immediately the sense, and the degree of circular polarization 
in the incident wave. In the presence of galactic radiation, however, it was not 
possible to assess the ratio accurately except when the intensity of solar radiation 
was large. 
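
The behaviour of the second method can be illustrated numerically. The following is a minimal sketch, not the authors' apparatus: it represents the incident wave by a pair of complex field components (a Jones vector), assumes an ideal lossless combiner (the factor 1/sqrt(2)), and uses the hypothetical names `combined_power` and `degree_of_circular_polarization`. Which of the two senses is right-handed is deliberately left unlabelled.

```python
import cmath
import math


def combined_power(ex: complex, ey: complex, extra_half_wave: bool = False) -> float:
    """Power obtained when the outputs of two crossed aerials are combined.

    The second aerial is displaced by a quarter wave-length along the
    direction of propagation, modelled here as a -90 degree phase shift.
    An extra half wave-length in one path adds a further -180 degrees.
    The 1/sqrt(2) factor models an ideal lossless combiner (an assumption).
    """
    delay = -math.pi / 2 - (math.pi if extra_half_wave else 0.0)
    total = (ex + ey * cmath.exp(1j * delay)) / math.sqrt(2)
    return abs(total) ** 2


def degree_of_circular_polarization(p_first: float, p_second: float) -> float:
    """Signed degree of circular polarization from the two methods of connexion."""
    return (p_first - p_second) / (p_first + p_second)


# Unit-power circularly polarized waves of the two senses.
sense_a = (1 / math.sqrt(2), 1j / math.sqrt(2))
sense_b = (1 / math.sqrt(2), -1j / math.sqrt(2))
single_aerial_power = abs(sense_a[0]) ** 2

p_a_first = combined_power(*sense_a)
p_a_second = combined_power(*sense_a, extra_half_wave=True)
p_b_first = combined_power(*sense_b)
p_b_second = combined_power(*sense_b, extra_half_wave=True)

print(p_a_first / single_aerial_power, p_a_second)
print(degree_of_circular_polarization(p_a_first, p_a_second))
```

For one sense the first connexion gives twice the power of a single aerial and the second gives none; for the opposite sense the roles are exchanged. Equal powers in the two connexions give a degree of polarization of zero, as for a randomly polarized wave.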

The results obtained with these systems at a frequency of 175 Mcyc./sec. have 
shown that at times of large intensity (associated with an increase of activity in 
a particular source region) the degree of circular polarization is nearly always 
complete. The sense of the polarization is different for different active regions and 
it is a possible conclusion that the radiation from any particular source is fully 
polarized; the simultaneous reception of radiation from two or more sources (as at 
times of no special activity) would then give rise to resultant fields showing 
incomplete polarization. 

It has been found that in nearly all cases the polarization of the short-lived bursts 
described in §6 (a) is of the same sense as that of the background radiation. This 
suggests that these bursts may arise from brief increases of the excitation of regions 
already responsible for the general radiation. It is possible that the general background 
of radiation is, in fact, the resultant of a large number of short bursts, only 
the larger of which can be resolved on an apparatus with an output time constant 
of the order of half a second. 

(b) Correlation with solar magnetic fields 

It is natural to suggest that the production of circularly polarized radiation is 
related to the magnetic fields of the sun, and in particular to the magnetic fields of 
sunspots which cause radiation of high intensity. It is of importance to seek a relation 
between the sense of the polarization and the direction of the field in the region of 
the source. It may be shown that a clear-cut correlation of this kind would enable 
a distinction to be made between two possible regions for the production of the 
radiation. 

An attempt has therefore been made to relate the sense of the polarization to the 
sense of the magnetic field of the sunspot as determined by observations of the 
Zeeman effect. During the period of these experiments, sunspot activity was considerable, 
and it was usually not possible to locate, with certainty, the sunspot 
responsible for the increased radiation. In addition, the complex nature of the 
magnetic fields of most large sunspots usually made it necessary to know the position 
of the source region within the spot group. It was sometimes possible to make an 
estimate of the position of the source by observation of the variations of the intensity 
as the spot group crossed the meridian, but it is clear that a conclusive result is only 
likely to be obtained by making a large number of such observations, or alternatively 
by using an aerial of high resolving power. Out of sixteen observations 
analysed in this way, nine corresponded to radiation having the polarization of the 
ordinary wave of Appleton's magneto-ionic theory for electrons, three corresponded 
to the extraordinary wave, and four were inconclusive. (For a sunspot having a 
magnetic dipole with its north-seeking pole directed away from the centre of the 
sun, the ordinary wave has left-handed polarization, i.e. the electric vector rotates 
anti-clockwise when viewed along the positive direction of propagation.) 


9. Summary of experimental results 

It has been found that the sun at all times emits radio-frequency radiation whose 
intensity exceeds that expected from a source at 6000° K. The minimum intensity 
observed for frequencies less than 200 Mcyc./sec. corresponds to radiation from a 
black-body source at a temperature of the order of $10^6$ °K and subtending an angle 
of ½°. If the radiation can only escape from certain restricted areas of the solar disk, 
the equivalent temperature must be correspondingly higher. 

During the appearance of sunspots, particularly when these are near the meridian, 
the intensity may be much increased, reaching values corresponding to a source 
at a temperature of the order of $10^8$ °K for 175 Mcyc./sec., and $10^9$ °K for 80 Mcyc./sec. 
radiation. Experiments have shown that the radiation under these conditions is 
emitted from a region whose size is of the same order as the visual sunspot. The source 
regions must therefore have equivalent temperatures of at least $1 \times 10^9$ °K for 
175 Mcyc./sec. radiation. Measurements of the diameter of the source at lower 
frequencies have not been made, but it is possible that still higher equivalent 
temperatures may be necessary to account for the observed intensities. 
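
The step from the whole-disk figure to the source-region figure is an area scaling. The sketch below assumes circular sources of uniform brightness and takes the source diameter as 10 min. of arc, the upper limit quoted in Appendix 2; the function name is hypothetical.

```python
def scaled_equivalent_temperature(
    reference_temperature_k: float,
    reference_diameter_arcmin: float,
    source_diameter_arcmin: float,
) -> float:
    """Equivalent temperature needed from a smaller source to give the same flux.

    The received power is proportional to temperature times solid angle, so the
    temperature scales inversely with the area of the source.
    """
    area_ratio = (reference_diameter_arcmin / source_diameter_arcmin) ** 2
    return reference_temperature_k * area_ratio


# 1e8 K referred to a half-degree (30 min. of arc) disk, confined to 10 min. of arc.
spot_temperature_k = scaled_equivalent_temperature(1e8, 30.0, 10.0)
print(f"{spot_temperature_k:.1e} K")
```

On these assumptions the factor is nine, which gives a value of the order of $10^9$ °K; a smaller source would require a proportionately higher temperature.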

During the periods of greater activity, the radiation is characterized by sudden 
brief increases of intensity. These ‘bursts’ of radiation may have intensities as great 
as 100 times the mean value, with durations of a few seconds to many minutes. 
The larger bursts (which are usually observed nearly simultaneously on both 175 
and 80 Mcyc./sec. and are often accompanied by other evidence of solar activity) 
may represent an increase in source area, but it seems probable that sudden increases 
of the equivalent temperature must be involved. 

The radiation during periods of increased intensity has been found to be circularly 
polarized. 

The authors wish to express their appreciation of the guidance and encouragement 
of Mr J. A. Ratcliffe throughout this work. We would also like to acknowledge the 
great assistance of Messrs K. E. Machin and H. M. Stanier and others in the 
Cavendish Laboratory in the design and construction of the experimental apparatus. 
For the supply of solar data and for much useful discussion we are indebted to 
Professor F. J. M. Stratton and the staff of the Solar Physics Observatory and to 
Mr H. W. Newton of the Royal Observatory, and for providing special magnetic 
data we have to thank Mr S. B. Nicholson of the Mount Wilson Observatory. 

The work was carried out during the tenure of an I.C.I. Fellowship by one of us 
(M.R.), while the other held the Oliver Lodge Scholarship of the Institution of 
Electrical Engineers. We owe the generous supply of apparatus and other facilities 
to a grant from the Department of Scientific and Industrial Research. 


Appendix 1. The design of apparatus for measuring small random e.m.f.’s 

(a) Practical difficulty of achieving theoretical sensitivity 

It was shown in § 3 that it is theoretically possible to detect e.m.f.’s very much 
smaller than the random e.m.f.’s originating in the input circuit of the amplifier used. 
(If $T_A$ and $T_N$ represent the equivalent temperatures of the signal and input circuit, 
$T_A > \sqrt{(\delta f_o/\delta f_i)}\,T_N$, where $\delta f_i$ and $\delta f_o$ are the input and output band widths 
respectively.) The possibility of achieving such a sensitivity in practice will now be 
considered, with particular reference to the measurement of solar radiation, where 
automatic operation over long periods of time is necessary. 
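
As a rough numerical illustration of this limit, the sketch below evaluates $\sqrt{(\delta f_o/\delta f_i)}\,T_N$ for the figures quoted later in this appendix (an input circuit temperature of about 3000° K, an input band width of 2 Mcyc./sec. and an output band width of 5 cyc./sec.). The function name is hypothetical.

```python
import math


def minimum_detectable_temperature(
    input_circuit_temperature_k: float,
    input_bandwidth_hz: float,
    output_bandwidth_hz: float,
) -> float:
    """Smallest signal temperature distinguishable from the residual fluctuations."""
    return math.sqrt(output_bandwidth_hz / input_bandwidth_hz) * input_circuit_temperature_k


limit_k = minimum_detectable_temperature(3000.0, 2e6, 5.0)
print(f"theoretical limit: {limit_k:.1f} K")
```

The limit comes out at a few degrees, several hundred times smaller than the temperature of the input circuit itself.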

The use of a narrow band width filter on the output of a conventional receiver 
makes it possible to reduce the residual fluctuations of the output to a value which 
corresponds to the required limit of detection, but the mean value of the output is 
directly proportional to the receiver gain, and thus a change of gain of the order of 
0.1 % will produce a comparable change of output reading. The maintenance of the 
gain of a high-gain amplifier to such an order of accuracy is not practicable without 
periodic calibration. In addition, a change of $T_N$ (such as may occur as the result 
of ageing of the first valve) results in errors which are independent of those due to 
changes of gain. 

These difficulties have been largely overcome by Dicke (1946) in a method which 
periodically replaces the aerial by a load resistance at room temperature. If the load 
resistance and aerial present equal impedances to the amplifier input, the amplitude 
of the square-wave output when such a switching is carried out at a high speed, 
provides a measure of the difference between room temperature and the equivalent 
aerial temperature. 

The output reading is, however, still proportional to the gain of the receiver, 
which must therefore be kept constant. The output also depends on the law of the 
detector, and, in general, calibration of the apparatus is necessary. Any such 
calibration must be made with an e.m.f. having a wave form similar to that of the 
e.m.f. to be measured. The use of such a calibration source also renders it unnecessary 
to correct for variation of the amplifier gain across the band of frequencies accepted 
and is therefore desirable even when the law of the detector is known. 

A method has been developed which overcomes these difficulties and, besides 
giving an output reading which is independent of the gain and frequency response 
of the amplifier, and the law of the detector, has the additional advantage of being 
capable of operating over a wide range of input powers without the necessity for 
calibrated switching of the gain. 

(b) Description of the system adopted 

In this system a local source of noise power is continuously adjusted so as to be 
in equilibrium with the aerial. This adjustment is achieved by alternately switching 
the amplifier input between the aerial and the noise source. If the impedances of 
the aerial and the noise source are equal, equilibrium between noise source and aerial 
is attained when equal power flows at the amplifier input in the two switch positions. 
This condition is independent of the impedance and the equivalent temperature of 
the amplifier input. If the powers introduced from the aerial and from the noise 
source are not equal, the output of the detector will vary at the switching frequency, 
and this output voltage can be used to adjust the effective temperature of the noise 
source until equilibrium is attained. Such adjustment does not depend on the gain 
or frequency response of the amplifier, or on the law of the detector. Figure 8 is 
a schematic diagram of the system. The accuracy with which measurements can be 
made with such a system will be discussed in the next section. 
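
The independence of the balance point from the gain and the detector law can be checked with a small simulation. This is a sketch under stated assumptions, not the circuit of figure 8: the loop update rule, the normalization of the error (standing in for an automatic gain control), the starting value and the names `balance_noise_source`, `linear_law` and `square_law` are all hypothetical.

```python
def balance_noise_source(
    aerial_temperature_k: float,
    gain: float,
    detector_law,
    input_circuit_temperature_k: float = 3000.0,
    rate: float = 0.5,
    steps: int = 400,
) -> float:
    """Adjust a local noise source until both switch positions give equal output.

    The detector output in each switch position depends on the gain and the
    detector law, but the balance condition (equal outputs) does not.
    """
    source_temperature_k = 300.0  # assumed starting point near room temperature
    for _ in range(steps):
        out_aerial = detector_law(gain * (aerial_temperature_k + input_circuit_temperature_k))
        out_source = detector_law(gain * (source_temperature_k + input_circuit_temperature_k))
        # Component at the switching frequency, normalized so that the loop
        # speed does not depend on the gain (an assumption of this sketch).
        error = (out_aerial - out_source) / (out_aerial + out_source)
        source_temperature_k += rate * error * (source_temperature_k + input_circuit_temperature_k)
    return source_temperature_k


def linear_law(power: float) -> float:
    return power


def square_law(power: float) -> float:
    return power * power


readings = [
    balance_noise_source(5000.0, gain, law)
    for gain in (1.0, 1e3, 1e6)
    for law in (linear_law, square_law)
]
print(readings)
```

Every combination of gain and detector law settles at the same source temperature, equal to the assumed aerial temperature, and the assumed temperature of the input circuit cancels as well.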



Figure 8. System for the continuous measurement of small radio-frequency powers. 

In practice, the input powers involved are inconveniently large for production in 
a heated load resistance, and a diode operating under temperature-limited conditions 
provides a more convenient source of noise power, control of the output being 
achieved by adjustment of the filament temperature. 

The r.m.s. noise current in a band width $\delta f$ flowing in the anode circuit of a 
temperature-limited diode has been shown by Schottky (1922) to be given by the 
expression $\sqrt{(2eI\,\delta f)}$, where $I$ is the mean anode current and $e$ the electronic charge. 
If this noise current flows through a resistance $r$, the noise power generated in the 
resistance is given by $2eIr\,\delta f$. In order to provide a source of noise power which 
presents the same impedance as the aerial, it is necessary to terminate the feeder 
from the switch to the noise source in a matching resistance ($R$). The diode noise 
current now flows through a resistance of $\tfrac{1}{2}R$, and the available diode noise power 
which can be abstracted from the feeder is given by $\tfrac{1}{2}eIR\,\delta f$, which corresponds to 
a temperature $T_D$ of the terminating resistance given by 

$$T_D = \frac{\tfrac{1}{2}eIR\,\delta f}{k\,\delta f} = \frac{eIR}{2k},$$

where $k$ is Boltzmann’s constant. 

The total available power therefore corresponds to a temperature $T_0 + eIR/2k$, 
where $T_0$ is the actual temperature of the resistance. 

The equivalent aerial temperature may thus be measured in terms of the mean 
anode current of the diode. 
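
The scale of the instrument, in degrees per milliampere, follows directly from the expression $eIR/2k$. The sketch below evaluates it for a matching resistance of 75 ohms, a value assumed purely for illustration and not taken from the text; the function name is likewise hypothetical.

```python
ELECTRON_CHARGE_C = 1.602176634e-19
BOLTZMANN_J_PER_K = 1.380649e-23


def diode_noise_temperature(anode_current_a: float, matching_resistance_ohm: float) -> float:
    """Temperature increment e*I*R/(2k) contributed by a temperature-limited diode."""
    return ELECTRON_CHARGE_C * anode_current_a * matching_resistance_ohm / (2.0 * BOLTZMANN_J_PER_K)


# Assumed 75 ohm matching resistance, 1 mA of mean anode current.
degrees_per_milliampere = diode_noise_temperature(1e-3, 75.0)
print(f"{degrees_per_milliampere:.0f} K per mA")
```

Because the relation is linear in the anode current, a milliammeter in the diode anode circuit can be scaled directly in degrees.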

The operating range of the instrument may be altered by inserting an attenuator 
between the diode noise source and the change-over switch. The effective temperature 
of such a system can be shown to be $(T_D + T_0)\alpha + T_0(1 - \alpha) = T_0 + \alpha T_D$, 
where $T_D = eIR/2k$, $T_0$ = room temperature and $\alpha$ = power attenuation constant. 
Under these conditions the sensitivity, expressed in degrees per milliampere of 
diode anode current, is determined only by $\alpha$, and if the gain of the amplifier is 
automatically maintained at the optimum value by a long time constant automatic 
gain control circuit, the change of the attenuation constant is the only adjustment 
required to change the operating range. 
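
The attenuator relation can be verified term by term: the attenuator passes a fraction of the power from the diode and its termination, and adds its own thermal contribution at room temperature. The sketch below assumes an attenuator at the same room temperature as the terminating resistance; the function name and the numerical values are hypothetical.

```python
def attenuated_source_temperature(
    diode_temperature_k: float,
    attenuation: float,
    room_temperature_k: float = 290.0,
) -> float:
    """Effective temperature seen through an attenuator of power ratio `attenuation`."""
    transmitted = (diode_temperature_k + room_temperature_k) * attenuation
    added_by_attenuator = room_temperature_k * (1.0 - attenuation)
    return transmitted + added_by_attenuator


effective_k = attenuated_source_temperature(4000.0, 0.1)
print(effective_k)
```

The room-temperature terms combine so that only the diode contribution is scaled, which is why the attenuation constant alone fixes the number of degrees per milliampere.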

The finite time taken to alter the temperature of the diode filament results in a 
time lag in the correction of a misalignment between aerial power and diode power, 
and this may lead to instability of the system. This effect is analogous to the instability 
of all types of servo-mechanism containing ‘inertia’ and may be eliminated 
by applying to the diode filament an additional voltage proportional to the rate of 
change of the anode current. 

(c) Analysis of the accuracy of the system 

In this section the sources of error in the system will be considered and an estimate 
made of their magnitude. 

(i) ‘Following’ accuracy 

In order to provide the necessary control voltage to the filament of the noise 
diode, a small misalignment must exist between the powers reaching the amplifier 
input from the aerial and from the noise source. The actual error in the mean recorded 
reading which is introduced depends on the preset bias voltage applied to the diode 
filament in the absence of an input signal, but the maximum error which can occur 
with any bias setting can be estimated. 

If the overall gain of the amplifier is $G$, and the output voltage which must be 
applied to the filament of the diode to vary the noise output from zero to maximum 
is $V$, the maximum error in the recorded reading of equivalent aerial temperature is 
given by $\delta T_A = V/G$. It is therefore important that the overall gain should be as large 
as possible. The gain is limited by overloading of the output stage of the amplifier 
(on the assumption that the necessary conditions for stability, as discussed in the 
previous paragraph, are satisfied). If the injected e.m.f.’s are small compared with 
the random fluctuations originating in the input circuit of the amplifier (i.e. $T_A \ll T_N$), 
the signal at the output stage consists of a steady voltage ($G\,\delta T_A$) due to the input 
misalignment, and a random fluctuation of approximately $GT_N/\sqrt{n}$, where $n = \delta f_i/\delta f_o$. 
If the output stage overloads at an output voltage of $V'$, the maximum gain is 
limited to a value given by $G(\delta T_A + T_N/\sqrt{n}) = V'$. The maximum error in the mean 
value of the recorded reading is therefore given by 

$$\delta T_A/V = (\delta T_A + T_N/\sqrt{n})/V',$$

therefore 

$$\delta T_A = [V/(V' - V)]\,T_N/\sqrt{n}.$$

In practice it is not convenient to make $V'$ many times $V$, so that a static ‘following’ 
error of the same order of magnitude as the residual fluctuations of amplifier 
noise ($T_N/\sqrt{n}$) is to be expected. Although such a gain could be used satisfactorily 
under static conditions, the response to rapid changes of equivalent aerial temperature 
would be poor owing to the fall of gain which would result from overloading 
of the amplifier with a steadily increasing input from the aerial. An evaluation of 
the optimum value of gain requires a detailed analysis of the response of the system 
to various wave forms of the signal envelope, such as a step function and a continuously 
increasing signal (‘velocity lag’), but with a fixed value of $V'/V$, it becomes 
necessary to reduce the gain considerably, particularly if it is required to follow 
changes comparable with the heating time of the diode filament. For a response 
time of the order of 0.2 sec., a reduction of gain of approximately 3:1 has been found 
most suitable for the particular diode used, and the ‘following’ error will therefore 
be of the order of $3T_N/\sqrt{n}$. 

At a frequency of 175 Mcyc./sec., $T_N \simeq 3000$° K, and for an input band width of 
2 Mcyc./sec. and an output band width of 5 cyc./sec. the error will be about 15° K. 

With input e.m.f.’s corresponding to aerial temperatures greater than $T_N$, the 
output of the amplifier is given by $G(\delta T_A + T_A/\sqrt{n})$, thus giving a following error of 
the order of $3T_A/\sqrt{n}$. With large signals a proportional error is therefore introduced 
which has a value, for the same band width constants, of about ½ %. 
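
The figures quoted in this subsection can be reproduced from the expressions for the following error. The sketch below uses the values stated in the text (an input circuit temperature of 3000° K, band widths of 2 Mcyc./sec. and 5 cyc./sec., and a 3:1 reduction of gain); the function name and the overload ratio of 2 used in the first call are hypothetical.

```python
import math


def static_following_error(input_circuit_temperature_k: float, n: float, overload_ratio: float) -> float:
    """Static error [V/(V' - V)] * T_N / sqrt(n), with overload_ratio = V'/V."""
    return input_circuit_temperature_k / math.sqrt(n) / (overload_ratio - 1.0)


n = 2e6 / 5.0  # ratio of input to output band widths
residual_fluctuation_k = 3000.0 / math.sqrt(n)

# With V' = 2V the static error equals the residual fluctuation itself.
static_error_k = static_following_error(3000.0, n, overload_ratio=2.0)

# A 3:1 reduction of gain trebles the error.
small_signal_error_k = 3.0 * residual_fluctuation_k
large_signal_error_percent = 100.0 * 3.0 / math.sqrt(n)

print(f"n = {n:.0f}")
print(f"small-signal following error = {small_signal_error_k:.1f} K")
print(f"large-signal following error = {large_signal_error_percent:.2f} %")
```

On these figures the small-signal error is a little under 15° K and the proportional error a little under ½ %, in keeping with the rounded values given above.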

(ii) Effect of a mismatch of the aerial 

If the aerial and noise diode present unequal impedances to the amplifier input 
circuit, the noise originating in the input circuit will develop a different input e.m.f. 
in the two switch positions. Thus although a negligible loss of aerial power may occur 
as a result of such a mismatch, the change of amplifier noise may have the effect of 
displacing the effective aerial temperature by a constant increment. The apparatus 
will then still read changes of aerial temperature correctly, but will have a displaced 
zero proportional to the noise power in the first circuit of the amplifier. An accurate 
evaluation of this error involves a knowledge of the noise power which can be 
affected by a change of the shunt impedance (i.e. noise due to circuit losses and transit 
time but excluding the anode current shot effect) and also a knowledge of the value 
of impedance mismatch which gives the best noise factor. For frequencies in the 
neighbourhood of 175 Mcyc./sec., the grid circuit noise (including that due to a finite 
transit time) is likely to be of the same order as the shot noise, and an equivalent 
temperature ($T'_N$) of 1500° K can be assumed. 

For the case of a ‘ matched ’ input, the variation of the input e.m.f. to the amplifier 
due to grid circuit noise, as small departures from the matched condition are made, 
can be shown to correspond to changes of aerial temperature of the order of 
$T'_N\{1 - 4/(1 + y)^2\}$, where $y$ is the ratio of the aerial impedance to the matched 
value. 

For a value of $y$ of 1.1 and $T'_N$ = 1500° K, the error corresponds to a displacement 
of the equivalent aerial temperature by 150° K. It is therefore necessary to maintain 
the aerial impedance accurately equal to the impedance of the diode noise source, 
if absolute determinations of equivalent aerial temperatures of a few hundred degrees 
are required. It has been shown in § 4, however, that providing changes of equivalent 
aerial temperature can be determined accurately, absolute measurements to this 
order of accuracy are usually unnecessary since the incidence of galactic radiation 
makes the direct measurement of solar radiation impracticable with a simple aerial 
system. 
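
The size of the zero displacement can be checked numerically. The sketch below assumes that the expression in the text reads $T'_N\{1 - 4/(1 + y)^2\}$, which is a reading of a damaged formula and should be treated as such; the function name is hypothetical.

```python
def zero_displacement(grid_noise_temperature_k: float, impedance_ratio: float) -> float:
    """Displacement of the zero for an aerial impedance `impedance_ratio` times the matched value."""
    return grid_noise_temperature_k * (1.0 - 4.0 / (1.0 + impedance_ratio) ** 2)


displacement_k = zero_displacement(1500.0, 1.1)
print(f"{displacement_k:.0f} K")
```

With this reading a 10 % mismatch gives about 140° K, of the same order as the 150° K quoted, and the displacement vanishes for a matched aerial.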

(iii) Output of diode noise source 

The system relies on a knowledge of the absolute noise output of the diode in terms 
of the mean plate current. This requires a knowledge of the output of the diode itself, 
together with the effect of the attenuators used, including any impedance transformation 
which these may introduce. 

The noise power generated by a temperature-limited diode is proportional to the 
load resistance through which its anode current flows. At low frequencies this may 
be taken as the external load applied at the valve base, and can be accurately 
measured. At frequencies of 200 Mcyc./sec. and higher, the lead inductance and 
stray capacity of the valve structure may introduce an appreciable impedance 
transformation which will give rise to a proportional error in the power output. 
Measurements at 175Mcyc./sec. have shown that this error is unlikely to exceed 
10 % with the normal type of valve with a pressed-glass base (such as the CV172) 
but at frequencies higher than 300Mcyc./sec. serious errors may arise unless a more 
suitable type of valve is available. Such errors could then only be eliminated by 
calibration against another standard of radio-frequency power. 

(iv) Overall accuracy of the system 

Apart from the initial calibration of the diode noise source which may be necessary 
at the higher frequencies, the theoretical overall accuracy of the automatically 
balancing system described in this appendix has been shown to be of the order of 
± (150° + ½ %), with a differential accuracy of the order of ± (15° + ½ %). The 
fluctuations at the output due to noise originating in the amplifier should be of the 
order of ± 5° K. These figures are applicable for a value of $T_N$ of 3000° K, and a value 
of $n$ (the ratio of input/output band widths) of $4 \times 10^5$. 




(v) Measured performance of apparatus 

Measurements carried out on the system using an amplifier having a noise factor 
of 11 db. ($T_N$ = 4000° K), with an input band width of 4 Mcyc./sec. and an output 
band width of 4 cyc./sec. (i.e. $n = 10^6$), gave the following results: 

Fluctuation of output reading: 0.5 ± 2° K (the calculated value is 4° K). 

Maximum following error (with gain reduction factor of 0.3): 18 ± 2° K for small 
signals, and 0.4 ± 0.05 % for large signals (the calculated figures are 13° K and 
0.33 %). 

It may therefore be seen that the performance figures which can be achieved in 
practice are in good agreement with theory. 
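
The calculated figures quoted for the test apparatus follow from the same expressions as before, with the gain reduction factor entering as a divisor. The sketch below is an illustrative check using the stated values (4000° K, 4 Mcyc./sec., 4 cyc./sec. and a factor of 0.3); the function and key names are hypothetical.

```python
import math


def calculated_performance(
    input_circuit_temperature_k: float,
    input_bandwidth_hz: float,
    output_bandwidth_hz: float,
    gain_reduction: float,
) -> dict:
    """Theoretical fluctuation and following errors for given band widths."""
    n = input_bandwidth_hz / output_bandwidth_hz
    fluctuation_k = input_circuit_temperature_k / math.sqrt(n)
    return {
        "n": n,
        "fluctuation_k": fluctuation_k,
        "small_signal_error_k": fluctuation_k / gain_reduction,
        "large_signal_error_percent": 100.0 / (gain_reduction * math.sqrt(n)),
    }


calculated = calculated_performance(4000.0, 4e6, 4.0, 0.3)
for name, value in calculated.items():
    print(f"{name}: {value:.3g}")
```

These reproduce the calculated values of 4° K, 13° K and 0.33 % against which the measurements are compared.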

Appendix 2. The design of aerial systems for measuring solar radiation 

(a) Determination of equivalent source temperature 

It has been shown in Appendix 1 how it is possible to measure the equivalent 
temperature of the output of an aerial feeder in terms of the output of a diode noise 
source. The equivalent aerial temperature can be deduced by correcting for the loss 
in the aerial feeder. If this loss causes a reduction of the power by a factor $\beta$, the 
apparent equivalent temperature of an aerial having a true equivalent temperature 
$T_A$ can be shown to be $\beta T_A + (1 - \beta)T_0$, where $T_0$ is the temperature of the feeder. 

It was shown in Appendix 1 that the use of an attenuator ($\alpha$) between diode and 
switch reduces the effective temperature of the diode ($T_D + T_0$) to $T_0 + \alpha T_D$. 

Thus equilibrium will be attained when 

$$\beta T_A + (1 - \beta)T_0 = T_0 + \alpha T_D,$$

i.e. 

$$T_A = (\alpha/\beta)\,T_D + T_0.$$
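
The balance condition can be checked by equating the apparent aerial temperature at the end of a lossy feeder with the effective temperature of the attenuated diode source. The sketch below assumes that feeder and attenuator are both at the same room temperature; the function names and numerical values are hypothetical.

```python
def apparent_aerial_temperature(aerial_temperature_k: float, feeder_factor: float, room_temperature_k: float) -> float:
    """Temperature at the receiver end of a feeder passing a fraction `feeder_factor` of the power."""
    return feeder_factor * aerial_temperature_k + (1.0 - feeder_factor) * room_temperature_k


def aerial_temperature_at_balance(
    diode_temperature_k: float,
    attenuation: float,
    feeder_factor: float,
    room_temperature_k: float = 290.0,
) -> float:
    """True aerial temperature when the diode source balances the aerial."""
    return (attenuation / feeder_factor) * diode_temperature_k + room_temperature_k


t_aerial_k = aerial_temperature_at_balance(2000.0, attenuation=0.5, feeder_factor=0.8)
seen_from_aerial = apparent_aerial_temperature(t_aerial_k, 0.8, 290.0)
seen_from_source = 290.0 + 0.5 * 2000.0
print(t_aerial_k, seen_from_aerial, seen_from_source)
```

The room-temperature terms cancel on the two sides, so the feeder loss enters only as a scale factor on the diode reading.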

In order to determine the equivalent temperature of a ½° source, having measured 
the equivalent aerial temperature, it is necessary to know $\Omega$, the integral over a 
sphere of the receptivity of the aerial system relative to the receptivity in the 
direction of the source. An attempt to determine $\Omega$ directly, by measuring the 
power gain in the direction of the source, would be liable to serious errors due to 
ground reflexions unless carried out with the aerial mounted at the correct angle of 
elevation. The measurements would involve the use of an elevated source, and 
owing to the practical difficulty of providing such a source, it was found more 
convenient to calculate $\Omega$ from the polar diagram of the aerial system. 

The aerial used on 175 Mcyc./sec. consisted of an array of eight half-wave dipoles 
mounted $\tfrac{1}{4}\lambda$ in front of a reflecting screen. When such an aerial is used to observe 
the radiation from a source, the ratio of source temperature to aerial 
temperature ($\Omega/\omega$) for a source situated in the maximum of the polar diagram is 
$6.4 \times 10^3$. If two such aerials are used for measuring solar radiation separately from 
the galactic background, the difference between maximum and minimum of the 
equivalent aerial temperature as the interference pattern is swept past the sun must 
therefore be multiplied by $3.2 \times 10^3$ in order to give the equivalent temperature of 
a ½° source. If such observations are made at times other than near noon (when the 
sun is near the maximum of the polar diagram of each aerial in the horizontal plane), 





it is necessary to correct for the reduction in gain. Similarly, the elevation angle of 
the aerials must be adjusted throughout the year so that the maximum of the 
vertical polar diagram also coincides with the direction of the sun at midday. The 
errors which are likely to be introduced by such causes, and in the estimation of $\Omega$, 
make it difficult to determine the equivalent temperature of a source to an accuracy 
of better than ± 10 %. 








On 80Mcyc./sec. the larger size of the aerials necessitated a somewhat simpler 
arrangement, and only four half-wave elements were used in each half of the system. 
The feeding arrangements for the two types of aerial are shown in figures 9 and 10. 

With aerials of this type, the smallest detectable temperature for a source is 
of the order of $10^4$ °K. 
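
The conversion from aerial temperature to source temperature, and the resulting limit of detection, can be sketched as follows. The sketch takes the factor for the single aerial as $6.4 \times 10^3$ and that for the spaced pair as $3.2 \times 10^3$, treats the source as a uniform disk subtending ½°, and takes the output fluctuation as the ± 5° K estimated in Appendix 1; the function names are hypothetical.

```python
import math


def disk_solid_angle(diameter_deg: float) -> float:
    """Solid angle (steradians) of a small circular disk, small-angle approximation."""
    radius_rad = math.radians(diameter_deg) / 2.0
    return math.pi * radius_rad ** 2


def source_temperature(aerial_temperature_change_k: float, ratio: float = 3.2e3) -> float:
    """Equivalent temperature of a half-degree source from the fringe amplitude."""
    return ratio * aerial_temperature_change_k


omega_source = disk_solid_angle(0.5)
omega_aerial = 6.4e3 * omega_source  # implied integral of the aerial receptivity
smallest_detectable_k = source_temperature(5.0)

print(f"source solid angle = {omega_source:.2e} sr")
print(f"implied aerial solid angle = {omega_aerial:.2f} sr")
print(f"smallest detectable source temperature = {smallest_detectable_k:.1e} K")
```

A fluctuation of a few degrees at the aerial thus corresponds to a source temperature of the order of $10^4$ °K, consistent with the limit stated above.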

(b) The effect of mismatch of the aerial impedances on the measurement of source diameter 

It has been shown in Appendix 1 (c) that if the aerial feeder and diode noise source 
present unequal impedances to the amplifier input, a displacement of the zero of the 
record of equivalent aerial temperature can occur. Such a displacement would 
affect the observed maximum to minimum ratio if it were comparable with the 
amplitude at the minima, and would have the effect of increasing or decreasing the 
apparent diameter of the source. For an impedance mismatch ratio of 1.1:1, the 
maximum zero displacement of equivalent aerial temperature is of the order of 
150° K. During the experiments described in § 7 the intensity of solar radiation was 
very great and even at the minima the equivalent aerial temperature was many 
times the maximum zero displacement. A mismatch of the impedance of the whole 
aerial system could not therefore produce an appreciable error in determining the 
source diameter. 

An additional source of error may arise from asymmetry of the aerial system; any 
difference between the polar diagrams or impedances of the two aerials would cause 
unequal powers to reach the junction point of the two feeders, and would thus 
increase the observed minimum power. Such an effect would be interpreted as an 
increase of the source diameter. 

The polar diagrams of the two aerials could easily be made sufficiently equal to 
introduce no appreciable error, but careful impedance measurement was necessary, 
since a mismatch ratio of 1.1:1 could give rise to a minimum power, with a point 
source, equal to 10 % of the maximum. With the maximum aerial spacing which 
could be used, the observed values of maximum to minimum ratio were of the 
order of 8:1. 

It should be noted that any errors in the determination of source diameter which 
are due to unequal impedances of the two aerials, will result in an overestimate of 
the diameter. The final figure of 10 min. of arc, obtained during these experiments, 
is therefore a maximum value. 


References 

Appleton, E. V. 1945 Nature, 156, 534. 
Appleton, E. V. & Hey, J. S. 1946a Nature, 158, 339. 
Appleton, E. V. & Hey, J. S. 1946b Phil. Mag. 37, 73. 
Burgess, R. E. 1941 Proc. Phys. Soc. 53, 293. 
Covington, A. E. 1947 Nature, 159, 405. 
Dicke, R. H. 1946 Rev. Sci. Instrum. 17, 268. 
Dicke, R. H. & Beringer, R. 1946 Astrophys. J. 103, 375. 
Hey, J. S. 1946 Nature, 157, 47. 
Lovell, A. C. B. & Banwell, C. J. 1946 Nature, 158, 517. 
Martyn, D. F. 1946 Nature, 158, 308. 
McCready, L., Pawsey, J. L. & Payne-Scott, R. 1947 Proc. Roy. Soc. A, 190, 357. 
Nyquist, H. 1928 Phys. Rev. 33, 110. 
Pawsey, J. L., Payne-Scott, R. & McCready, L. 1946 Nature, 157, 158. 
Sander, K. F. 1947 Nature, 159, 506. 
Schottky, W. 1922 Ann. Phys., Lpz., 68, 157. 
Southworth, G. C. 1945 J. Franklin Inst. 239, 285. 


Natural evaporation from open water, bare soil and grass 

By H. L. Penman 

Physics Department, Rothamsted Experimental Station, Harpenden, Herts. 

(Communicated by B. A. Keen, F.R.S. Received 9 October 1947) 

[Plate 3] 

Two theoretical approaches to evaporation from saturated surfaces are outlined, the first 
being on an aerodynamic basis in which evaporation is regarded as due to turbulent transport 
of vapour by a process of eddy diffusion, and the second being on an energy basis in which 
evaporation is regarded as one of the ways of degrading incoming radiation. Neither approach 
is new, but a combination is suggested that eliminates the parameter measured with most 
difficulty, surface temperature, and provides for the first time an opportunity to make 
theoretical estimates of evaporation rates from standard meteorological data, estimates 
that can be retrospective. 

Experimental work to test these theories shows that the aerodynamic approach is not 
adequate and an empirical expression, previously obtained in America, is a better description 
of evaporation from open water. The energy balance is found to be quite successful. Evaporation 
rates from wet bare soil and from turf with an adequate supply of water are obtained as 
fractions of that from open water, the fraction for turf showing a seasonal change attributed 
to the annual cycle of length of daylight. Finally, the experimental results are applied to 
data published elsewhere and it is shown that a satisfactory account can be given of open 
water evaporation at four widely spaced sites in America and Europe, the results for bare 
soil receive a reasonable check in India, and application of the results for turf shows good 
agreement with estimates of evaporation from catchment areas in the British Isles. 

List of symbols used 

$x, y, z$            Co-ordinate axes downwind, crosswind and vertical. 
$u_z$                Mean horizontal wind velocity in $x$ direction measured at height $z$; usually in miles/day. 
$T_s, T_a, T_d$      Temperature of surface, air and dewpoint; usually °F. 
$e_s, e_a, e_d$      Saturation vapour pressure at above temperatures; usually in mm. Hg. 
                     Relative humidity $= e_d/e_a$. 
                     $de_a/dT_a$. 
                     $(e_s - e_a)/(e_s - e_d)$. 
                     Constant defining hydrolapse. 
                     Bowen’s ratio $= \gamma(T_s - T_a)/(e_s - e_d)$. 
$\gamma$             Constant of wet and dry bulb hygrometer equation; in °F and mm. Hg, $\gamma = 0.27$. 
$E_0, E_B, E_T$      Evaporation rate from open water, bare soil and turf; usually in mm./day. 
$E_a$                Value of $E_0$ obtained by putting $e_s = e_a$ in sink strength formula. 
$R_c$                Short-wave radiation from sun and sky; usually in evaporation equivalent of mm./day. 
$R_A$                Angot value of $R_c$ for a completely transparent atmosphere. 
$r$                  Radiation reflexion coefficient. (Also used for runoff without any possibility of confusion.) 
$m/10$               Fraction of sky covered by cloud. 
$n/N$                Ratio of actual/possible hours of sunshine. 
$H$                  Net radiant energy available at surface. 
$K, S, C$            Parts of $H$ used in convective transfer to air, storage in water, conduction to surround. 
5B, 16T              Depths to water-table (in.) and type of cover (bare, turf). 
A                    Specific yield of soil. 
$R, D$               Rainfall, drainage. 
$B$                  Beaufort wind force. 


1. Introduction 

Three kinds of surface are important in the return of rain to the atmosphere. For
extended areas of land, they are, in order of importance: vegetation, on which plant
leaves act as transpiring surfaces; bare or fallow soil, from which water evaporates
at, or just below, the soil-air interface; and open water, from which evaporation
takes place directly. Although the last may be of predominant importance locally,
e.g. in attempting to assess the water balance of lakes and reservoirs, the chief
justification of the great attention given to it (see § 2, below) is found in the opportunity
it presents of providing a reproducible surface of known properties. Because
of this, it is convenient to approach the problems of the dependence of evaporation 
from bare and cropped soil on weather conditions through a study of evaporation 
from open water, seeking an absolute relation between weather elements and open 
water evaporation, and comparative relations between losses from the soil and losses 
from open water exposed to the same weather. 

Evaporation from bare soil involves complex soil factors as well as atmospheric 
conditions: transpiration studies add to these further important physical and 
biological features, for a plant’s root system can draw on moisture throughout a 
considerable depth of soil, its aerial parts permit vapour transfer throughout a 
considerable thickness of air, and its photo-sensitive stomatal mechanism restricts 
this transfer, in general, to the hours of daylight. A complete survey of evaporation 
from bare soil and of transpiration from crops should take account of all relevant 
factors, but the present account will be largely restricted to consideration of the 
early stages that would arise after thorough wetting of the soil by rain or irrigation, 
when soil type, crop type and root range are of little importance. 




2. The estimation of evaporation from weather data 

Two requirements must be met to permit continued evaporation. There must 
be a supply of energy to provide the latent heat of vaporization, and there must be 
some mechanism for removing the vapour, i.e. there must be a sink for vapour. 
Analytical attacks on the problem start from one of these two points and it is convenient
to consider the latter first as it has been the more popular.

(a) Sink strength 

(i) Empirical equations 

Until recent years the approach was empirical, a hundred years’ work since Dalton
having produced little improvement in the form of equation he gave. In essentials
it is

$$E = (e_s - e_d)\,f(u), \tag{1}$$

where $E$ is the evaporation in unit time, $e_s$ is the vapour pressure at the evaporating
surface, $e_d$ is the vapour pressure in the atmosphere above, and $f(u)$ is a function of
the horizontal wind velocity. For water, $e_s$ is known if the surface temperature is
known. Of the many empirical formulae cast into this form, one due to Rohwer 
( 1931 ) summarizes results of very intensive work at Fort Collins, Colorado, at 
5000 ft. above sea-level. Other things being equal, Rohwer found a small variation 
of evaporation rates with atmospheric pressure, and reduced to conditions at sea-level,
his equation for the daily rate from an open water surface 3 ft. square is

$$E = 0.40\,(e_s - e_d)(1 + 0.27\,u_0)\ \text{mm./day}, \tag{2}$$

where vapour pressures are in mm. mercury, and wind speed at ground level is in
m.p.h. Examining the effect of size of surface on evaporation rates, over a period
of 485 days, he compared the observed values of evaporation from a large surface
88 ft. diameter with the estimates based on (2), and found the mean value of
observed/estimated to be 0.77. There is some bias here, however, for the average wind speed
over the whole period was only 1.50 m.p.h., and examination of the individual daily
records shows that on the rare occasions of a wind speed in excess of 3 m.p.h. the
correction factor is nearly unity. The ground wind velocity, $u_0$, is an extrapolated
value estimated from a number of readings at various heights, and if from Rohwer’s
$u$, $z$ curve we interpolate at 2 m., the relation becomes

$$E = 0.40\,(e_s - e_d)(1 + 0.17\,u_2)\ \text{mm./day}, \tag{3}$$

and except at very low wind speeds might be expected to apply to large open water
surfaces.
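
Rohwer's two relations are simple enough to evaluate directly. The following is a minimal sketch and not part of the original paper: the function names are hypothetical, and the daily-run variant assumes that a wind run in miles/day may be converted to a mean speed in m.p.h. by dividing by 24.

```python
def rohwer_ground_level(e_s, e_d, u0_mph):
    """Equation (2): evaporation (mm/day) from a 3 ft square water surface.

    e_s, e_d are vapour pressures in mm Hg; u0_mph is ground-level wind in m.p.h.
    """
    return 0.40 * (e_s - e_d) * (1.0 + 0.27 * u0_mph)


def rohwer_two_metres(e_s, e_d, u2_mph):
    """Equation (3): the same relation with the wind interpolated at 2 m (m.p.h.)."""
    return 0.40 * (e_s - e_d) * (1.0 + 0.17 * u2_mph)


def rohwer_two_metres_daily_run(e_s, e_d, u2_miles_per_day):
    """Equation (3) fed with a daily wind run (assumption: mean m.p.h. = run / 24)."""
    return rohwer_two_metres(e_s, e_d, u2_miles_per_day / 24.0)
```

Under that assumption the wind coefficient per mile/day is 0.17/24, or about 7.1 x 10^-3.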

(ii) Aerodynamic equations 

As an alternative to this empirical treatment an aerodynamical approach has
been made in recent years. A simple treatment (Penman 1940) showed that the right
order of magnitude could be obtained by assuming that the main resistance to the
evaporation current is provided by a thin layer of air (c. 1 to 3 mm. thick) next to the




surface, in which air movement is essentially non-turbulent, and vapour movement
across which is by a process of molecular diffusion. The more formal treatment,
having wider implications than the solution of evaporation problems, has considered
the turbulent mixing and transport of the vapour outside this sublaminar
boundary layer, and it attempts to take into account the dependence of evaporation
rates per unit area upon size and shape of the test area as well as upon weather
elements. An account of this work is given by Brunt (1939) up to and including
the work of O. G. Sutton (1934). Extension of Sutton’s work by W. G. L. Sutton
(1943) and Pasquill (1943) has given an expression for the total evaporation from
a rectangular strip of length $x_0$ downwind and width $y_0$:

$$E(x_0, y_0) = C\,(e_s - e_d')\,u_2^{0.76}\,x_0^{0.88}\,y_0, \tag{4}$$

where $C$ is a constant related to the absolute temperature, $e_d'$ is the vapour pressure
of the air at a height great enough to be unaffected by the evaporation, and $u_2$ is
the wind velocity at $z = 2$ m. Although $e_d'$ is unobservable, it has been possible to
use the same general theory to express the shape of the hydrolapse, and to set
$(e_s - e_d) = \alpha\,(e_s - e_d')$, where $e_d$ is the measured value at screen height, and
$\alpha$ (= 0.52) is almost independent of $u$ and $x_0$. Differentiating (4), the rate of
evaporation at $x_0$ is obtained, and, substituting numerical values appropriate to zero
temperature gradient, it becomes

$$E = \ldots\ \text{mm./day}. \tag{5}$$

In the open it is impossible to fix the position of the leading edge, but arbitrarily
putting $x_0 = 1.6 \times 10^6$ cm. (10 miles), the evaporation rate becomes

$$E = 0.376\,(e_s - e_d)\,u_2^{0.76}\ \text{mm./day}, \tag{6}$$

where $e_s$ and $e_d$ are in mm. mercury, and $u_2$ is now in miles/hr. If $u_2$ is measured in
miles/day (a practical convenience) the rate is

$$E = 0.033\,(e_s - e_d)\,u_2^{0.76}\ \text{mm./day}. \tag{6a}$$

Notes. (1) The assumption of zero temperature gradient involves the identity of
$e_s$ and $e_a$, where $e_a$ is the saturation vapour pressure at the mean air temperature;
$e_s - e_d$ then becomes $e_a(1 - h)$.

(2) A tenfold increase in $x_0$ will decrease $E$ in the ratio $(1/10)^{0.12}$, i.e. to 1/1.3,
and the constant in (6a) will become 0.025.
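
The step from (6) to (6a) and the fetch correction of note (2) are both pure arithmetic, and can be checked with a short sketch. It is illustrative only: the function names are hypothetical, and the wind exponent 0.76 and fetch exponent 0.12 are taken as read from the equations above.

```python
def constant_for_miles_per_day(constant_mph=0.376, wind_exponent=0.76):
    """Convert the constant of (6), wind in miles/hr, to wind in miles/day.

    u (miles/hr) = u (miles/day) / 24, so the constant is divided by 24**exponent.
    """
    return constant_mph / 24.0 ** wind_exponent


def fetch_factor(fetch_ratio, fetch_exponent=0.12):
    """Factor by which E changes when x_0 is multiplied by fetch_ratio (note 2)."""
    return fetch_ratio ** (-fetch_exponent)
```

`constant_for_miles_per_day()` gives about 0.0336, in line with the 0.033 of (6a), and `0.033 * fetch_factor(10)` gives about 0.025.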

(b) Energy balance

Certain simplifying assumptions are needed; where they are known to be reasonable,
reliable estimates of evaporation are possible. Using as the unit of energy the
amount required to evaporate 1/10 g. of water at air temperature (59 cal.) it is possible
to build up the following expression for the heat budget, $H$, taking into account
the incoming short-wave radiation from sun and sky, and the long-wave exchanges
between earth and sky (Brunt 1939; equation 15, p. 136; equation 25, p. 144):

$$H = R_c(1 - r - \mu) - \sigma T_a^4\,(0.56 - 0.092\sqrt{e_d})(1 - 0.09\,m), \tag{7}$$


where $R_c$ is the measured short-wave radiation/cm.$^2$/day,
$r$ is the reflexion coefficient for the surface,
$\mu$ is the fraction of $R_c$ used in photosynthesis,
$\sigma T_a^4$ is the theoretical black-body radiation at $T_a$ °K,
$e_d$ is in mm. mercury,
and $m/10$ is the fraction of sky covered with cloud.

Using the convenient symbols of Cummings & Richardson (1927), the heat budget
is used in evaporation, $E$, heating of the air, $K$, heating of the test material, $S$, and
heating of the surroundings of the test material, $C$, i.e.

$$H = E + K + S + C. \tag{8}$$

Over a period of several days, and frequently over a single day, the change in the
stored heat, $S$, is negligible compared with other changes and the same may be
true of the heat conducted through the walls of the test material container. Thus,
(8) can often be safely reduced to

$$H = E + K. \tag{9}$$

The transport of vapour and the transport of heat by eddy diffusion are, in essentials,
controlled by the same mechanism, and apart from the differences in the molecular
constants, the one is expected to be governed by $(e_s - e_d)$ where the other is governed
by $(T_s - T_a)$. To a very good approximation, therefore, it is possible to write down
the ratio of $K/E$ in the form

$$K/E = \beta = \gamma\,(T_s - T_a)/(e_s - e_d), \tag{10}$$

where $\beta$, the ratio symbolized by Bowen (1926) as $R$, has the value $-1$ in the standard
wet and dry bulb hygrometer equation, and $\gamma$ is the standard constant of this
equation. In °F and mm. Hg, $\gamma = 0.27$.

Thus

$$H = E(1 + \beta), \qquad E = H/(1 + \beta). \tag{11}$$
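
Equations (10) and (11) amount to a two-line calculation once the surface temperature is known. A minimal sketch follows; the function names and the numerical inputs in the usage note are hypothetical and chosen only for illustration, not taken from the paper.

```python
GAMMA = 0.27  # hygrometer constant in deg F and mm Hg, as quoted in the text


def bowen_ratio(t_surface_f, t_air_f, e_s, e_d, gamma=GAMMA):
    """Equation (10): beta = gamma (T_s - T_a) / (e_s - e_d)."""
    return gamma * (t_surface_f - t_air_f) / (e_s - e_d)


def evaporation_from_heat_budget(h, beta):
    """Equation (11): E = H / (1 + beta), with H and E in the same units (mm/day)."""
    return h / (1.0 + beta)
```

For instance, a surface 2 °F warmer than the air with $e_s - e_d = 4$ mm. gives $\beta = 0.135$, so that a heat budget of 4.0 mm./day would correspond to an evaporation of about 3.5 mm./day.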

Of the terms on the right-hand side of (7), the radiation term will rarely be
directly measurable, but for periods of the order of a month or more it can be
estimated from duration of sunshine. Angot has given tables of the total radiation
to be expected if the atmosphere were perfectly transparent (Brunt (1939), p. 112),
and there appears to be a general correlation between $R_c/R_A$ and $n/N$ in the form
$R_c/R_A = a + b\,n/N$, where $n/N$ is the ratio actual/possible hours of sunshine. For
Virginia, U.S.A., Kimball (1914) finds $a = 0.22$, $b = 0.54$; for Canberra, Australia,
Prescott (1940) finds $a = 0.25$, $b = 0.54$. At Rothamsted, monthly values over the
period 1931-40 lead to $a = 0.18$, $b = 0.55$, with a suggestion of a seasonal variation.
Using these latter constants we have

$$R_c = R_A\,(0.18 + 0.55\,n/N). \tag{12}$$




In terms of the maximum to be expected ($R_A^{*}$, for $n = N$), equation (12) becomes

$$R_c = R_A^{*}\,(0.25 + 0.75\,n/N), \tag{12a}$$

agreeing with the form given by Ångström for Stockholm (Brunt (1939), p. 127).
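
The passage from (12) to (12a) is a renormalization of the two Rothamsted constants by their sum. The sketch below, with hypothetical function names, shows the estimate of short-wave radiation from sunshine duration and the renormalized coefficients; it is an illustration rather than the author's own computation.

```python
def short_wave_from_sunshine(r_angot, sunshine_ratio, a=0.18, b=0.55):
    """Equation (12): R_c = R_A (a + b n/N), in the units of r_angot."""
    return r_angot * (a + b * sunshine_ratio)


def coefficients_relative_to_clear_day(a=0.18, b=0.55):
    """Divide a and b by (a + b), the value of R_c / R_A when n = N."""
    clear_day_fraction = a + b
    return a / clear_day_fraction, b / clear_day_fraction
```

With $a = 0.18$ and $b = 0.55$ the clear-day fraction is 0.73, and the renormalized pair rounds to (0.25, 0.75).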

The value of $\mu$ is very small (c. 0.005) and can be neglected. The value of $r$ will
vary with season and type of surface. For water its annual mean will be about 0.05
in the British Isles; for bare soil, about 0.10; and for turf might be about 0.20
(Geiger (1927) quoting Ångström). Note that $r$ and $\mu$ will be the only effective factors
in (7) discriminating between the different types of surface.

The terms expressing the net flow of radiation to and from a cloudless sky are
due to Brunt and are based on the mean values of the constants obtained in six
correlations of the energy flow with mean air temperature. Sverdrup (1945) gives
a diagram indicating values of the same order but with slightly greater values of
$\sigma T_a^4 f(e_d)$. The uncertainty here, however, is negligible compared with that arising
from the cloudiness term. It is obvious that cloud control of long-wave radiation
must depend upon cloud type, and as a provisional expedient to make some allowance
for this it is proposed to set $m/10 = 1 - n/N$. Equation (7) therefore reduces to

$$H = E(1 + \beta) = (1 - r)\,R_A\,(0.18 + 0.55\,n/N) - \sigma T_a^4\,(0.56 - 0.092\sqrt{e_d})(0.10 + 0.90\,n/N), \tag{13}$$

where $R_A(0.18 + 0.55\,n/N)$ is to be replaced by $R_c$ when this is known. The parameters
represented on the right-hand side of (13) are easily determined; to obtain $E$ it is
necessary to obtain $\beta$, which involves knowing the surface temperature (equation
10). The sink strength approach also involves this knowledge, and although arrangements
can be made to measure it experimentally, it is desirable to eliminate it for
the prediction of evaporation or for a survey of evaporation as a climatic element.
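
Equation (13) can be sketched numerically as follows. This is an illustration under stated assumptions, not the author's procedure: the Stefan-Boltzmann constant in SI units and the conversion 1 cal = 4.184 J are supplied here, the function names are hypothetical, and the black-body term is converted to an evaporation equivalent with the paper's unit of 59 cal per 1/10 g. (1 mm. depth over 1 cm.$^2$).

```python
import math

STEFAN_BOLTZMANN = 5.670e-8  # W m^-2 K^-4 (assumed modern value)
JOULES_PER_CAL = 4.184       # assumed conversion
CAL_PER_CM2_PER_MM = 59.0    # the paper's unit: 59 cal evaporates 1 mm over 1 cm^2


def black_body_mm_per_day(t_air_kelvin):
    """sigma * T_a^4 expressed as an evaporation equivalent in mm/day."""
    joules_per_m2_day = STEFAN_BOLTZMANN * t_air_kelvin ** 4 * 86400.0
    cal_per_cm2_day = joules_per_m2_day / JOULES_PER_CAL / 1.0e4
    return cal_per_cm2_day / CAL_PER_CM2_PER_MM


def net_radiation(short_wave, r, e_d, sunshine_ratio, t_air_kelvin):
    """Equation (13) with R_c supplied directly (mm/day); e_d in mm Hg."""
    incoming = (1.0 - r) * short_wave
    back = (black_body_mm_per_day(t_air_kelvin)
            * (0.56 - 0.092 * math.sqrt(e_d))
            * (0.10 + 0.90 * sunshine_ratio))
    return incoming - back
```

At 288 °K the black-body term comes to roughly 13.7 mm./day, so with a humidity factor near 0.27 the back radiation is controlled mainly by the sunshine factor.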


(c) Combination of sink strength and energy balance
From (1), expected to take the form of (6) or (6a), we have

$$E = (e_s - e_d)\,f(u). \tag{1}$$

Let $E_a$ be the value of $E$ obtained by putting $e_a$ instead of $e_s$. Then

$$E_a = (e_a - e_d)\,f(u),$$

i.e.

$$E_a/E = 1 - (e_s - e_a)/(e_s - e_d) = 1 - \phi \ \text{say}. \tag{14}$$

From (10) and (11):

$$E = H/(1 + \beta) = H/[1 + \gamma(T_s - T_a)/(e_s - e_d)].$$

If we set $T_s - T_a = (e_s - e_a)/\Delta$, where $\Delta$ is the slope of the $e : T$ curve at $T = T_a$, then

$$H/E = 1 + \gamma(e_s - e_a)/\Delta(e_s - e_d) = 1 + \gamma\phi/\Delta. \tag{15}$$

From (14) and (15):

$$E = (H\Delta + E_a\gamma)/(\Delta + \gamma), \tag{16}$$

$$e_s = (e_a - \phi\,e_d)/(1 - \phi), \tag{17}$$





i.e. E can be estimated from air conditions only, and, if required, an estimate of 
surface temperature can be obtained that might be useful outside evaporation 
studies. 

In addition to the constants, readily obtainable from standard sources, the 
weather parameters needed are mean air temperature, mean dewpoint, mean wind 
velocity at a standard height and mean duration of sunshine. 
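
The elimination of surface temperature can be followed in a short sketch. The two functions below implement (16) and (17) as they stand in the text; the function names are hypothetical, and the round-trip values in the usage note are invented solely to confirm that the algebra closes.

```python
def combined_evaporation(h, e_a_term, delta, gamma=0.27):
    """Equation (16): E = (H * Delta + E_a * gamma) / (Delta + gamma).

    h is the heat budget H, e_a_term is E_a = (e_a - e_d) f(u), and delta is the
    slope of the saturation vapour pressure curve at air temperature.
    """
    return (h * delta + e_a_term * gamma) / (delta + gamma)


def surface_vapour_pressure(h, e, e_a, e_d, delta, gamma=0.27):
    """Equations (15) and (17): recover phi from H/E, then e_s from phi."""
    phi = delta * (h / e - 1.0) / gamma
    return (e_a - phi * e_d) / (1.0 - phi)
```

As a check, take $f(u) = 0.7$, $e_s = 14$, $e_a = 12$, $e_d = 9$ and $\Delta = 0.45$: then $E = 3.5$, $E_a = 2.1$, $\phi = 0.4$ and $H = 4.34$, and the two functions return 3.5 and 14 respectively.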

3. Rothamsted experiment 1944, 1945 
(a) Experimental site 

Experiments have been carried out in the meteorological enclosure at Rothamsted,
situated at about 420 ft. above O.D. in open parkland in the Chiltern Hills. The
enclosure includes a brick-lined pit, 8 ft. deep and 20 ft. diameter, around which
twelve cylinders were set in the soil in 1924, five of them being filled with a sandy
loam from Woburn (Bedfordshire), the soil texture being uniform throughout. The
cylinders are 6 ft. deep and 2 ft. 6 in. diameter and are made of cast iron lined with
a 1 in. layer of bitumen-painted concrete; the bottoms have a slope down to an
outlet pipe accessible from the pit. (See figure 1 and figure 7, plate 3.)




Figure 1. (a) Section of cylinder, and (b) plan of the pit.

The soil was left to settle and weather for 16 years so that some semblance of 
natural structure could be attained and by May 1940 the settlement amounted to 
6 in. This was made good by a further supply of Woburn soil; experimental work
was done in 1941 and 1942, and in the spring of 1944 there was a further slight topping
up in preparation for the work now to be described.





Figure 7 (plate 3). The experimental site, June 1944, looking north-east.







Ten cylinders were joined up in pairs at the outlets, each soil cylinder being connected
to an unfilled cylinder referred to below as the ‘minor’, so forming a set of
U tubes. Figure 1 shows the arrangement schematically, with three cylinders
labelled (O, 5B and 16T). These are the main ones to be discussed and were the same
in both years; changes were made in the others early in 1945. Waterproof covers
were provided for the minors to prevent entry of rain and to reduce evaporation
losses to negligible amounts. On A and C turves were laid in April 1944 and on D in
March 1945; the other soil surfaces were kept bare.

At the outset the minors were filled with water until the soil or turf surfaces were
flooded and then water was run out until the water-table had reached a pre-determined
depth below the soil surface. The depths and the nature of the surface are
given in table 1:


Table 1. Depths (in.) of water-table and nature of surface

cylinder ...      A         B         C         D         E
1944            16 (T)    16 (B)    10 (T)    10 (B)     5 (B)
1945            16 (T)    24 (B)    24 (T)    36 (T)     5 (B)


Cylinder e was filled to near the brim and the level kept at about 1 in. below. This 
was the first open water standard, referred to as cylinder O. In the early summer
of 1945 a tank of sheet galvanized iron, 2 ft. 5 in. diameter and 2 ft. deep was supplied
by the Meteorological Office and was set up at the north end of the enclosure about
50 yd. north of the pit. A hole 1 ft. 9 in. deep was dug into which the tank fitted firmly,
and the water-level was kept at or near ground level, so leaving a projecting rim of
3 in. This tank, referred to as tank MO, had the same area as the cylinder but was
shallower, had a thinner and more conducting wall material, was completely surrounded
by turf-covered soil whereas the other had the pit on one side, and had
a higher effective rim.

Ground level round the cylinders had been raised so that soil level was the same
inside and out except on one side of cylinder O where the topping up was not complete;
for all the cylinders there was a big discontinuity in surface on the pit side and
although the pit should have been roofed in, it was not practicable at the time and
a major objection to the experimental site had to be accepted as unavoidable.
Figure 7, plate 3, shows the exposure to the north-east and the disposition of some
of the other components of the enclosure; in the centre, beyond the pit, is the large
rain gauge (1/1000 acre) used for rainfall values employed below, and to the right of
it are visible two of the bare drain gauges on which earlier Rothamsted work was 
based. The general exposure here is good, although the presence of the large gauges 
might affect local eddies with east and north-east winds. The general exposure to 
north-west was equally good, that to south-west a little worse, and that to south-east 
poorest of all due to an extended belt of trees, the nearest being about 80 yd. away. 

The surround varied during the experiment. The local exterior topping up 
remained bare for a while, but a crop of weeds soon developed and gave a local cover 
nearly enough equivalent to the turf of the enclosure. By the summer of 1945 there 





was a fair amount of grass in this and it was possible to cut it with a mower. Growth
was very rapid during 1945 and part of the surround, too rough for a mower, got out 
of hand and for a while there was a stand of tall grass on the west side which may have 
had an adverse effect on transpiration from cylinder A (16T). In both years the field 
to the east was sown with mangolds giving a green cover from June until late autumn, 
at which time the soil surface was moist and remained moist for the remainder of 
the winters. The field to north and west carried an oat crop in 1944, about 1 ft. high 
in May, 3 ft. high throughout July and harvested in early August; the undersown 
clover then provided a green cover for the remainder of the season, was grazed during 
1945, but grew away from the animals and a hay crop was eventually taken from it. 
To the south there was a pasture, kept short by grazing. Thus, except for short 
periods, the experimental surfaces could be reasonably described as being in the 
midst of an extensive area of short vegetation and as long as this transpired at 
maximal rate, then for so long did the experimental conditions come close to 
satisfying the basic assumptions from which equation (6) was derived. 

(b) Measurements and calibrations

In addition to the normal 09.00 observations of a third-order meteorological 
station, supplementary measurements were made. 

(i) Temperature. Mercury-in-steel thermographs were set up on cylinders 0, D, E 
and 0; that on D was transferred to tank MO in June 1945. The long bulbs were 
horizontal and were either pressed into the soil or supported in the water so that 
about half was below the surface and half above the surface; the water bulbs had 
sheaths of muslin to ensure that they were always wet all over. They were calibrated 
in place, and in all cases the corrected ‘surface’ temperature is more truly the mean
temperature of the top few mm. of soil or water. To find the daily mean a smooth
curve was drawn through the thermogram and a mean of six readings at 4 hr.
intervals found; this was corrected from the calibration curve and the corresponding
value of the saturation vapour pressure taken as the daily mean value of $e_s$.

Only one value of the dewpoint was obtained per day and although this was found 
to be adequate for long period surveys it was not always adequate for individual days, 
particularly where there was a pronounced change during the day. The Dunstable 
Branch of the Meteorological Office kindly supplied estimates of the dewpoint at 
6 hr. intervals for each day and from these it was possible to see the way in which the 
dewpoint had changed in a given period of 24 hr. and to weight the Rothamsted 
values accordingly. Obtaining a reliable daily mean value of the dewpoint remains 
one of the main experimental problems to be solved. 

The mean air temperature is never of first order of importance and it has been 
sufficient to take the conventional mean of maximum and minimum for this parameter.

(ii) Wind . A three-cup anemometer was set up at 2 m. in the south-west corner 
of the enclosure in 1944, and was moved to the middle of the enclosure early in 1945 
so as to be about half-way between the pit and the new MO tank. The scale reading 




was read once daily, and in view of this it was an obvious convenience to use miles/day 
as the unit of wind velocity (equation (6a); figures 2 and 3). The instrument was 
calibrated in December 1944 in terms of a similar instrument with smaller cups, 
with which a calibration curve was supplied. The curve was non-linear at low speeds 
and calibration of the experimental instrument is consequently uncertain in this 
region: table 2 shows the result: 

Table 2. Calibration of anemometer (miles/day)

observed       0     30     60     90    120    150    210    270
corrected     38     50     78    106    133    161    215    272
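
A daily reading lying between the tabulated points has to be corrected by some form of interpolation. The paper does not say how this was done; the sketch below assumes simple linear interpolation between neighbouring entries of table 2, and the function name is hypothetical.

```python
from bisect import bisect_right

OBSERVED = [0, 30, 60, 90, 120, 150, 210, 270]    # table 2, miles/day
CORRECTED = [38, 50, 78, 106, 133, 161, 215, 272]  # table 2, miles/day


def corrected_wind_run(observed):
    """Linearly interpolate the calibration of table 2 (assumed method)."""
    if not OBSERVED[0] <= observed <= OBSERVED[-1]:
        raise ValueError("reading lies outside the calibrated range")
    # Index of the upper bracketing point, clamped so the last entry is usable.
    upper = min(bisect_right(OBSERVED, observed), len(OBSERVED) - 1)
    x0, x1 = OBSERVED[upper - 1], OBSERVED[upper]
    y0, y1 = CORRECTED[upper - 1], CORRECTED[upper]
    return y0 + (observed - x0) * (y1 - y0) / (x1 - x0)
```

Readings outside 0 to 270 miles/day are rejected rather than extrapolated, since the text notes that the calibration is already uncertain at low speeds.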




Figure 3. Daily evaporation per unit partial pressure difference for open water surface
(tank MO), plotted against $u_2$ (miles/day). The curve is: $E_0/(e_s - e_d) = 0.065\,u_2^{\ldots}$.



A Dines wind recorder on the laboratory roof, about J mile away, was used to estimate
direction and variation of wind speed during the day, where necessary.

(iii) Radiation. A continuous record of radiation intensity on a horizontal surface 
was obtained each day, the total area under the trace measured by planimeter and 
the figure so obtained converted to the equivalent number of mm. of water that the 
total energy would evaporate at air temperature. 

The duration of sunshine was obtained from a Campbell-Stokes recorder.

(iv) Evaporation. Daily measurements were made of the depth of the water-levels
below arbitrary zeroes.

Cylinders A, B, C and D. A rigid cradle was built into the top of the minor into
which could be fitted a framework carrying a screw ending in a sharp pointed dipstick.
The screw carried a scale that moved past a fixed index mark and readings
could be made to better than £ mm. except on very windy days. 

Cylinders E and 0. Measurements here were somewhat cruder. A solid straight 
edge was placed across the top of the minor in a marked position and, by means of 
a guide, a pointed rule was slid down until it just touched the water surface. With 
care, readings could be reproduced to within about J mm., and in the major part 
of the experiment this was adequate accuracy. 

Tank MO. A hook gauge reading to x&$in. was used.

Changes in level are due to evaporation or rainfall, both being excluded from the
minors. A fall in level takes place in both arms of the U system when evaporation
occurs, so that for a change in minor level of $\delta z$, the total evaporation is greater,
and may be set equal to $\delta z\,(1 + A)$, where $A$ is the specific yield of the soil with
water-table at $z$ cm. below the surface. If, over a period, the measured rainfall is $R$, then
the total evaporation is given by

$$E = \delta z\,(1 + A) + R. \tag{18}$$

As $A$ is a function of $z$, measurements were made to give the values in table 3.

Table 3. Specific yield of soil

depth of water-table (in.)        5       10       16       24       36
specific yield, A (cm./cm.)     0.02     0.04     0.10     0.13     0.13
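
Equation (18) and table 3 together give the evaporation from a soil cylinder for a measured fall in the minor. A minimal sketch, with hypothetical names and only the five tabulated depths supported (no interpolation between depths is attempted):

```python
# Specific yield A from table 3, keyed by depth of water-table in inches.
SPECIFIC_YIELD = {5: 0.02, 10: 0.04, 16: 0.10, 24: 0.13, 36: 0.13}


def total_evaporation(fall_in_minor, depth_in, rainfall=0.0):
    """Equation (18): E = dz (1 + A) + R, all lengths in the same unit (e.g. mm)."""
    specific_yield = SPECIFIC_YIELD[depth_in]
    return fall_in_minor * (1.0 + specific_yield) + rainfall
```

For example, a 2.0 mm. fall in the minor of a cylinder with the water-table at 16 in. and no rain corresponds to 2.2 mm. of evaporation.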

4. Results for individual days 
(a) Open water: sink strength 

Equation (6a) was tested by plotting $E_0/(e_s - e_d)$ against $u_2$. Figure 2 shows the
result for cylinder O with the 1944 and 1945 data distinguished, figure 3 shows the
result for tank MO from mid-June 1945, and in figure 4 are given values of
$\sum E_0/\sum(e_s - e_d)$ for wind speed ranges sufficiently wide to include at least four
observations in the summations. The data represented in these figures have been selected
as follows: (i) rain days have been excluded as there is some uncertainty about the
uniformity of rainfall distribution over the site; on such a day the fall in level is
made up of evaporation minus rainfall. (ii) For cylinder O, only those days have




been used in which $E_0 > 2.5$ mm. and $(e_s - e_d) > 2.5$ mm.; for tank MO, to increase
the number of available results, the limits were lowered to 2.0 mm. The experimental
errors tend to be absolute and the uncertainty in the ratio increases greatly as $E_0$
and $\Delta e$ become very small.



Figure 4. Mean daily evaporation per unit partial pressure difference for open water (groups
of days having approximately the same average wind speed). The continuous line is:
$E_0/(e_s - e_d) = 0.35\,(1 + 9.8 \times 10^{-3}\,u_2)$. Open circles, cylinder O; filled circles, tank MO; - - - Rohwer.

The scatter in figures 2 and 3, although not very much worse than that obtained 
by other workers doing indoor experiments, is considerable. A number of obvious 
contributory factors have been examined and shown to be of slight effect: dryness 
of the surround, wind distribution during the day, height of rim and season of the 
year. The main sources are undoubtedly the meteorological observations themselves; 
in increasing order of importance they are: dewpoint and surface temperature determinations
and wind velocity measurements. Concentrating on the last, it is doubtful
whether a measurement at a single height and the assumption of zero velocity at
ground level are sufficient to define the wind velocity profile even over a smooth
surface; they cannot be expected to take account of the local turbulence introduced
by many obstructions and surface irregularities. These will vary with wind direction,
and an analysis of the cylinder O results showed that all high values of $E/\Delta e$ at high
$u_2$ were for days with north-east winds, i.e. days in which the local exposure would
be most conducive to extra turbulence. It seems, therefore, that in spite of their
greater scatter, the tank MO results are probably a better guide to a general law
than the cylinder O results, and this is supported by the comparison (figure 4) with
Rohwer’s results. (Because of the scatter it is probable that the mean curves do not
differ significantly.) The mean curves show that there is a linear relationship between
$E/\Delta e$ and $u_2$, but for comparison with the theoretical form two curves could be
fitted: (i) through the overall mean value of $E/\Delta e$ and of $u_2$, a curve $E/\Delta e = a\,u_2^{0.76}$
(cf. equation 6a) and, (ii) through the two general means obtained from the groups



of points lying to the left and right of the overall mean, a curve $E/\Delta e = a\,u_2^{b}$. The
results, in decreasing order of efficiency, are given in table 4.

Table 4. Values of $f(u)$ in $E_0 = (e_s - e_d)\,f(u)$

         cylinder O                            tank MO                               Rohwer*
best     $0.30\,(1 + 14.2 \times 10^{-3}\,u_2)$    $0.35\,(1 + 9.8 \times 10^{-3}\,u_2)$     $0.40\,(1 + 7.1 \times 10^{-3}\,u_2)$
good     $0.033\,u_2^{\ldots}$                     $0.065\,u_2^{\ldots}$
fair     $0.021\,u_2^{0.76}$                       $0.019\,u_2^{0.76}$                       -

The curves drawn in figures 2 and 3 are the ‘good’ curves, wind velocities being
in miles per day.

From the above, it is concluded that: (i) the best form of (1) for practical use is

$$E_0 = 0.35\,(1 + 9.8 \times 10^{-3}\,u_2)(e_s - e_d)\ \text{mm./day}, \quad u_2 \text{ in m.p.d.}; \tag{19}$$

(ii) for analysis, demanding a curve passing through the origin, the power of the
wind velocity is much nearer ½ than ¾; (iii) if the form of equation (6a) is to be
maintained, the constant is to be reduced from 0.033 to about 0.020, a result that
may be due to inaccurate assumptions about the distance away of the hypothetical
‘leading edge’, may be due to the departure from zero temperature gradient, or
may be due to an inaccurate value of $\alpha$ in specifying the shape of the hydrolapse.
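
The three ‘best’ linear wind functions of table 4 can be compared at any chosen wind run with a few lines of code. This is a sketch only: the dictionary keys and function names are hypothetical, and the coefficients are those read from table 4 and equation (19).

```python
# 'Best' linear fits f(u) = a (1 + b u_2), u_2 in miles/day, as read from table 4.
BEST_FITS = {
    "cylinder O": (0.30, 14.2e-3),
    "tank MO": (0.35, 9.8e-3),
    "Rohwer": (0.40, 7.1e-3),
}


def wind_function(source, u2_miles_per_day):
    """Evaluate f(u) for one of the tabulated fits."""
    a, b = BEST_FITS[source]
    return a * (1.0 + b * u2_miles_per_day)


def open_water_evaporation(e_s, e_d, u2_miles_per_day):
    """Equation (19): E_0 in mm/day from the tank MO fit; pressures in mm Hg."""
    return wind_function("tank MO", u2_miles_per_day) * (e_s - e_d)
```

At 100 miles/day the three fits give 0.726, 0.693 and 0.684 mm./day per mm. Hg, which illustrates how closely the tank MO and Rohwer lines agree.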

( b ) Open water : energy balance 

The initial objective in this approach was an application to extended periods in
which the assumptions made in reducing (8) to (9) would be reasonable; the main
discussion will be of this aspect and appears below, but in view of its success it
seemed worth while extending the application to individual days. An estimate of
$E_{MO}$, based on energy, was obtained for most days between mid-June and the end
of September 1945 and results are shown in figure 5 and table 5. The latter includes
a representative sample of the data of the figure, the choice being made at roughly
6-day intervals with an attempt to give reasonable ranges of wind velocities and
sunshine conditions, and affords a comparison of the estimates based on energy
balance and sink strength (equation 19) with each other and with the observed
values of daily evaporation. The values of $(e_s - e_d)$ range from 1.10 to 7.40 mm.,
those of $\sigma T_a^4$ from 13.6 to 14.8, those of $(0.56 - 0.092\sqrt{e_d})$ from 0.30 to 0.26, the
product of the last two functions tending to be constant, and those of $(0.10 + 0.90\,n/N)$
from 0.10 to 0.70. From the last, it will be seen that the most important term in
the back radiation is the cloudiness factor, the least certain of any. Although they
could be deduced from other columns, values of $\beta$ are given. The figure is extremely
encouraging. With the table it shows that the energy balance estimate is too big in
mid-summer but improves later in the year. The change in the reflexion coefficient
(here taken as constant throughout) would act in the opposite sense, and apart from

* Wing-Cdr Frost, in a private communication, has stated that observations at Poona,
India, which were reduced to the form $E/\Delta e = a\,u^{b}$ by the experimenters can be equally well
fitted by

$$E = 0.40\,(1 + 7.3 \times 10^{-3}\,u_2)(e_s - e_d)\ \text{mm./day}.$$

This is almost indistinguishable from the Rohwer equation.




any deficiencies in (13) itself it is probable that the main cause of the over-estimate 
is the warming of the bottom of the tank to a higher temperature than the outside 
soil so producing a positive value of the factor C in (8). The neglect of C in (9), 
therefore, leads to an over-estimate of $H$ and hence of $E$.




Figure 5. Comparison of the observed daily evaporation from open water (tank MO) and
estimates based on the energy balance. Open circles, observed; filled circles, energy balance
estimate. Year 1945.

Table 5. Energy balance estimate for tank MO for individual days, compared with
the observed value and the sink strength estimate (equation 19)

                                                              evaporation (mm./day)
                                                              estimates          observed
date         u_2       0.95 R_c       H          β       H/(1+β)    Δe·f(u)        E_0
1945       (m.p.d.)   (mm./day)   (mm./day)

June 11      149        4.36        3.64       0.37       2.6        2.4          2.3
     16       80        5.62        4.52       0.48       3.0        2.7          2.5
     22       92        8.28        5.58       0.03       5.4        4.2          4.7
     27      118        7.19        4.82       0.35       3.9        4.1          2.5
July 1       197        7.16        5.01       0.16       4.3        4.7          3.2
     8        96        8.49        5.98       0.28       4.7        4.8          7.8?
     12       50        8.19        5.80       0.15       5.0        3.1          3.6
     17      111        6.45        4.78       0.15       4.2        3.2          3.6
     23      128        9.09        5.76       0.23       4.7        5.7          4.8
     30-31   129        3.72        2.78      -0.07       3.0        2.6          2.7
Aug. 3        67        7.58        4.11       0.12       3.6        4.3          3.6
     8        80        3.65        2.67       0.25       2.1        1.4          1.2
     17      122        5.65        3.76      -0.07       4.0        2.8          2.8
     23      182        5.00        3.43       0.06       3.2        3.2          3.8
     26       63        7.23        3.90       0.17       3.4        3.4          3.6
Sept. 2      196        5.40        3.70      -0.18       4.5        3.3          3.8
     8       128        1.90        1.53       0.15       1.3        1.4          1.8
     9        50        1.43        1.07       0.04       1.0        0.8          0.8
     14      133        1.72        1.36      -0.02       1.4        0.9          1.1
     22      169        4.52        2.34      -0.31       3.4        2.3          3.8
     27      146        3.17        1.63      -0.17       2.0        2.3          1.7
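
Since the energy balance estimate in table 5 is defined as $H/(1+\beta)$, the tabulated columns can be cross-checked against one another. The sketch below does this for the values as they can be read from the table; the names are hypothetical and the tolerance of 0.1 mm./day is an assumption chosen to allow for rounding to one decimal place.

```python
# (date, H, beta, tabulated H/(1+beta)) as read from table 5, mm/day.
ROWS = [
    ("June 11", 3.64, 0.37, 2.6), ("June 16", 4.52, 0.48, 3.0),
    ("June 22", 5.58, 0.03, 5.4), ("June 27", 4.82, 0.35, 3.9),
    ("July 1", 5.01, 0.16, 4.3), ("July 8", 5.98, 0.28, 4.7),
    ("July 12", 5.80, 0.15, 5.0), ("July 17", 4.78, 0.15, 4.2),
    ("July 23", 5.76, 0.23, 4.7), ("July 30-31", 2.78, -0.07, 3.0),
    ("Aug. 3", 4.11, 0.12, 3.6), ("Aug. 8", 2.67, 0.25, 2.1),
    ("Aug. 17", 3.76, -0.07, 4.0), ("Aug. 23", 3.43, 0.06, 3.2),
    ("Aug. 26", 3.90, 0.17, 3.4), ("Sept. 2", 3.70, -0.18, 4.5),
    ("Sept. 8", 1.53, 0.15, 1.3), ("Sept. 9", 1.07, 0.04, 1.0),
    ("Sept. 14", 1.36, -0.02, 1.4), ("Sept. 22", 2.34, -0.31, 3.4),
    ("Sept. 27", 1.63, -0.17, 2.0),
]


def inconsistent_rows(tolerance=0.1):
    """Dates where H / (1 + beta) differs from the tabulated estimate by more than tolerance."""
    return [date for date, h, beta, tabulated in ROWS
            if abs(h / (1.0 + beta) - tabulated) > tolerance]
```

On these readings every row reproduces its tabulated estimate except 27 June, where the listed $\beta$ of 0.35 gives 3.6 rather than 3.9; a misread digit in one of the entries for that day seems a likely explanation, though this cannot be confirmed from the text.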




From the table the sink strength estimates based on the fitted curve appear to 
be a little better than the energy balance estimates. The correlation coefficients 
between observed and estimated values are about 0.80 in each case, and when it is
remembered that the fitting has reduced the sink strength estimate to about 80 % 
of its theoretical value it is apparent that on the basis of the original predictions the 
energy balance estimate is the better. The two estimates agree on 8 July when the 
observed value was extremely high and was queried at the time of observation. In 
open country there are several causes of spurious high readings; rabbits and birds 
appreciate an open pool on a hot day and although the enclosure was refitted with 
wire netting for this experiment it is impracticable to take measures to ensure 100 % 
freedom from intrusion. Leaks are usually unidirectional and although a big leak 
is easily noticed, a small one, particularly if variable, could easily be overlooked. 
Replication is the only safe control here, and the close agreement of the sink strength 
results for cylinder O and tank MO is regarded as confirmation of the general 
water-tightness of both containers. 
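
The correlation between observed and estimated evaporation mentioned above can be checked with a short calculation. The following is a minimal sketch in Python, not part of the original paper: the function name `pearson_r` is illustrative, and only the first ten days of table 5 (11 June to 30-31 July 1945, energy balance estimate against observed value) are used, so the result need not reproduce the figure of about 0.80 quoted for the full table.

```python
import math

def pearson_r(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)

# Table 5, first ten days of 1945 record (mm./day); subset chosen for illustration.
energy_balance = [2.6, 3.0, 5.4, 3.9, 4.3, 4.7, 5.0, 4.2, 4.7, 3.0]
observed = [2.3, 2.5, 4.7, 2.5, 3.2, 7.8, 3.6, 3.6, 4.8, 2.7]

r = pearson_r(energy_balance, observed)
print(f"r = {r:.2f} for {len(observed)} days")
```

Dropping the queried reading of 8 July (7.8 mm./day) from both lists is an easy way to see how strongly a single doubtful observation affects the coefficient.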

(c) Evaporation on individual days (other surfaces)

No detailed analysis has been attempted, for two reasons. With water-tables at 
some distance below the surface there is always some drying out of the soil above 
the water-table that does not affect the water-table, so that on rain-free days the 
movement of the water-table does not represent all the evaporation taking place. 
On a rain day, no rise in water-table will occur until this accumulated deficit of 
moisture has been made up and on such days the estimate of evaporation based on 
water-table movement and rainfall will be excessive. 

The second reason is that changes in soil temperature from day to day produce 
changes in the surface tension of the water held in the soil above the water-table and 
slight water movements take place accordingly. During dry periods the movements 
of the water-table in cylinder 24 B were due almost entirely to temperature changes, 
and evaporation was negligible by comparison. 

Over an extended period the effect of temperature changes under the saturated 
surfaces is negligible; if the period is chosen so that its beginning and end are marked 
by a rise in water-table due to rain then one can be sure that the soil moisture content 
above the water-table is the same at beginning and end, and (18) can be applied with 
confidence. Such periods have been termed 'natural periods' (Penman & Schofield
1941) and are the basis of the ensuing discussion.

5. Evaporation in natural periods 
(a) General 

The main discussion will be confined to the open water surfaces, the bare soil
with water-table at 5 in. and the turf with water-table at 16 in., but it will be useful
to give a brief account of the performance of other cylinders. The bare soil cylinders
with water-tables at 10, 16 and 24 in. failed to satisfy the condition of continuous
saturation at the surface; the first showed signs of drying during extended periods
of rain-free weather and the other two were almost continuously dry except for a
day or two after rain. Under turf, the water-table at 10 in. was probably too high
for the proper development of the plant, and both growth and transpiration were
slightly less than for cylinder 16T; the 1945 results showed transpiration to be
about the same for cylinders 16T and 24T, although the crop yield was greater on
the latter; the turf over the deepest water-table (cylinder 36T) failed to establish
itself with the same luxuriance as the others and little of any value has yet emerged
from the results.


Figure 6. Mean daily evaporation in natural periods for surfaces with non-limiting water
supply. (Open water: O and MO. Bare soil with water-table at 5 in.: 5B. Turfed soil with
water-table at 16 in.: 16T.) -O- Observed values. Estimates: -×- sink strength,
energy balance and combined. (Time axis: May 1944 to October 1945.)

The results of 18 months for cylinders O, 5B, 16T and tank MO are summarized
in figure 6, and with a few supplementary notes the figure should be almost
self-explanatory. Apart from the large winter gap, when snow and ice prevented readings
from being obtained, there are one or two minor gaps of a few days when records 
were unobtainable. The lines drawn have no significance other than helping the eye 
to follow seasonal changes. 





The sink strength estimates are based on the mean wind velocity for the period,
and the mean values of $T_s$ and $T_d$ from which $e_s$ and $e_d$ were derived. For cylinders
O, 5B and 16T the linear expression derived for cylinder O was used (table 4,
column 1); for tank MO the corresponding equation (19) was used.

The radiation estimates are based on (13) and (9), using reflexion coefficients of
0.05, 0.10 and 0.20 for open water, bare soil and turf respectively; for cylinder O
and tank MO, therefore, the value of $H$ was the same, but the values of $\beta$ were not
necessarily the same, depending upon the values of the surface temperatures.

As the surface temperature of tank MO was not measured until mid-June 1945, 
the combined estimate (equation 16) was made for the preceding natural periods. 

(b) Comments and conclusions

(i) For cylinder O the sink strength estimate is a reasonably good fit throughout,
suggesting that in selecting results for figure 2 there has been no undue bias.

(ii) For cylinders 5B and 16T the sink strength estimate is good in 1944 but too
big in 1945. It is difficult to interpret the turf results as the thermometer bulb was
usually covered by growing grass of varying length; perhaps one ought to be surprised
at such an exposure combined with a formula based on open water leading
to results so near observation. In the case of the bare soil, a reasonable explanation
is available. During the summers, and particularly in 1945, a tough wiry weed
established itself on the surface and by its mere physical presence probably slowed
up evaporation by decreasing wind speed over the surface. As a result of this a little
more heat would be available for warming the surface, i.e. $T_s$, and hence $e_s$ and
$(e_s - e_d)$, would increase, so leading to an increased estimate as a result of a decreased
observed evaporation.

(iii) For cylinders O, 5B and 16T the radiation estimates are almost invariably
too low. In some of the winter periods the value of $\beta$ reached very uncertain values
near $-1$ and the derived values of $H/(1+\beta)$ were absurd. These values are not
plotted.

(iv) For tank MO both estimates are reasonably good and table 5 and figure 5 
above may be regarded as giving the fine structure of some of the results shown in 
figure 6. 

(v) The results so far quoted for cylinder O and tank MO seem,
superficially, to be in conflict; the same sink strength formula fits both whereas
the radiation estimates fit the tank results only. The evaporation from the tank was
usually about 25% less than that from the cylinder (table 6 below), but the surface
temperature was correspondingly lower, leading to the near agreement of table 5.
It is suggested that the increased evaporation from cylinder O was due to the greater
surface temperature, i.e. to an extra supply of energy being available to it that was
not available for the tank, and that the exclusion of this additional energy supply
from the energy balance for cylinder O led to the noted underestimates of evaporation
on this basis. Whereas tank MO had an all-soil surround, O and the other cylinders
had a hollow on one side. The air in this would warm up during the day to
temperatures much in excess of the soil or water in the cylinders at the same depth,
leading to an inflow of heat, i.e. to a negative value of $C$ in (8), at least for midsummer
months.

It is reasonable to assume that the effect would be of the same order for all cylinders,
i.e. that relative values of evaporation from the three kinds of saturated surfaces
would not be materially affected and that one of the major aims of the experiment
has not been upset by the deficiency of the experimental site.

6. Conclusions from results of preceding paragraphs

It is convenient to interpolate a short set of conclusions based on the study of 
daily and periodic evaporation. Without repeating the reservations already made 
concerning experimental accuracy and theoretical adequacy, we have: 

(a) A sink strength formula has been obtained that agrees closely with the results
of intensive work by Rohwer, and is substantially the same for two open water
surfaces having different environments.

(b) An energy balance has led to close agreement with observed values for one
of these surfaces in which the conditions most nearly satisfy the basic assumptions
made in striking the balance.

(c) The discrepancy for the other surface is of a kind that can reasonably be
taken as applicable to all the cylinders similarly exposed, so that relative values of
evaporation rates may be deduced.

7. Results (continued): relative evaporation rates 

Table 6 shows the seasonal variation in evaporation from cylinder O, expressed
in in./day, and the relative rates for bare and turfed soil, the natural periods being
grouped roughly in calendar months. For 1945 the mean daily rates for tank MO
are included. The bare soil cylinder was set up for a new experiment in December
1945 involving destruction of the surface.

Natural surfaces that are bare, such as arable land before a crop is established,
will rarely remain moist in summer when weeds can grow, and will, under ordinary
cultivation, rarely grow weeds in winter when the soil will remain moist. It is,
therefore, permissible to select results from the preceding table and to state that
the evaporation rate from a freshly wetted bare soil will be about 90% of that from an
open water surface exposed to the same weather. This is in agreement with the results
of indoor experiments (Penman 1941) and other outdoor work (e.g. White 1932).

It is not so easy to reach a firm decision about the grass surface. Assuming, for
simplicity, that the leaf temperature is always the same as that of the open water
surface, the ratio $E_T/E_0$ will depend upon the length of daylight ($N$ hr.), and the
difference between the minimum surface temperature and the dewpoint. If this
difference is large compared with the diurnal temperature change, the value of
$E_T/E_0$ will be of the order of $N/24$, i.e. will range from about 0.70 in summer to
about 0.30 in winter at Rothamsted. As the excess of minimum over dewpoint
temperature decreases, both these extreme ratios increase, and when the difference
is zero they will be of the order 0.95 and 0.58 respectively, with a value of about
0.83 at the equinoxes (assuming a sinusoidal variation of diurnal temperature
change). When the dewpoint temperature is greater than the surface minimum,
conditions are more complicated, as condensation will take place on both kinds of
surface, and until the dew is evaporated both will behave as open water surfaces
whatever the light conditions. Under these conditions the relative evaporations
over a whole day will approach equality, although the absolute amounts will tend
to be small.
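
The limiting ratio $N/24$ quoted above can be illustrated numerically. The sketch below is not from the original paper: it assumes the standard astronomical day-length formula $N = (24/\pi)\arccos(-\tan\phi\,\tan\delta)$, a latitude of about 51.8° N for Rothamsted, and solar declinations of ±23.44° at the solstices. The function name `day_length_hours` is illustrative.

```python
import math

def day_length_hours(latitude_deg, declination_deg):
    """Hours between sunrise and sunset from the standard formula
    N = (24/pi) * arccos(-tan(latitude) * tan(declination))."""
    lat = math.radians(latitude_deg)
    dec = math.radians(declination_deg)
    cos_hour_angle = -math.tan(lat) * math.tan(dec)
    # Clamp so that polar day or night does not raise a math domain error.
    cos_hour_angle = max(-1.0, min(1.0, cos_hour_angle))
    return 24.0 / math.pi * math.acos(cos_hour_angle)

ROTHAMSTED_LATITUDE = 51.8  # degrees north; assumed value, not given in the text

for label, declination in [("midsummer", 23.44), ("equinox", 0.0), ("midwinter", -23.44)]:
    n_hours = day_length_hours(ROTHAMSTED_LATITUDE, declination)
    print(f"{label}: N = {n_hours:.1f} hr, N/24 = {n_hours / 24:.2f}")
```

Under these assumptions the solstice ratios come out near 0.69 and 0.31, in line with the rounded values of 0.70 and 0.30 given in the text.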

Table 6. Seasonal variation in relative evaporation

period                      $E_0$      $E_B/E_0$  $E_T/E_0$  period                       $E_0$      $E_B/E_0$  $E_T/E_0$  $E_{MO}$
1944                        (in./day)                        1945                         (in./day)                        (in./day)
Feb.                        -          -          -          8 Feb.-4 Mar.                0.04       0.78       0.31
Mar.                        -          -          -          5 Mar.-7 April               0.00       1.10       0.80       0.00
April                       -          -          -          8-30 April                   0.10?      1.05?      0.92?      0.10
9 May-1 June                0.18       0.91       0.83       4-25 May, 28 May-9 June      0.11       0.89       0.69       0.11
2-28 June                   0.19       0.08       0.88       10-22 June, 23 June-2 July   0.16       0.83       0.72       0.12
29 June-4 Aug.              0.16       0.82       0.89       3-12 July, 16 July-8 Aug.    0.17       0.84       0.71       0.14
5 Aug.-2 Sept.              0.19       0.81       0.78       12 Aug.-2 Sept.              0.11       0.87       0.90       0.08
5-30 Sept.                  0.10       0.91       0.79       6-15 Sept., 16-24 Sept.      0.09       0.61       0.61       0.07
1-10 Oct., 20 Oct.-6 Nov.   0.05       0.65       0.48       25 Sept.-22 Oct.             0.07       0.70       0.64       0.00
7-13 Nov., 20-30 Nov.       0.03       0.54       0.0        1-28 Nov.                    0.03       0.7?       0.0        0.02
1-22 Dec.                   0.01       0.8        -0.4       29 Nov.-22 Dec.              0.05       -          -0.0       0.05


The simple initial assumption is not likely to be realized in practice, and both 
mean surface temperature and its daily amplitude will be important. In winter, 
when the important temperature differences are likely to be small and unequal there 
may be more condensation on one surface than another and so negative ratios might 
be obtained (December 1944 and 1945). 

It is clear that a more detailed study of this part of the problem is needed, and
at the moment only a limited generalization can be made with the reservation that
it may apply to the Rothamsted site and the years 1944-45 only. Using the totals,
for midsummer periods (May-August inclusive) the ratio is 0.81; for equinoctial
periods (March-April, September-October) it is 0.72; for midwinter the results are
too few and too erratic to attempt expression of a ratio. We can, however, get an
indication of the order of magnitude from another source. At Fleam Dyke, Cambridge,
two drain gauges, one bare and the other turfed, are maintained side by side.
Over winter months, when the summer drying has been made good in both gauges,
$E_B$ is greater than $E_T$. For the months December-March inclusive, in the years
1939-40 to 1942-43, the totals were:

$$\Sigma E_B = 10.5\ \text{in.}, \qquad \Sigma E_T = 7.6\ \text{in.}, \qquad \text{i.e. } E_T/E_B = 0.725$$

(data kindly supplied by Mr Porteous, Engineer-in-charge, Cambridge University
and Town Waterworks Co.). Putting $E_B/E_0 = 0.90$, then $E_T/E_0 = 0.65$, and as
the March transpiration would be greater than that in November, the midwinter
ratio would be smaller. To a satisfactory degree of accuracy it may be rounded
off to 0.60.
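
The arithmetic behind the midwinter ratio can be followed in a few lines. This is a minimal sketch in Python, added for illustration only; the variable names are not from the original paper, and the inputs are the Fleam Dyke winter totals and the bare-soil factor of 0.90 quoted in the text.

```python
# Winter totals (December-March, 1939-40 to 1942-43) for the Fleam Dyke drain gauges.
bare_total_in = 10.5   # sum of E_B, inches
turf_total_in = 7.6    # sum of E_T, inches

turf_to_bare = turf_total_in / bare_total_in    # E_T / E_B
bare_to_water = 0.90                            # E_B / E_0 for wet bare soil
turf_to_water = turf_to_bare * bare_to_water    # E_T / E_0

print(f"E_T/E_B = {turf_to_bare:.3f}")
print(f"E_T/E_0 = {turf_to_water:.2f}")
```

The product rounds to 0.65; the further reduction to 0.60 is the author's allowance for March being included in the drain-gauge totals.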


8. Conclusions from results of seasonal variation in relative evaporation

(a) The evaporation rate from continuously wet bare soil is 0.9 times that from
an open water surface exposed to the same weather conditions in all seasons.

(b) The corresponding relative evaporation rate from turf with a plentiful water
supply varies with season of the year. Provisional values of $E_T/E_0$ for southern
England are:

Midwinter (November-February)                        0.6
Spring and autumn (March-April, September-October)   0.7
Midsummer (May-August)                               0.8
Whole year                                           0.75

(c) The discrepancy between cylinder O and tank MO is greatest in midsummer
when the effect of extra heat flow through the walls of cylinder O is likely to be most
important.
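
Conclusions (a) and (b) amount to a small lookup of conversion factors. The sketch below shows one way they might be applied to an open water estimate; it is an illustration only, the function and dictionary names are hypothetical, and the factors are the provisional values for southern England listed above.

```python
# Provisional E_T/E_0 factors for southern England, from conclusion (b).
SEASONAL_FACTOR = {
    "midwinter": 0.6,    # November-February
    "equinoctial": 0.7,  # March-April, September-October
    "midsummer": 0.8,    # May-August
}

# Calendar month (1 = January) to season, following the groupings in the text.
MONTH_TO_SEASON = {
    1: "midwinter", 2: "midwinter", 3: "equinoctial", 4: "equinoctial",
    5: "midsummer", 6: "midsummer", 7: "midsummer", 8: "midsummer",
    9: "equinoctial", 10: "equinoctial", 11: "midwinter", 12: "midwinter",
}

BARE_SOIL_FACTOR = 0.9  # conclusion (a): continuously wet bare soil, all seasons

def turf_evaporation(open_water_evaporation, month):
    """Evaporation from well-watered turf, given open water evaporation E_0."""
    return SEASONAL_FACTOR[MONTH_TO_SEASON[month]] * open_water_evaporation

def bare_soil_evaporation(open_water_evaporation):
    """Evaporation from continuously wet bare soil, given E_0."""
    return BARE_SOIL_FACTOR * open_water_evaporation

# Example: an open water rate of 5.0 mm./day in July.
print(turf_evaporation(5.0, month=7), bare_soil_evaporation(5.0))
```

The units of the result are simply those of the open water value supplied.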

9. Tests on other data 

(a) Data required and methods of using them 

Before the conclusions of § 8 can be applied to soil surfaces it is necessary to 
estimate the evaporation that would take place from an open water surface exposed 
to the same weather. To avoid reference back, the requirements are repeated: 

(i) for the sink strength estimate it is necessary to know the mean surface
temperature, the mean dewpoint temperature and the mean wind velocity. Then

$$E_0 = 0.35\,(1 + 9.8 \times 10^{-3} u_2)(e_s - e_d)\ \text{mm./day}; \tag{19}$$

(ii) for the energy balance estimate the requirements are: mean daily short-wave
radiation (or mean daily duration of sunshine), mean daily cloudiness (or mean daily
duration of sunshine), mean air temperature, mean dewpoint temperature, and
mean surface temperature. Then

$$E_0 = H/(1 + \beta)\ \text{mm./day}, \tag{11}$$

where

$$H = R_c(1 - r) - \sigma T_a^4\,(0.56 - 0.092\sqrt{e_d})(0.10 + 0.90\,n/N)$$

and

$$R_c = R_A(0.18 + 0.55\,n/N);$$

(iii) for the combined estimate there must be known: mean air temperature, mean
dewpoint temperature, mean wind velocity, and mean daily duration of sunshine.
Then

$$E_0 = (H\Delta + 0.27E_a)/(\Delta + 0.27)\ \text{mm./day}, \tag{16}$$

where $H$ is given by (13) above, and

$$E_a = 0.35\,(1 + 9.8 \times 10^{-3} u_2)(e_a - e_d) \quad (\text{cf. (19)}).$$

The values of $N$ and $R_A$ will vary with latitude and season, but are readily obtainable
from standard sources. The value of $r$ has a similar double variance but it will
probably be sufficiently accurate to use a constant value of 0.05.

For general use, where $E_0$ has not been directly measured, the third method is
most useful, and as the dependence on wind speed is not very critical, a Beaufort
wind force can be substituted for $u_2$, so giving an opportunity of making an
evaporation estimate from the data of a weather map. To convert the sink strength formula
it is sufficient to use the 'good' expression for cylinder O (table 4) combined with
two standard conversion factors:

$$E_0 = 0.033\,u_2^{0.68}(e_s - e_d), \quad u_2 \text{ in miles/day};$$

$$u_2 = 0.78\,u_{10};$$

$$u_{10} = 1.87\,B^{3/2} \times 24, \quad u_{10} \text{ in miles/day};$$

i.e.

$$E_0 = 0.033\,[0.78 \times 24 \times 1.87\,B^{3/2}]^{0.68}(e_s - e_d) = 0.37\,B^{1.02}(e_s - e_d)\ \text{mm./day}.$$

The coarseness of the Beaufort estimates of wind force suggests that this may safely
be reduced to

$$E_0 = 0.37\,B\,(e_s - e_d)\ \text{mm./day}, \tag{20}$$

giving $E_a = 0.37\,B\,(e_a - e_d)$ mm./day for use in (16).
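
The reduction to a Beaufort form can be verified numerically. The sketch below is illustrative only (the function name is hypothetical); it assumes the power-law expression $E_0 = 0.033\,u_2^{0.68}(e_s - e_d)$ and the two conversion factors quoted above, and collapses them into a single coefficient and exponent of $B$.

```python
def beaufort_coefficients(a=0.033, power=0.68, height_factor=0.78,
                          mph_per_beaufort=1.87, hours_per_day=24):
    """Collapse E_0 = a * u_2**power * (e_s - e_d), with
    u_2 = height_factor * u_10 and u_10 = mph_per_beaufort * B**1.5 * hours_per_day,
    into E_0 = k * B**m * (e_s - e_d). Returns (k, m)."""
    k = a * (height_factor * hours_per_day * mph_per_beaufort) ** power
    m = 1.5 * power
    return k, m

k, m = beaufort_coefficients()
print(f"E_0 = {k:.2f} * B^{m:.2f} * (e_s - e_d) mm./day")
```

The exponent of 1.02 is close enough to unity that, given the coarseness of Beaufort estimates, the linear form (20) loses little.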


Although evaporation data exist for many sites over long periods of time there 
are not many sets that have sufficient contemporary meteorological data alongside 
to enable a comparison of observed and predicted evaporation to be made. The few 
cases discussed below have been chosen to give a fairly wide variety of sites and
types of surface. 

(b) Open water 

(i) Fitzgerald (1886). Pans were floated in the middle of an 86 acre reservoir at
Chestnut Hill, Boston, Mass. Table 7 gives results for a pan 10 ft. diameter and 10 ft.
deep, filled to within 3 in., and sunk to within 6 in. of the top. The anemometer was
at 30.6 ft. above the surface. Values of $E_0$, $u$, $T_s$, $T_a$ and $h$ were measured on 8 days
between June and October 1886. Using (19) the following results are obtained,
$E_{19}$ representing the estimates.


Table 7. Daily evaporation at Boston, U.S.A.

day                     1      2      3      4      5      6      7      8
$u_2$ (m.p.d.)          223    135    116    150    127    75     61     129
$(e_s - e_d)$ (mm.)     10.4   7.6    7.8    7.7    10.0   8.0    6.3    2.6
$E_{19}$ (mm./day)      11.6   6.2    5.8    6.7    7.8    4.9    3.3    2.0
$E_0$ (mm./day)         10.7   6.8    6.1    6.6    7.1    7.1    4.1    2.5
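
The estimates in table 7 can be recomputed from the tabulated wind run and vapour pressure difference. The sketch below is an illustration in Python, assuming equation (19) in the form $E_0 = 0.35(1 + 9.8 \times 10^{-3} u_2)(e_s - e_d)$; the function name is hypothetical and the input lists are the table 7 rows.

```python
def sink_strength_estimate(u2_miles_per_day, vapour_pressure_difference_mm):
    """Equation (19): evaporation in mm./day."""
    return 0.35 * (1 + 9.8e-3 * u2_miles_per_day) * vapour_pressure_difference_mm

# Table 7: Fitzgerald's eight days at Boston, 1886.
wind_run = [223, 135, 116, 150, 127, 75, 61, 129]          # u_2, miles/day
difference = [10.4, 7.6, 7.8, 7.7, 10.0, 8.0, 6.3, 2.6]    # e_s - e_d, mm.
observed = [10.7, 6.8, 6.1, 6.6, 7.1, 7.1, 4.1, 2.5]       # E_0, mm./day

estimates = [round(sink_strength_estimate(u, d), 1)
             for u, d in zip(wind_run, difference)]

for day, (est, obs) in enumerate(zip(estimates, observed), start=1):
    print(f"day {day}: estimate {est:.1f}, observed {obs:.1f}")
```

Recomputed in this way the estimates agree with the printed $E_{19}$ row to within about 0.2 mm./day; the small differences may reflect rounding of the tabulated inputs.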




(ii) Visentini (1936). A pan was floated in the reservoir at Molato, Italy, with an
anemometer on the dam. Values of $T_s$, $T_a$, $u$ and $h$ are given. In the absence of
information it is assumed that the anemometer height was 2 m. and the calibration
the same as for the Rothamsted anemometer.


Table 8. Mean daily evaporation at Molato, Italy

1934                    Aug.   Sept.  Oct.   Nov.   Dec.
$u$ (m.p.d.)            67     48     44     82     77
$(e_s - e_d)$ (mm.)     8.2    6.85   6.8    3.[?]  1.95
$E_{19}$ (mm./day)      4.8    3.5    2.9    2.1    1.2
$E_0$ (mm./day)         5.1    3.9    4.1    2.3    1.3

1935                    Mar.   Apr.   May    June   July   Aug.   Sept.
$u$ (m.p.d.)            106    96     99     87     91     70     72
$(e_s - e_d)$ (mm.)     3.0    5.25   3.9    8.2    8.5    6.3    5.4
$E_{19}$ (mm./day)      2.1    3.6    2.7    5.4    5.7    3.9    3.3
$E_0$ (mm./day)         1.8    2.7    3.0    6.2    7.9    5.7    4.3


(iii) Davydov (1936). Measurements were made on Sevan Lake, Soviet Armenia
(no details). Values of $(e_s - e_d)$ between lake surface and 10 cm. above, and of wind
velocity at 9 m. are given. Results are means for 1927-30:


Table 9. Mean daily evaporation at Sevan Lake, Soviet Armenia

                        Apr.   May    June   July   Aug.   Sept.  Oct.   Nov.   Dec.
$u_2$ (m.p.d.)          136    106    123    157    119    144    140    153    183
$(e_s - e_d)$ (mm.)     0.1    0.0    1.5    2.3    3.9    4.7    4.9    3.8    3.3
$E_{19}$ (mm./day)      0.1    0.0    1.2    2.0    3.0    4.0    4.1    3.3    3.2
$E_0$ (mm./day)         0.4    0.2    1.2    2.1    3.4    4.5    4.1    3.6    2.9


(iv) Ray (1931). Monthly means for a standard U.S. Weather Bureau evaporation
pan are given for 1917-30 for San Juan, Puerto Rico, together with $u_0$, $T_a$, mean
saturation deficit and mean hours of sunshine. For speed of computation a 12 hr.
day has been assumed for the whole year. As results are given in in./month and
$(e_a - e_d)$ in inches of mercury, the same form is kept in table 10:


Table 10. Mean monthly evaporation at San Juan, Puerto Rico

                        Jan.   Feb.   Mar.   Apr.   May    June
$E_a$ (in./month)       5.35   4.75   7.3    7.2    7.6    7.4
$H$ (in./month)         3.8    5.15   6.1    7.0    7.05   7.7
$E_{16}$ (in./month)    4.25   5.05   6.4    7.05   7.2    7.65
$E_0$ (in./month)       5.55   5.55   7.6    7.85   7.75   7.45

                        July   Aug.   Sept.  Oct.   Nov.   Dec.
$E_a$ (in./month)       8.55   8.16   6.05   5.0    4.8    5.75
$H$ (in./month)         7.1    7.55   6.4    5.3    4.3    3.8
$E_{16}$ (in./month)    7.55   7.7    7.3    5.2    4.45   4.3
$E_0$ (in./month)       8.15   8.05   6.6    5.85   5.1    5.46





(c) Bare soil 

There are few records available and information about related weather is even 
more rare. By the courtesy of the Indian authorities, the rainfall, drainage and 
other weather records for Pusa have been made available from 1907 to 1934. Pusa 
lies in the monsoon region of Asia, at latitude 26° N. in the Ganges basin, and in 
most years it is a fairly reasonable assumption that once the monsoon has broken, 
bare soil will remain moist for most of the monsoon period. 

In 1906 four drain gauges were constructed, each 1/1000 acre in area, without
disturbing the soil, two being 3 ft. deep and two 6 ft. deep. One at each depth was
kept bare and the other pair cropped, and daily records of rainfall and drainage for
months in which at least one gauge ran are available from 1907-34. The crop carried
by gauges II (6 ft.) and IV (3 ft.) during the monsoon period was either maize
(1907-15) or sunn hemp (1916-34), sowing taking place in June each year. Both
are tall plants (8 ft. and 6 ft. high), and, standing well above the turf surround, they
would be better ventilated, intercept more radiation and expose a larger transpiring
surface to the air than a patch of the same size in the middle of a large field.
Transpiration would be abnormally great, and the evidence of the cropped gauges is to
be rejected on the ground that the sample surface was not typical of its environment.

The records were separated into natural periods in which the difference between
rainfall and drainage (or drainage and run-off) can be equated to the evaporation
of the period. As drainage continued for several days after rain it was not always
possible to decide which was the last rainfall causing drainage, and there are
inevitable uncertainties in the estimation of $(R - D)$ per day as a result. As elsewhere,
all estimates of evaporation are very much dependent upon the drain gauge receiving
the same amount of rain as the rain gauge.

The values of $R - D$ for gauges I and III (6 ft. and 3 ft. bare) usually agreed closely
and there was general consistency in the performance of gauges II and IV. A condensed
summary for 2 years shows the order of evaporation per day from all four;
further analysis will be restricted to the records for gauge I.


Table 11. Comparative evaporation at Pusa

                       $R$/day    $(R - D)$/day (in.)
period                 (in.)      I (6'B)    II (6'C)   III (3'B)  IV (3'C)
1911
8-18 July              0.34       0.17       0.24       0.15       0.23
19 July-25 Aug.        0.47       0.16       0.37       0.15       0.35
26 Aug.-26 Sept.       0.27       0.12       0.24       0.12       >0.27
27 Sept.-13 Oct.       0.22       0.12       0.11       0.10       no D
1922
2-26 July              0.72       0.11       0.22       0.08       0.13
27 July-9 Aug.         0.53       0.09       0.21       0.05       ?
10-20 Aug.             0.24       0.12       0.20       0.10       ?
21 Aug.-23 Sept.       0.21       0.09       0.22       0.06       0.19
29 Sept.-5 Oct.        0.19       0.13       0.19       0.14       >0.19




Weather observations at Pusa (1911-38) were made at 08.00 hr. and included dry
and wet bulb temperatures, air maximum and minimum temperatures, anemometer
readings at 08.00 and 08.03 hr., cloud amount, and 'instrumental test observations',
the last being the actual readings of the three dry thermometers, presumably
expected to be equal when read. These readings never agreed and differences were
erratic, ranging up to 3° F, the usual order being dry > max. > min.

Assuming that the anemometer ran continuously between 08.03 hr. one day and
08.00 hr. the next, the run-of-the-wind per day was obtained for all days except
when the reading passed through an unknown zero; the height is assumed to be 2 m.
and the calibration the same as the Rothamsted instrument (table 2 above). From
the mean air temperature and the 08.00 hr. value of dewpoint the value of the mean
saturation deficit was obtained. From these, values of $E_a$ were obtained.

The determination of $H$ had to be based on a single cloud estimate per day;
comment is unnecessary. There seemed little point in evaluating it for all periods
in all years and only two, 1911 and 1922, are considered in detail (table 12). Mean
wind speeds (as measured) ranged from 33 to 133 miles/day, mean and 08.00 hr.
air temperatures from 81 to 86° F, 08.00 hr. humidity from 87 to 92% and estimated
$n/N$ from 0.00 to 0.64.

Table 12. Estimated $E_B$ and measured $(R - D)$/day for bare soil: Pusa 1911 and 1922

period               $R$/day    $E_a$/day  $H$/day    $E_{16}$    $0.9E_{16}$  $(R - D)$/day
                     (in.)      (in.)      (in.)      (in./day)   (in.)        (in.)
1911
8-18 July            0.34       0.08       0.11       0.10        0.09         0.17
19 July-29 Aug.      0.46       0.11       0.10       0.15        0.13         0.10
30 Aug.-8 Sept.      0.17       0.17       0.13       0.14        0.13         0.11
9-20 Sept.           0.32       0.08       0.16       0.14        0.12         0.14
27-30 Sept.          0.18       0.07       0.11       0.10        0.09         0.14
1-13 Oct.            0.22       0.09       0.14       0.13        0.11         0.10
1922
2-20 July            0.80       0.09       0.12       0.11        0.10         0.10
21-28 July           0.45       0.12       0.12       0.12        0.11         0.15
27 July-9 Aug.       0.54       0.12       0.14       0.13        0.12         0.09
10-18 Aug.           0.12       0.04       0.08       0.07        0.00         0.08
10-20 Aug.           0.38       0.12       0.17       0.16        0.14         0.18
21-26 Aug.           0.10       0.07       0.12       0.11        0.10         0.10
27-30 Aug.           0.25       0.00       0.10       0.10        0.09         0.16
31 Aug.-5 Sept.      0.25       0.08       0.12       0.11        0.10         0.13
6-10 Sept.           0.15       0.09       0.22       0.18        0.17         0.13
11-28 Sept.          0.34       0.09       0.14       0.13        0.11         0.14
29 Sept.-5 Oct.      0.19       0.02       0.19       0.14        0.13         0.14

(d) Cropped soil

Using the annual summary of the Monthly Weather Report, data for 70 stations
in the British Isles have been abstracted for the years 1930-39 and long period means
obtained for mean air temperature, mean vapour pressure, mean Beaufort wind
force and the mean ratios of actual/possible hours of sunshine. From these, values
of $E_a$ and $H$ were obtained for each site and values of $E_0$ derived, using equation (16).
With these converted to inches per year, an evaporation map of the British Isles was obtained
showing the probable value of annual evaporation from open water in exposed sites.
From the conclusions in § 8 above one would expect the corresponding value of the
annual evaporation from cropped land to be 3/4 of $E_0$ if the crops transpired at
maximum rates all the year; in practice the rates will be less than this because of the
ripening process in annual vegetation and/or the lack of summer rainfall,
particularly in south-east England, but in the table below this conversion factor is applied
uniformly. The table shows the observed difference between rainfall and runoff for
certain catchment areas (Lloyd 1940, 1941, 1942) (these are rather monotonous),
the observed difference multiplied by 4/3, which should be the expected corresponding
open water evaporation, and the estimated value of $E_0$ based on annual mean
values of weather elements for stations somewhere near, if not in, the catchment
area.

Table 13. Evaporation from catchment areas

                          mean rainfall-runoff   4/3 x (rainfall-runoff)   estimated $E_0$ for nearby sites
catchment    period       (in./year)             (in./year)                (in./year)
Lea          1928-36      19.2                   26                        Greenwich, 25; Rothamsted, 20
Thames       1928-36      18.7                   25                        Kew, 26; Oxford, 24
Severn       1928-36      18.8                   25                        Ross-on-Wye, 24; Cheltenham, 21; Shrewsbury, 21
Vyrnwy       1932-38      19.1                   26                        Shrewsbury, 21; Sealand, 24; Llandudno, 23
Rivington    1932-38      17.4                   23                        Stonyhurst, 21; Hutton, 18
Spey         1936         10.3                   14                        Dalwhinnie, 17

Detailed examination of Rothamsted data has shown that the sum of the twelve 
monthly estimates exceeds the annual estimate by about 10 %, due to the extra 
weight which should be given to summer evaporation. A similar increase is to be 
expected for other sites and should be borne in mind in reading table 13. 

10. General conclusion 

A detailed discussion of the data presented in § 9 (b), (c) and (d) would be
intolerably long, and much of it would be concerned with the adequacy of the
observations rather than the adequacy of the equations on which the estimates are based.
The general impression is satisfactory for all three types of surface, and the wide
range of climatic regions employed indicates that something of universal
significance has been obtained in the results of §§ 3 to 8, although there must inevitably
be something of the time and place at which the experimental work was done
included in the equations.

Two aspects of evaporation have been under review. There is that of the physicist
and mathematician seeking facts to fit a formula; sufficient has emerged to show
possible sources of weakness in the theoretical treatment of both sink strength and
energy balance and to show the relative importance of the quantities that must be
measured to obtain adequate accuracy in experimental trials. There is also that
of the 'practical' man (water engineer or meteorologist) seeking a formula to fit
the facts. Formulae have been given and where all the necessary measurements
can be made, or values forecast, then reliable evaporation rates can be estimated
or predicted. There are still empirical aspects of both sink strength and energy
balance estimates and until these are removed there must always be some doubt
about the possibility of successful translation in space or time of the formulae.

The work described in the preceding pages is an extension of research carried
out at Rothamsted Experimental Station before 1941. It was resumed in 1944 at
the request of the Meteorological Office, and it is a pleasure to be able to record my
appreciation of the assistance given by several branches of the Office in the form of
equipment, information and advice. I am particularly indebted for many helpful
discussions to Mr C. S. Durst of the Meteorological Office, and to my colleague,
Dr R. K. Schofield. To the Director of the Meteorological Office and the
Meteorological Research Committee I am grateful for the helpful reception given to the report
on which this paper is based and for permission to publish it.

Finally, this work could not have been done at short notice if the basic equipment
of the enclosure had not been available: to Dr B. A. Keen, F.R.S., until recently
Head of the Physics Department at Rothamsted, especially grateful acknowledgement
is due for the foresight that provided the installation with full knowledge that
it would be many years before the soil would be fit for experimental work.


References 

Bowen, I. S. 1926 Phys. Rev. 27, 779.

Brunt, D. 1939 Physical and dynamical meteorology. Camb. Univ. Press.

Cummings, N. W. & Richardson, B. 1927 Phys. Rev. 30, 627.

Davydov, V. K. 1936 Assoc. Intern. d'Hydrologie Sci. Bull. no. 22, 142.

Fitzgerald, D. 1886 Trans. Amer. Soc. Civ. Engrs, 15, 681.

Geiger, R. 1927 Das Klima der bodennahen Luftschicht. Brunswick: F. Vieweg.

Kimball, H. H. 1914 Mon. Weath. Rev., Wash. 42, 474.

Lloyd, D. 1940 Quart. J.R. Met. Soc. 66, 181.

Lloyd, D. 1941 Quart. J.R. Met. Soc. 67, 33.

Lloyd, D. 1942 Quart. J.R. Met. Soc. 68, 35.

Pasquill, F. 1943 Proc. Roy. Soc. A, 182, 75.

Penman, H. L. 1940 Quart. J.R. Met. Soc. 66, 401.

Penman, H. L. 1941 J. Agric. Sci. 31, 464.

Penman, H. L. & Schofield, R. K. 1941 J. Agric. Sci. 31, 74.

Prescott, J. A. 1940 Trans. Roy. Soc. S. Aust. 64, 114.

Ray, C. L. 1931 Mon. Weath. Rev., Wash. 59, 192.

Rohwer, C. 1931 U.S. Dept. Agric. Tech. Bull. 271.

Sutton, O. G. 1934 Proc. Roy. Soc. A, 146, 701.

Sutton, W. G. L. 1943 Proc. Roy. Soc. A, 182, 48.

Sverdrup, H. U. 1945 Oceanography for meteorologists. London: George Allen and Unwin.

Visentini, M. 1936 See Davydov. Bull. no. 22, p. 119.

White, W. N. 1932 U.S. Dept. Interior, Water Supply Paper 669.




Relaxation methods applied to engineering problems 
XIII. The flexure and extension of perforated elastic plates 

By R. V. Southwell, F.R.S. 

(Received 20 October 1947) 

The theory given in Part VIIA (1941) for a flat plate having a single external boundary is
here extended to allow for internal boundaries (holes) at which, as well as at the external
boundary, either tractions or displacements may be specified. Such holes entail appreciable
complication, more especially when boundary tractions are specified and do not form an
equilibrating system at each hole severally: ‘orthodox’ studies (e.g. by Bickley, Green, Howland,
Jeffery) have been restricted to simple shapes (e.g. one or more circular holes in a plate
having infinite extension). But any shape can be treated by Relaxation Methods, without
excessive labour and with all the accuracy that has practical value. A worked example
(stresses in a loaded tie-bar) shows that holes may entail a cyclic form for Airy’s stress-function χ.


Introduction and Summary 


1. Part VIIA of this series (Fox & Southwell 1941) dealt both with flexure and
extension of flat elastic plates. Starting from a résumé of accepted theory, it showed
both kinds of straining to present the same computational problem, of which it
distinguished two main cases:

Case ‘A’ calls for solution of a biharmonic equation typified by

$$D\nabla^4 w = Z(x, y) \tag{1}$$

(Z being specified), together with boundary conditions (imposed on w) which specify
displacements in the flexural, tractions in the extensional problem.

Case ‘B’ calls for solution of two simultaneous equations, typified by

$$\left.\begin{aligned}
\frac{\partial\Delta}{\partial x} + \Lambda\left(\nabla^2 u + \rho X/\mu\right) &= 0,\\
\frac{\partial\Delta}{\partial y} + \Lambda\left(\nabla^2 v + \rho Y/\mu\right) &= 0,
\end{aligned}\right\}
\qquad \left(\Delta = \frac{\partial u}{\partial x} + \frac{\partial v}{\partial y}\right), \tag{2}$$

in which all quantities except u and v are specified, together with boundary conditions
(imposed on u and v) which specify displacements in the extensional, tractions
in the flexural problem.

For these two cases Part VIIA proposed two different methods, both involving
the relaxational technique. Either could deal with ‘mixed’ boundary conditions
(specifying displacements at some points, tractions at others), and the question,
which of the two is preferable, was left to the test of experience.

2. Another matter not discussed in Part VIIA is treated in the present paper:
namely, the case presented when a plate is perforated so as to have internal boundaries
(holes) at which, as well as at its external boundary, either tractions or
displacements are specified. Here too the flexural and the extensional problem are
analogous, so that (when ‘mixed’ boundary conditions are excluded) two cases
‘A’ and ‘B’ (§1) arise. Case ‘B’ is not exemplified, being of rare occurrence in
practice and because it presents no computational problem that is not confronted
in Case ‘A’. Case ‘A’ has practical importance, and in this paper is discussed in
fuller detail: an extensional example (stresses in a loaded tie-bar) shows that holes
may entail a cyclic form for Airy’s stress-function χ.

The whole investigation would have had only academic interest if Relaxation
Methods did not provide an alternative to orthodox biharmonic analysis, for in
that analysis holes entail such complication that only the simplest shapes are
tractable. By contrast, for Relaxation Methods all shapes are tractable with equal
accuracy and with roughly equal labour (depending on the number of the holes).
In bringing perforated plates within their range, this paper removes their one
outstanding disadvantage as compared with the photo-elastic technique.


I. Holes in relation to Case ‘A’ of the flexural problem

3. In Case ‘A’ of the flexural problem the aim is to determine the transverse
deflexions (w) of the middle surface, and from these the stresses. The governing
equation is

$$D\nabla^4 w = Z(x, y) \tag{1 bis}$$

(D denoting the flexural rigidity), and the boundary conditions (usually) specify
values of w, ∂w/∂x and ∂w/∂y. They may do this either ‘absolutely’ (so that at the
outset all boundary values of w, ∂w/∂x, ∂w/∂y are known) or ‘relatively’ in the sense
that at each internal boundary something in the nature of a clamp may operate to
impose local variations of deflexion and of slope, the clamp itself being free to move
under the action of known external actions. When the specification is ‘absolute’,
internal boundaries present no special problem. When it is ‘relative’, a displacement
of its ‘clamp’ defined by

$$w_{cl} = A + Bx + Cy \tag{3}$$

may require corresponding additions to w, ∂w/∂x and ∂w/∂y at an internal boundary,
and we have the problem of determining the constants A, B, C.

4. Equation (1) being linear, solutions can be superposed. Accordingly we first
examine the action of the clamp held fixed by imposing the relative displacements 
and (when we have solved the problem thus presented) deducing the consequent 
actions on the clamp from the boundary tractions. These tractions are given by 

w dH n r3_. JM ,a/a*w ia«>n f (4) 

3 « 1 _ 3 j> ^deyladv pfo/J’ 




which are (6) of Part VIIA. (G, H and N denote, respectively, the line-intensities
of flexural moment, torsional moment and transverse (shearing) force, the senses
being as shown in figure 1. The fact that N and H cannot be specified separately
was first explained by Kirchhoff (cf., e.g., Love 1927, p. 27).)


[Figure 1]



The action of the clamp upon the plate will comprise a transverse force Z in the
direction of z, and moments L and M about axes parallel to x and y, given by

$$\left.\begin{aligned}
Z &= \oint N\,ds, \qquad L = \oint \left[H\cos(x,\nu) - G\sin(x,\nu) + yN\right] ds,\\
M &= \oint \left\{G\cos(x,\nu) + H\sin(x,\nu) - xN\right\} ds,
\end{aligned}\right\} \tag{5}$$

in which the integrations are to be taken round the whole of the internal boundary.
With G, H, N given their expressions (4), and when account is taken of the fact that
w and its differentials are single-valued, the expressions (5) can be identified with*

$$\left.\begin{aligned}
Z &= -D\oint \frac{\partial}{\partial\nu}\nabla^2 w\,ds, \qquad
L = -D\oint\left[y\frac{\partial}{\partial\nu} - \sin(x,\nu)\right]\nabla^2 w\,ds,\\
M &= +D\oint\left[x\frac{\partial}{\partial\nu} - \cos(x,\nu)\right]\nabla^2 w\,ds.
\end{aligned}\right\} \tag{6}$$

The orthodox approach 

5. For equilibrium of the clamp, external forces must operate to supply these
actions which it transmits to the plate; so to the ‘relative’ displacements must be
added displacements, due to movement of the clamp, such that Z, L, M as given by
(6) are the resultants of the known external actions (§3). In an orthodox treatment

* It is to be remembered that $\dfrac{1}{\rho} = \dfrac{\partial}{\partial s}(x, \nu)$.



w would be found by a synthesis of (for one internal boundary) four partial
solutions: that is, by writing

$$w = w_0 + Aw_A + Bw_B + Cw_C \tag{7}$$

(where A, B, C are the constants so denoted in §3) and using (6) to deduce corresponding
expressions for Z, L, M of the type

$$Z = Z_0 + AZ_A + BZ_B + CZ_C. \tag{8}$$

These last give three equations by means of which A, B, C can be evaluated, and
then all terms in (7) are known. In that expression w_0 is the solution obtained when
A = B = C = 0 in (3), so that the clamp is fixed; w_A is obtained by making Z = 0
in (1) and A = 1, B = C = 0 in (3); and so on.

When there is more than one internal boundary, each clamp independently can
undergo a rigid-body displacement, so N clamps entail 3N constants of the type
A, B, C, all of which enter into 3N expressions of the type of (8). On this account
the labour entailed is great when the plate has many holes.
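
For a single clamp the synthesis described above comes down to three linear equations of the type of (8) in the unknowns A, B, C. The following is a minimal sketch in Python, assuming the four partial solutions have already yielded their resultants; the function names, the argument layout (`base`, `unit`, `target`) and the numbers in the usage note are hypothetical illustrations, not data from the paper.

```python
from fractions import Fraction


def solve_linear(a, b):
    """Solve a x = b by Gauss-Jordan elimination with partial pivoting."""
    n = len(b)
    # Augmented matrix held as exact fractions.
    m = [[Fraction(v) for v in row] + [Fraction(rhs)] for row, rhs in zip(a, b)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        if m[pivot][col] == 0:
            raise ValueError("singular system")
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(n):
            if r != col:
                f = m[r][col] / m[col][col]
                m[r] = [x - f * y for x, y in zip(m[r], m[col])]
    return [m[i][n] / m[i][i] for i in range(n)]


def solve_clamp_constants(base, unit, target):
    """Return (A, B, C) for one clamp.

    base   -- (Z_0, L_0, M_0): resultants with the clamp held fixed
    unit   -- [(Z_A, L_A, M_A), (Z_B, L_B, M_B), (Z_C, L_C, M_C)]:
              resultants of the three unit-displacement solutions
    target -- (Z, L, M): resultants of the known external actions
    """
    # Row i is one resultant (Z, L or M); column k multiplies A, B or C.
    a = [[unit[k][i] for k in range(3)] for i in range(3)]
    b = [target[i] - base[i] for i in range(3)]
    return tuple(solve_linear(a, b))
```

With the hypothetical values `base = (2, 1, 0)`, `unit = [(4, 0, 0), (0, 3, 1), (0, 1, 3)]` and `target = (10, 5, 4)`, the call returns A = 2, B = 1, C = 1. For N clamps the same routine would face a 3N by 3N system, which is the labour the text refers to.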

The relaxational approach 

6. By contrast, Relaxation Methods dispense with this synthesis and obtain the
wanted solution by a single computation. This is because the actions on a clamp can
be ‘liquidated’ by ‘block displacements’ used in the usual way. Having the standard
‘relaxation pattern’, it is easy to deduce the changes made in Z, L, M by unit displacement
or rotations of the clamp as a whole: these are the ‘block displacements’
whereby Z, L and M are ‘liquidated’ at the cost of new ‘residuals’ at nodal points
within the plate. Analogous treatment was employed in Part III (Christopherson &
Southwell 1938) to solve the torsion problem of St Venant for a multiply-connected
section.
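
The idea of ‘liquidating’ residuals can be illustrated on a much simpler problem than the biharmonic one treated here. The sketch below, in Python, applies point relaxation to the five-point harmonic operator on a square net: the node with the largest residual is displaced so as to cancel its own residual, at the cost of new residuals at its neighbours. It is an assumption-laden illustration only (harmonic rather than biharmonic pattern, single-node rather than block displacements, hypothetical function name `relax`), not the procedure of the paper.

```python
def relax(phi, load, tol=1e-9, max_steps=100_000):
    """Point relaxation for: sum of 4 neighbours - 4*phi + load = 0.

    phi  -- 2D list of nodal values; the outer ring holds fixed boundary values
    load -- 2D list of h^2 * f at each node (zero for Laplace's equation)
    """
    rows, cols = len(phi), len(phi[0])
    interior = [(i, j) for i in range(1, rows - 1) for j in range(1, cols - 1)]

    def residual(i, j):
        return (phi[i - 1][j] + phi[i + 1][j] + phi[i][j - 1] + phi[i][j + 1]
                - 4.0 * phi[i][j] + load[i][j])

    res = {node: residual(*node) for node in interior}
    for _ in range(max_steps):
        node = max(res, key=lambda k: abs(res[k]))
        r = res[node]
        if abs(r) < tol:
            break
        i, j = node
        shift = r / 4.0
        phi[i][j] += shift          # displacement that liquidates this residual
        res[node] = 0.0
        # The relaxation pattern: each interior neighbour gains 'shift'.
        for nb in ((i - 1, j), (i + 1, j), (i, j - 1), (i, j + 1)):
            if nb in res:
                res[nb] += shift
    return phi
```

A block displacement would differ only in moving a group of nodes together (for a clamp, by a unit translation or rotation) and in tabulating the resulting changes in the group resultants as well as in the neighbouring residuals.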


II. Holes in relation to Case ‘A’ of the extensional problem

7. In Case ‘A’ of the extensional problem, boundary tractions are specified, and
the treatment recommended in Part VIIA entails a use of Airy’s stress-function χ.
In all cases which have practical importance the body-forces can be derived from
a potential Ω. Then, the governing equation is†

$$\nabla^2\{\nabla^2\chi + (1-\sigma)\rho\Omega\} = 0, \tag{9}$$

ρ denoting the (uniform) density of the plate; the stress components have the
expressions

$$X_x = \frac{\partial^2\chi}{\partial y^2} + \rho\Omega, \qquad
Y_y = \frac{\partial^2\chi}{\partial x^2} + \rho\Omega, \qquad
X_y = -\frac{\partial^2\chi}{\partial x\,\partial y}, \tag{10}$$

and the boundary conditions can be stated as imposing specified values on X_ν, Y_ν in

$$X_\nu - \rho\Omega\cos(x,\nu) = \frac{\partial}{\partial s}\left(\frac{\partial\chi}{\partial y}\right), \qquad
Y_\nu - \rho\Omega\sin(x,\nu) = -\frac{\partial}{\partial s}\left(\frac{\partial\chi}{\partial x}\right) \tag{11}$$

(the senses being as shown in figure 2). The mathematical problem is to determine
χ as defined by (9) and (11).

† Conditions of plane stress are here assumed. The treatment of plane strain is similar, but
the factor (1 − σ), in (9), must be replaced by (1 − 2σ)/(1 − σ) (cf. Part VIIA, §5).



Circumstances entailing cyclic constants 

8. An immediate consequence of perforations is that ∂χ/∂x, ∂χ/∂y need not be single-valued
as (on the tacit assumption of a single boundary) they were shown to be in
Part VIIA, §7. According to (11), on any closed boundary

$$\oint \frac{\partial}{\partial s}\left(\frac{\partial\chi}{\partial y}\right) ds = \oint\{X_\nu - \rho\Omega\cos(x,\nu)\}\,ds, \tag{i}$$

so ∂χ/∂y is cyclic unless

$$\oint\{X_\nu - \rho\Omega\cos(x,\nu)\}\,ds = 0, \tag{12}$$

i.e. (by Green’s transformation) unless

$$\oint\{X_\nu - \rho\Omega\cos(x,\nu)\}\,ds = \oint X_\nu\,ds - \iint \rho\frac{\partial\Omega}{\partial x}\,dx\,dy = \oint X_\nu\,ds + \iint\rho X\,dx\,dy = 0 \tag{ii}$$

when the surface integral is taken over the whole area of the plate. Equation (12)
is satisfied when there is no internal boundary, for then the quantity on the right
of (ii) must vanish for equilibrium. If on the other hand the plate is perforated,
equilibrium requires (12) to be satisfied only when the contour integral relates to
its whole perimeter: it need not hold in respect of any one internal boundary.
Similarly, according to (11), on any closed boundary

$$\oint \frac{\partial}{\partial s}\left(\frac{\partial\chi}{\partial x}\right) ds = -\oint\{Y_\nu - \rho\Omega\sin(x,\nu)\}\,ds, \tag{iii}$$

so ∂χ/∂x is cyclic unless

$$\oint\{Y_\nu - \rho\Omega\sin(x,\nu)\}\,ds = 0. \tag{13}$$




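Equation (13) must be satisfied, for equilibrium, in respect of the boundary of a singly-connected
region; but it need not hold in respect of any one internal boundary.

Finally,

$$\frac{\partial\chi}{\partial s} = \frac{\partial\chi}{\partial y}\frac{dy}{ds} + \frac{\partial\chi}{\partial x}\frac{dx}{ds},$$

so

$$\begin{aligned}
\chi &= y\frac{\partial\chi}{\partial y} + x\frac{\partial\chi}{\partial x}
 - \int\left\{y\frac{\partial^2\chi}{\partial s\,\partial y} + x\frac{\partial^2\chi}{\partial s\,\partial x}\right\} ds\\
&= y\frac{\partial\chi}{\partial y} + x\frac{\partial\chi}{\partial x}
 + \int\left[xY_\nu - yX_\nu - \{x\sin(x,\nu) - y\cos(x,\nu)\}\rho\Omega\right] ds
\end{aligned} \tag{14}$$

according to (11). When there is only an external boundary, Green’s transformation
can be used to show that

$$\oint\left[xY_\nu - yX_\nu - \{x\sin(x,\nu) - y\cos(x,\nu)\}\rho\Omega\right] ds
 = \oint(xY_\nu - yX_\nu)\,ds + \iint\rho(xY - yX)\,dx\,dy = 0 \tag{15}$$

for equilibrium; so then this expression for χ is single-valued. But (15) need not
hold in respect of any one internal boundary; so there χ as well as ∂χ/∂x and ∂χ/∂y
may be cyclic.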
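
The passage from (11) to the expression for χ is an integration by parts round the boundary. A sketch of the step, written with the symbols of (11) and on the assumption that square brackets denote the change in a quantity over one complete circuit (a notation introduced here for illustration):

```latex
\begin{align*}
[\chi] = \oint \frac{\partial\chi}{\partial s}\,ds
 &= \oint\left(\frac{\partial\chi}{\partial x}\frac{dx}{ds}
    + \frac{\partial\chi}{\partial y}\frac{dy}{ds}\right) ds \\
 &= \left[x\frac{\partial\chi}{\partial x} + y\frac{\partial\chi}{\partial y}\right]
    - \oint\left\{x\,\frac{\partial}{\partial s}\!\left(\frac{\partial\chi}{\partial x}\right)
    + y\,\frac{\partial}{\partial s}\!\left(\frac{\partial\chi}{\partial y}\right)\right\} ds \\
 &= \left[x\frac{\partial\chi}{\partial x} + y\frac{\partial\chi}{\partial y}\right]
    + \oint\left[xY_\nu - yX_\nu
    - \{x\sin(x,\nu) - y\cos(x,\nu)\}\rho\Omega\right] ds .
\end{align*}
```

The bracketed term vanishes when ∂χ/∂x and ∂χ/∂y are single-valued, so that χ is then cyclic only if the remaining contour integral, which is the moment of the boundary actions about Oz, differs from zero.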


9. To summarize, ∂χ/∂x is cyclic unless (13), ∂χ/∂y unless (12), and χ unless (15)
is satisfied for each internal boundary severally. On any boundary for which (12)
is satisfied, the total action has no resultant in the direction of x; on any for which
(13) is satisfied, it has no resultant in the direction of y; and on any for which (15) is
satisfied, it has no resultant moment about Oz. Thus the relaxation process (since
it requires the boundary values of ∂χ/∂x, ∂χ/∂y and χ to be specified uniquely) will
not be feasible unless the tractions at each internal boundary, severally, are self-equilibrating.
If this condition is not satisfied in the problem as presented, the
problem must be modified by removal of the cyclic part of χ.
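
Whether the tractions on a given hole are self-equilibrating can be checked numerically before any relaxation is attempted. The sketch below, in Python, sums sampled tractions round one closed boundary to approximate the contour integrals of (12), (13) and (15). It assumes no body force (Ω = 0); the sample format and the names `traction_resultants`, `circle_samples` and `is_self_equilibrating` are hypothetical.

```python
import math


def traction_resultants(samples):
    """Approximate the resultants of the tractions on one closed boundary.

    samples -- list of (x, y, ds, X_nu, Y_nu): position, element of arc and
               traction components at each sample point (assumed format).
    Returns (X, Y, N): force components and moment about Oz, with the
    body-force potential taken as zero.
    """
    samples = list(samples)
    X = sum(ds * xn for _x, _y, ds, xn, _yn in samples)
    Y = sum(ds * yn for _x, _y, ds, _xn, yn in samples)
    N = sum(ds * (x * yn - y * xn) for x, y, ds, xn, yn in samples)
    return X, Y, N


def circle_samples(radius, traction, n=360):
    """Midpoint samples on a circle about the origin; traction(theta) -> (X_nu, Y_nu)."""
    ds = 2.0 * math.pi * radius / n
    out = []
    for k in range(n):
        theta = 2.0 * math.pi * (k + 0.5) / n
        xn, yn = traction(theta)
        out.append((radius * math.cos(theta), radius * math.sin(theta), ds, xn, yn))
    return out


def is_self_equilibrating(samples, tol=1e-9):
    """True when all three resultants vanish to within tol."""
    return all(abs(r) <= tol for r in traction_resultants(samples))
```

A uniform radial pressure on a circular hole passes the test, whereas a uniform pull in the direction of x (as from a pin in a tie-bar) leaves an unbalanced X, which is the circumstance that makes ∂χ/∂y cyclic.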


[Figure 3]



Procedure for elimination of cyclic constants 
10. Consider in this connexion, and for the notation which is indicated in figure 3, 
the functions r r 6 

x r *-^00080 , (16) 


all of which satisfy (9) with Ω = 0, i.e. when no body-force is operative. The corresponding
stresses are single-valued and have no singularity except at the origin of
co-ordinates, therefore make no contribution to (12), (13) or (15) in respect of any
contour which does not enclose that point; but for each function one or more of
χ, ∂χ/∂x, ∂χ/∂y is cyclic, as shown in table 1. In that table X, Y, N denote the total
forces and couples entailed by the functions, i.e. the contributions which they make
to the left-hand sides of (12), (13) and (15), respectively, for any closed circular
boundary surrounding the origin of co-ordinates.

Table 1 



(i) 

(«) 

(ui) 


Xx 

Xr 

Xn 

10* IPX 
rdr 

cos 0 

sin 0 

A 

nr 

nr 

V 

3r* 

0 

0 

0 

0r\r &d) 

0 

0 

i 

2 nr* 

/* *2ir ^ 

X — — 1 r(rr cos 0— r6 sin 0) dO 

J 0 

Y — ~ f r(rr sin Q + rO cos 0) dO 

J 0 

1 

0 

0 

0 

1 

0 

N = -J r* .rBsdO 

0 

0 

1 

dv ,, dv sin 0 dy 

Bx dr r BO 

sin* 0 

/0 —J sin 20\ * 

sin 0 

2 n 

\ “2 to ) 

2nr 


/0 + i sin 20\* 

008* 0 

cos 0 

{ 2n ) 

2 n 

2 nr 


rOnmO* 

rO cos 0* 

( e Y 

X 

2 n 

2n 

w 


* Cyclic quantities. 


The corresponding actions on other internal boundaries will not be affected because
those boundaries do not enclose the origin: consequently it is easy to attach such
multipliers to the expressions (16) as will make those ‘singularity solutions’ account
for the unbalanced resultants of the tractions on each internal boundary severally.
(They will also entail tractions on other boundaries, but these will be self-equilibrating.)
Thus a problem as presented can be modified so that at every boundary
∂χ/∂x, ∂χ/∂y and χ are single-valued; and apart from constants of integration all
three quantities will be known.

Significance of the constants of integration in the boundary values
of ∂χ/∂x, ∂χ/∂y and χ

11. The constants do not matter when (as in Part VIIA) the plate considered has
no perforations; for then they merely entail an addition to χ which, being linear in
x and y, has no effect upon the stresses. But they matter in relation to holes, exactly
as in the flexural problem (§3) those constants mattered which defined the rigid-body
displacement of the clamp (though now for a physically different reason).
Thus a problem still remains.

Conditions of plane stress are contemplated in the argument which follows, but
with an altered value for σ it will hold in circumstances of plane strain.* The reason
why the constants of integration matter now is that, while any values will define
a solution in which χ (and hence all stresses and strains) are single-valued, this
solution may entail self-straining, which is always a possibility in multiply-connected
bodies (cf., e.g., Southwell 1941, Chap. III). Consequently the relations which fix
the values of the constants are to be sought in the conditions for compatibility of
strain.


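12. Expressed in terms of displacement the strain-components (necessarily
single-valued) are

$$e_{xx} = \frac{\partial u}{\partial x}, \qquad
e_{xy} = \frac{\partial u}{\partial y} + \frac{\partial v}{\partial x}, \qquad
e_{yy} = \frac{\partial v}{\partial y}, \tag{i}$$

and the rotation ϖ is given by

$$2\varpi = \frac{\partial v}{\partial x} - \frac{\partial u}{\partial y}. \tag{ii}$$

On physical grounds u, v and ϖ must be single-valued, and according to (i) and (ii)

$$\frac{\partial v}{\partial x} = \varpi + \tfrac12 e_{xy}, \qquad
\frac{\partial u}{\partial y} = \tfrac12 e_{xy} - \varpi. \tag{iii}$$

Therefore the relations

$$\frac{\partial}{\partial y}\left(\varpi + \tfrac12 e_{xy}\right) = \frac{\partial^2 v}{\partial y\,\partial x} = \frac{\partial e_{yy}}{\partial x}, \qquad
\frac{\partial}{\partial x}\left(\tfrac12 e_{xy} - \varpi\right) = \frac{\partial^2 u}{\partial x\,\partial y} = \frac{\partial e_{xx}}{\partial y} \tag{iv}$$

must be satisfied throughout any region in which u and v are single-valued; and
hence the relation

$$\frac{\partial^2 e_{yy}}{\partial x^2} - \frac12\frac{\partial^2 e_{xy}}{\partial x\,\partial y}
 = \frac{\partial^2\varpi}{\partial x\,\partial y}
 = \frac12\frac{\partial^2 e_{xy}}{\partial y\,\partial x} - \frac{\partial^2 e_{xx}}{\partial y^2} \tag{v}$$

must be satisfied throughout any region in which ϖ is single-valued.

Expressed in terms of χ the relation (v) is equivalent to (9) of §7, so it will be
satisfied by any solution of that equation. In relation to a multiply-connected
region, however, though necessary it is not sufficient, since it merely requires ϖ,
if cyclic, to have the same cyclic constant at every point. To make the constant zero
the condition

$$\oint \frac{\partial\varpi}{\partial s}\,ds = 0$$

must hold for each internal boundary severally: that is, by (iv), the condition

$$\oint\left\{\cos(x,\nu)\left(\frac{\partial e_{yy}}{\partial x} - \frac12\frac{\partial e_{xy}}{\partial y}\right)
 + \sin(x,\nu)\left(\frac{\partial e_{xx}}{\partial y} - \frac12\frac{\partial e_{xy}}{\partial x}\right)\right\} ds = 0,$$

which, expressed in terms of stresses and thence of χ, is

$$\oint \frac{\partial}{\partial\nu}\{\nabla^2\chi + (1-\sigma)\rho\Omega\}\,ds = 0. \tag{17}$$

When (17) holds in respect of every internal boundary, the fact that (9), §7, is satisfied
implies that it will also hold at the external boundary.

* Cf. footnote to §7.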

13. Similar considerations apply in respect of u and v: having satisfied (17) we
have a non-cyclic solution for the rotation ϖ which makes the relations (iv) compatible
throughout the perforated plate; and when (iv) are satisfied then u and v, if cyclic, must
have the same cyclic constant throughout. The conditions that u and v shall be
non-cyclic are, respectively,

$$\oint \frac{\partial u}{\partial s}\,ds = 0 \quad\text{and}\quad \oint \frac{\partial v}{\partial s}\,ds = 0 \tag{vi}$$

for each internal boundary severally. They may be transformed on the understanding
that (i)-(iv) are satisfied and the rotation is single-valued.


Thus 


f du - f Ldu 


according to (iii); therefore, since / = dyjds and w is single-valued, a condition 
equivalent to the first of (vi) is 


« - <f (+ + 

according to (iv). Expressed in terms of stresses and thence of x > this equation 
becomes 

or (sinoe x and dx/fa are single-valued) 


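Clearly, equation (vii) is equivalent to

$$\oint\left[y\frac{\partial}{\partial\nu} - \sin(x,\nu)\right]\{\nabla^2\chi + (1-\sigma)\rho\Omega\}\,ds = 0, \tag{18}$$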





which accordingly is equivalent to the first of (vi); and similar treatment shows the
second of (vi) to be equivalent to

$$\oint\left[x\frac{\partial}{\partial\nu} - \cos(x,\nu)\right]\{\nabla^2\chi + (1-\sigma)\rho\Omega\}\,ds = 0. \tag{19}$$

If satisfied in respect of every internal boundary, (18) and (19) will also be satisfied
in respect of the external boundary. For the sum for all boundaries of the contour
integral in (18) has, by Green’s theorem, the value

$$\iint y\,\nabla^2\{\nabla^2\chi + (1-\sigma)\rho\Omega\}\,dx\,dy = 0,$$

since (9) is satisfied; and a similar argument holds in respect of (19).


Orthodox and relaxational treatment 

14. Orthodox treatment, though it does not require that cyclic constants be
eliminated, normally presents insuperable difficulty. For Relaxation Methods the
elimination is necessary: then, however, comparison of (17)-(19) with (6), §4,
shows that the modified problem can be solved by taking advantage of the analogy
which relates χ in the extensional with w in the flexural problem. Thinking now
of χ as representing flexural displacement, we may envisage ‘clamps’ which specify
χ, ∂χ/∂x and ∂χ/∂y ‘relatively’ in the sense of §3, but which can also undergo rigid-body
displacements and thereby impose constants of integration. Now, according
to (17)-(19), the clamps must ‘float’ because the forces acting on each severally
are in equilibrium. In other respects the relaxational procedure is unaltered.
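
The three ‘floating’ conditions are contour integrals of a function which, by (9), is plane-harmonic within the plate. A short numerical experiment in Python shows why they must be imposed on each hole severally: for a harmonic function all three vanish on a contour enclosing no singularity, but not necessarily on one that does. The sketch is an illustration under stated assumptions: F stands for the harmonic quantity in braces in (9), log r is a hypothetical test function, and ν is taken as the outward normal of the test circle (the paper directs ν into the hole, which only reverses every sign).

```python
import math


def floating_integrals(F, grad_F, centre, radius, n=400):
    """Midpoint-rule values of the three contour integrals on a circle:

        I1 = integral of dF/dnu ds
        I2 = integral of (y dF/dnu - F sin(x, nu)) ds
        I3 = integral of (x dF/dnu - F cos(x, nu)) ds
    """
    cx, cy = centre
    ds = 2.0 * math.pi * radius / n
    i1 = i2 = i3 = 0.0
    for k in range(n):
        t = 2.0 * math.pi * (k + 0.5) / n
        l, m = math.cos(t), math.sin(t)      # cos(x, nu), sin(x, nu)
        x, y = cx + radius * l, cy + radius * m
        fx, fy = grad_F(x, y)
        dF_dnu = l * fx + m * fy
        f = F(x, y)
        i1 += dF_dnu * ds
        i2 += (y * dF_dnu - f * m) * ds
        i3 += (x * dF_dnu - f * l) * ds
    return i1, i2, i3


def log_r(x, y):
    """Harmonic everywhere except at the origin."""
    return 0.5 * math.log(x * x + y * y)


def grad_log_r(x, y):
    r2 = x * x + y * y
    return x / r2, y / r2
```

On a circle which does not enclose the origin all three integrals are zero to rounding error; on one which does, the first is 2π. A relaxation net reproduces the same bookkeeping with sums over the nodes adjoining each hole.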


III. Holes in relation to Case ‘B’ of the extensional problem 


15. In Case ‘B’ of the extensional problem the aim is to determine component
displacements u and v which are governed by (2), §1, and of which the boundary
values are specified. As in §3 the specification may be either ‘absolute’ (so that at
the outset u and v have known values at every boundary point) or ‘relative’ in the
sense that a ‘clamp’ which imposes known displacements at an internal boundary
is itself free to move as a rigid body (now, in the plane of the plate) under the action
of external forces. When the specification is ‘absolute’, internal boundaries present
no special problem. When it is ‘relative’, three constants are required to define
the (small) rigid-body displacement of any clamp, since the most general form of
this displacement is given by

$$u_{cl} = A - Cy, \qquad v_{cl} = B + Cx; \tag{20}$$

so here, as in §3, the solution must determine three constants (A, B, C) from the
known external forces on each clamp.
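
That (20) is a rigid-body displacement, entailing no strain of the clamp, can be confirmed in a few lines. The Python sketch below evaluates (20) and estimates the strain components by central differences; the function names, the step `h` and the sample constants in the usage note are hypothetical.

```python
def clamp_displacement(A, B, C, x, y):
    """Small rigid-body displacement of a clamp in its own plane, as in (20)."""
    return A - C * y, B + C * x


def strains(disp, x, y, h=1e-3):
    """Central-difference estimates of e_xx, e_yy and e_xy = du/dy + dv/dx
    for a displacement field disp(x, y) -> (u, v)."""
    u_xp, v_xp = disp(x + h, y)
    u_xm, v_xm = disp(x - h, y)
    u_yp, v_yp = disp(x, y + h)
    u_ym, v_ym = disp(x, y - h)
    e_xx = (u_xp - u_xm) / (2.0 * h)
    e_yy = (v_yp - v_ym) / (2.0 * h)
    e_xy = (u_yp - u_ym) / (2.0 * h) + (v_xp - v_xm) / (2.0 * h)
    return e_xx, e_yy, e_xy
```

For any choice of constants, e.g. A = 0.3, B = -0.1, C = 0.02, the three strains vanish to rounding error at every point, so the clamp contributes nothing but the three unknowns A, B, C to the problem.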

The relevant formulae are

$$X = \oint X_\nu\,ds, \qquad Y = \oint Y_\nu\,ds, \qquad N = \oint (xY_\nu - yX_\nu)\,ds, \tag{21}$$

in which X, Y, N are the (specified) external forces on any particular clamp, and the
contour integrals extend to its whole perimeter (i.e. to the internal boundary in
question); with

$$X_\nu = lX_x + mX_y, \qquad Y_\nu = lX_y + mY_y,$$

in which

$$l \equiv \cos(x,\nu) = \frac{dy}{ds}, \qquad m \equiv \sin(x,\nu) = -\frac{dx}{ds}, \tag{22}$$


and 


*+4M»' *>-(14:). l23 > 

in which (as in §1) μ denotes the modulus of rigidity, Δ stands for ∂u/∂x + ∂v/∂y, and
Λ is the constant so denoted in equations (2).*

From (21)-(23) it is easy to deduce that 


Y/ft ‘j\£ +d £ +mAiA } ds> 

f Ks + D +y (&“^) +(,,Mf " ?y)A/i1 )*- 


‘(24) 


16. Orthodox treatment (normally very difficult) employs the formulae (24) to
deduce the values of A, B, C in (20), and hence of u_cl, v_cl, in the manner of §5.
A relaxational treatment will use them to deduce the changes made in X, Y, N by
unit displacements or rotation of the clamp as a whole, and will proceed to ‘liquidate’
X, Y, N by ‘block displacements’, in the manner of §6. Thus every new problem
confronted on account of ‘holes’ has its parallel in Case ‘A’ of the flexural problem.


IV. Holes in relation to Case ‘B’ of the flexural problem 

17. In Case ‘B’ of the flexural problem the aim is to determine the transverse
deflexions (w) of the middle-surface, and from these the stresses, tractions being
specified by the boundary conditions: that is to say, at every boundary point we
have known values of G and of (N − ∂H/∂s), quantities which are related with w
by (4), §4. In Part VIIA (§§9-11) this case was shown to be reducible to a problem
mathematically identical with Case ‘B’ of the extensional problem (§16), and like
that problem to be tractable (for Relaxation Methods) by a ‘two-diagram technique’.

* Cf. footnote to §7. In terms of Poisson’s ratio σ,

$$\Lambda = 1 - 2\sigma \ \text{under conditions of plane strain,} \qquad
\Lambda = \frac{1-\sigma}{1+\sigma} \ \text{under conditions of plane stress.}$$





First, a quantity ∇²w₁ was introduced, defined by

$$\left.\begin{aligned}
&\nabla^2(\nabla^2 w_1) = Z/D\\
&\text{and by the boundary condition } \nabla^2 w_1 = 0;
\end{aligned}\right\} \tag*{(14*) (25)}$$

then, two quantities U and V of which the boundary values had to be calculated from


D ~ = Osin(z, v) —cos (z, 
08 


H 


14] 

(-1 


H'J 


(19*) (26) 


and which throughout the plate are governed by the equations 

0A 
dz 

* dA 

dy 
l + o- 


; + AVHJ+ 2X 1 = 0, 
■ + AV 3 V+2Y 1 = 0, 


(24*) 


in which 
and 


A = 


1 — <r' 


A stands for ~+~, (23*) 

ox oy 


< 26 *) 


(27) 


These last have the forms of equations (2), §1, so can be treated as though U and V
were extensional displacements and X₁, Y₁ body-forces acting in the plane of the
plate, once ∇²w₁ has been determined from (25). (This is a standard plane-harmonic
problem.) Then, knowing the distributions of U, V and ∇²w₁ throughout the plate,
we can compute the flexural stresses from

dhv dhv dU 
dy 3+(r dx 3 ~dx +VhVv 


dhv dhv dv 


dz 3 


2(1 -a) 


dhv 


dU dV 


(26*) 


(28) 


dxdy dy dx' 

since to obtain the stress-couples it is only necessary to multiply the left-hand sides 
of (28) by -D, -Z> and -$Z>. 


Procedure for elimination of cyclic constants (1) from U and V 

18. In Part VIIA a plate having only one (external) boundary was contemplated,
and it was shown that at this boundary U and V as found from (26) are single-valued,
the constants of integration having no importance. Neither conclusion holds (in
general) when the plate is perforated. Then, (1) cyclic constants may occur in the
expressions for U and V, for reasons similar to those adduced in §8; (2) constants of
integration added to U and V at an internal boundary will entail (at points within
the plate) changes in the expressions on the right of (28) and consequent changes in
the flexural stresses.

* Asterisks indicate the numbering of equations in Part VIIA.


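19. The cyclic constants can be brought to zero by making appropriate changes
in the boundary condition imposed on ∇²w₁. Leaving the first of (25) unchanged,
we now impose the condition

$$\nabla^2 w_1 = 0$$

at the external boundary only; and at every internal boundary (severally) we require
∇²w₁ to be representable by

$$\nabla^2 w_1 = A + Bx + Cy, \tag{29}$$

where A, B, C are constants such that*

$$\left.\begin{aligned}
\oint\left[N + D\frac{\partial}{\partial\nu}\nabla^2 w_1\right] ds &= 0,\\
\oint\left[H\cos(x,\nu) - (G + D\nabla^2 w_1)\sin(x,\nu) + y\left\{N + D\frac{\partial}{\partial\nu}\nabla^2 w_1\right\}\right] ds &= 0,\\
\oint\left[H\sin(x,\nu) + (G + D\nabla^2 w_1)\cos(x,\nu) - x\left\{N + D\frac{\partial}{\partial\nu}\nabla^2 w_1\right\}\right] ds &= 0.
\end{aligned}\right\} \tag{30}$$

Then in equations (26) G has to be replaced by (G + D∇²w₁);† and remembering that

$$\cos(x,\nu) = \frac{\partial y}{\partial s}, \qquad -\sin(x,\nu) = \frac{\partial x}{\partial s}, \tag{22 bis}$$

we can integrate (26) to obtain the expressions

$$\left.\begin{aligned}
DU &= \int\left\{(G + D\nabla^2 w_1)\sin(x,\nu) - H\cos(x,\nu) - y\left(N + D\frac{\partial}{\partial\nu}\nabla^2 w_1\right)\right\} ds
 + y\int\left(N + D\frac{\partial}{\partial\nu}\nabla^2 w_1\right) ds,\\
-DV &= \int\left\{(G + D\nabla^2 w_1)\cos(x,\nu) + H\sin(x,\nu) - x\left(N + D\frac{\partial}{\partial\nu}\nabla^2 w_1\right)\right\} ds
 + x\int\left(N + D\frac{\partial}{\partial\nu}\nabla^2 w_1\right) ds,
\end{aligned}\right\} \tag{31}$$

which make U and V acyclic in virtue of the relations (30).

Those relations may be written in the forms (cf. (6), §4)

$$\left.\begin{aligned}
D\oint \frac{\partial}{\partial\nu}\nabla^2 w_1\,ds + Z &= 0,\\
D\oint\left\{y\frac{\partial}{\partial\nu}\nabla^2 w_1 - \nabla^2 w_1\sin(x,\nu)\right\} ds + L &= 0,\\
D\oint\left\{\nabla^2 w_1\cos(x,\nu) - x\frac{\partial}{\partial\nu}\nabla^2 w_1\right\} ds + M &= 0,
\end{aligned}\right\} \tag{32}$$

* Equations (30) could be obtained by elimination of Z, L, M from (5) and (6), §4, after
w has been replaced by w₁.

† Cf. the derivation of equations (17*) in Part VIIA.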





Z, L, M denoting the resultant force (along Oz) and couples (about axes parallel
to Ox and Oy respectively) which the specified tractions N, G, H exert upon the
boundary in question. When ∇²w₁ is given its expression (29), then

$$\left.\begin{aligned}
\oint \nabla^2 w_1 \sin(x,\nu)\,ds &= \oint (A + Bx + Cy)\sin(x,\nu)\,ds = -CS,\\
\oint \nabla^2 w_1 \cos(x,\nu)\,ds &= \oint (A + Bx + Cy)\cos(x,\nu)\,ds = -BS,
\end{aligned}\right\} \tag{33}$$

by Green’s transformation, S denoting the area contained within the internal
boundary.*

20. Since −∇²w₁ (within the plate) satisfies the equation

$$D\nabla^2(-\nabla^2 w_1) + Z = 0, \tag{25 bis}$$

it may be interpreted, in accordance with Prandtl’s analogue, as measuring the
transverse deflexion of a membrane stretched with uniform tension D and loaded
with a transverse pressure Z having the direction of z. On this understanding,
∂(∇²w₁)/∂ν measures the line-intensity of the transverse force (in the same direction)
which is exerted by the membrane on the boundary, and (32) accordingly require,
when combined with (29) and (33), that a flat plate having the shape of the internal
boundary, and given a ‘block displacement’ such that a point (x, y) moves transversely
through a distance (A + Bx + Cy), shall ‘float’ under the combined effects
of the membrane tension and of applied forces and couples measured by Z, (L + CS)
and (M + BS). Determination of ∇²w₁ is thus a plane-potential problem of a type
which was discussed in Part III (Christopherson & Southwell 1938). When it has
been solved, substitution for ∇²w₁ in (31) will render those expressions single-valued.
Then, U and V can be computed through the plate from the ‘net approximation’
to (27), §17, by the same ‘two-diagram technique’ which yields displacements
(u and v) in Case ‘B’ of the extensional problem (Section III); and the stress-couples,
if required, can be deduced from (28).


Procedure for elimination of cyclic constants (2) from w 


21. But Part VIIA, in propounding those expressions, did not take note of the
possibility that they may not be compatible with the existence of a single-valued displacement
(w). This possibility we now consider.

Supposing w to be related with U and V by (28), we can integrate the third of 
those equations to obtain g , ^ , 


V = 



-(1-a) 


dtv 

dy’j 


* The normal v is directed into this area. 


(34) . 





ψ denoting some unknown function of x and y. Substitution in the first and second
equations then gives

(35)

showing that ψ is a plane-harmonic function related with the plane-harmonic
∇²w′ of Part VIIA.* It also gives


A== lx + fy " ( 1 + <r ) v V-( 1 - <r ) v *“ , i- 


(36) 


22 . According to (34) and (35) 
( 1 - 


fdhv dhv dhtT\ 


* dx dy' dy l J 


r , dU l /dV dU\ 

, ain 

= (l+<r)[v»- s , -j( S + ^). 


r BV di! 

(dV dU\ 

= L¥“ <r ^ +(: y ” “ i(1+<7) 

\0X + dyj’ 


dU dV 

-= -cr— + (1 —<r) W 

dx ay 




(37) 


in virtue of (36). Therefore in relation to a closed boundary ∂w/∂x will be cyclic unless

Ids 


= -<j> j(l+<r) ^ j + (1 -<7)sin(x,i')(A + 2V 2 tt> 1 )j(i!<, 


(38) 


∂w/∂y will be cyclic unless

= - |(1 + <t) ™- (1 -<r) cos (x, v) (A + 2V 2 w>j)j de, (39) 

, Cdw , cw cw [7 v z u' d s w\ 

“ d ”-J* i *-*5 +I s-Jl»ssi +a ’5£) 


ds 


* Comparing (34) with (20*), we have 

dx ~ dy* 

showing that (35) is equivalent to 


d\jf „ ^ 
dy dx* 


= V*w' t where V*^=0. 


This is (15*) of Part VII A. 




will (when ∂w/∂x and ∂w/∂y are single-valued) be cyclic unless

dhv 


C { l \ / 

0 - 2(1 -or*) j> |cos (*,") (y + *^) - sin (ar, ») (y 

= <j) £( 1 — <r) {y cos {x, i- 1 ) — a; sin (x, (- 1 )} (A + 2 W,) 


d*w 

dxdy + X dx % 


’)h 


( dZJ 0p\ 
dx~dy) 

(1 + or ) {y sin (*, v) - x cos (x, i/)} da. (40) 


23. Writing l, m for cos(x, ν), sin(x, ν), also Λ for (1 + σ)/(1 − σ) in conformity
with (27), we can state the conditions (38)-(40) as


X (say) s |“ + + m( A 4* 2V 2 w 1 )/^4 j da = 0 , 

Y (say) A + SV^J/ii) da = 0 , 

N (say) s |a:^~ + 1^) + ~ j + (mx - iy) (A + 2 VhcJjA J da = 0 . 


(41) 


These, which compare with (24), §15, show that when U, V are treated as extensional
displacements, and when they have been made acyclic (single-valued) by a proper
choice of ∇²w₁ (§19), then to make w and its first derivatives single-valued we must
ensure that X, Y, N shall vanish severally for each internal boundary. (Then they
will also vanish at the external boundary, since the plate as a whole is in equilibrium.
This can be shown by using Green’s transformation on (41); for in the resulting
surface-integrals the integrands vanish in virtue of (27).)


V. Principles of the relaxational attack 
Résumé

24. Excluding cases in which the boundary conditions are ‘mixed’ (§2), we have
distinguished four main types of problem concerned with multiply-connected
(perforated) plates, and we have seen that in all four the requirements at an internal
boundary amount, either in fact or by analogy, to a specification of external
actions on a ‘clamp’ which is free to move.

In Section I (§§3-6) such clamps have real existence, and each can execute a rigid-body
displacement under external actions Z, L, M of which the expressions in
terms of the transverse displacement (w) are given in (6).

In Section II (§§7-14) w is replaced by Airy’s stress-function χ, which (with its
first derivatives ∂χ/∂x and ∂χ/∂y) need not (like w) be acyclic because only its second
derivatives have physical meaning. It will be cyclic unless (§9) the tractions on each
internal boundary, severally, are self-equilibrating; and when it is cyclic, then the
problem as presented must be modified by removal of the cyclic constants, a
matter which is discussed in §10. The modified problem is tractable by Relaxation
Methods, once values have been given to the constants of integration which appear
in the boundary values of ∂χ/∂x and ∂χ/∂y: the proper values are those which
entail no self-straining (§11), and these can be determined (§14) by the relaxational
treatment employed in Section I. That is to say, treating χ as though it represented
flexural displacement, and introducing a rigid but mobile ‘clamp’ at each internal
boundary, we can interpret the conditions for no self-straining (namely,
(17)-(19) of §13) as requiring every clamp, severally, to ‘float’ in the ideal (flexural)
problem.


25. The integral forms of Z, L, M in (6), and of the boundary conditions (17)-(19),
entail no difficulty in their application to a net of finite mesh. In all
of them ν stands for the normal directed outward from the plate, that is, into
the closed internal boundary; consequently Green’s transformation applied to (6)
yields the expressions


$$Z = \iint Z(x,y)\,dx\,dy, \qquad L = \iint y\,Z(x,y)\,dx\,dy, \qquad -M = \iint x\,Z(x,y)\,dx\,dy \tag{42}$$


when (in accordance with (1) of Section I) Z(x, y) stands for the surface-intensity
of a transverse pressure imposed upon the clamp, and when the integrations extend
to its whole area; and similar treatment of the surface-integrals in (17)-(19) gives
in place of those conditions

$$Z = L = M = 0 \tag{43}$$

when Z, L, M are given by (42) with

$$Z(x,y) = \nabla^2\{\nabla^2\chi + (1-\sigma)\rho\Omega\}. \tag{44}$$

In the flexural problem (Section I), Z(x, y) is specified at points within the plate,
but within the clamp (since this moves as a rigid body) may have any distribution
compatible, according to (42), with the specified values of Z, L, M. In the extensional
problem (Section II), Z(x, y) as defined in (44) must vanish at points within the plate,
since (44) must be identical with (9), § 7: within the clamp it need not vanish everywhere,
but its distribution must give zero values for Z, L, M according to (42). In
either problem, for computation performed on a net of finite mesh, the distributed
pressure may be replaced (as is usual) by ‘concentrated’ forces at nodal points, and
the surface integrals in (42) by summations extending to all nodal points within
the clamp.
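
The replacement of the surface integrals in (42) by nodal summations may be sketched as follows. This is an illustration only, not the computing scheme of the paper: it assumes that the clamp is described by a list of nodal forces (x, y, F), each F standing for Z(x, y) multiplied by the area of net tributary to that node. The function names and the sample values are hypothetical.

```python
def clamp_resultants(nodal_forces):
    """Sum concentrated nodal forces into the resultants Z, L, M of (42).

    nodal_forces: iterable of (x, y, F) triples, F being the concentrated
    transverse force assumed to act at the nodal point (x, y) of the clamp.
    """
    Z = sum(F for _, _, F in nodal_forces)
    L = sum(y * F for _, y, F in nodal_forces)
    M = -sum(x * F for x, _, F in nodal_forces)  # (42) states -M = sum of x.F
    return Z, L, M


def floats(nodal_forces, tol=1e-9):
    """True when Z = L = M = 0 within tol, i.e. condition (43) is met."""
    return all(abs(r) <= tol for r in clamp_resultants(nodal_forces))


# Hypothetical clamp: four nodes at the corners of a unit square,
# loaded so that force and both moments cancel.
forces = [(0.0, 0.0, 1.0), (1.0, 0.0, -1.0), (1.0, 1.0, 1.0), (0.0, 1.0, -1.0)]
print(clamp_resultants(forces), floats(forces))
```

In the flexural problem the three sums would be compared with the specified Z, L, M; in the extensional problem they would be required to vanish, as in `floats`.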

‘Clamps’ also make an appearance in Sections III and IV (in Section IV by
analogy); but here they execute rigid-body (‘block’) displacements in the plane of
the plate. Our treatment contemplates employment of a ‘two-diagram technique’.
The total external action on any clamp is the resultant of X, Y, N as related with 
the displacements by (24), § 15. 




26. In the extensional problem (Section III), values of X, Y, N are specified.
Again ν stands for the normal directed into the internal boundary, so now, applied
to (24), Green’s transformation gives


$$X = \cdots = \iint \rho X\,dx\,dy, \qquad Y = \cdots = \iint \rho Y\,dx\,dy, \qquad N = \cdots = \iint \rho\,(xY - yX)\,dx\,dy, \tag{45}$$


ρX, ρY denoting (as in (2), § 1) the components of body-force at points within the
clamp, and the integrations (as in § 25) extending to its whole area. Like Z(x, y) in
§ 25, X and Y may have, within the clamp, any distributions which yield, according
to (45), the specified values of X, Y, N; and may be replaced by concentrated forces
at nodal points, in computations performed on nets of finite mesh.

27. Section IV (§§ 17-23) makes Case ‘B’ of the flexural problem tractable by
analogy on the lines of §§ 15-16, i.e. by a ‘two-diagram technique’; but here
additional complication is entailed by the necessity of making the displacement (w)
single-valued, and it seems likely that other methods will be found more convenient.
The wanted quantities are U and V as governed by (27), § 17, and having
boundary values computed from (31), § 19. In the latter, ∇²w₁ is an ancillary function
introduced in order to make U and V acyclic: within the plate it must satisfy

$$\nabla^2(\nabla^2 w_1) = Z/D \tag{29 bis}$$

and at every internal boundary severally it must be representable by a linear
expression

$$\nabla^2 w_1 = A + Bx + Cy,$$

in which A, B, C have values such that (32) are satisfied, i.e. the relations


Dj>~Vhv t d8 + Z = 0, 
D + L + CS • 0, 

D&z^-VhVids — (M + BS) — 0, 


(46) 


in which Z, L, M and S have the meanings explained in § 19.

With its sign reversed, ∇²w₁ can be interpreted as tension in a membrane loaded
by transverse pressure Z/D and attached to a light rigid plate at each internal
boundary; and since in (46) ν denotes (as in §§ 25 and 26) the normal directed into
this rigid plate, Green’s transformation applied to the contour integrals gives as
equivalent relations




$$\iint Z(x,y)\,dx\,dy = Z, \qquad \iint y\,Z(x,y)\,dx\,dy = L, \qquad -\iint x\,Z(x,y)\,dx\,dy = M, \tag{42 bis}$$


Z(x, y) denoting the pressure within the area of the plate. 

28. The principles of the relaxational technique (including a use of ‘block
displacements’ to simulate rigid-body displacements of the ‘floating’ plates) have been
explained in preceding papers of this series, so do not call for discussion here.
Details will be explained most easily in relation to a specific example, and we now
turn to an extensional problem of the kind discussed in Section II. This will
exemplify every process that is entailed by the treatments proposed in Sections I,
III and IV, excepting the relatively simple (‘plane-harmonic’) problem of determining
∇²w₁ in Section IV; and the extensional is the most important, practically,
of the problems set by perforated plates.


VI. Details of the relaxational technique
An extensional example 

29. The specific problem is shown in figure 4: the stresses induced in a flat plate
tension member (‘tie’) by longitudinal force distributed uniformly over the outer
half of either hole. We treat a shape (discussed by Coker & Filon 1931; cf. their
figure 6.182, from which the left half of figure 4 is reproduced) in which
c = 1.06b, and assume the centres O, O′ (figure 4) to be separated by a distance 12b.
For a tie of this length it may be postulated that the central part is subjected to
simple and uniform tension, and its symmetry permits the problem to be stated
in relation to one-half. Considering the half to right of the centre-line CC in figure 4,
we can state the boundary conditions as follows:

along CC\ = - £P/c, Y v - 0, 

along the remainder of the outer boundary, X v = Y y = 0, 

1 P 

along the arc DEI) of the circular hole, X v = 1^, = 0, 

along the remainder of the circular hole, X u = = 0* 

P then measures the total tension: the total action on the internal (circular) 
boundary is a force 

X = P, (48) 

and an equal and opposite force is exerted by the stresses which act across CC. The
magnitude of the tension is immaterial, since solutions may be multiplied in accordance
with Hooke’s law. We give to P and X (in Table 1) the value 10³, in order to
eliminate decimals.

We adopt a square-mesh relaxation net of mesh-side a = $6. 



Initial modification of the problem as presented 

30. Proceeding in accordance with § 10, we have first to modify the problem as 
presented so as to leave self-equilibrating tractions on the circular boundary; and 
for that purpose we employ the first of the functions (16), with origin at O in figure 4,
and with the value 10³ given to P (§ 29).

First, from the wanted solution we have to remove the stresses given in column (i) 
of Table 1: that is, from the specified boundary tractions (47) we must deduct, on
the circular hole, 

X v (= -frcostf) = 10 3 ~J, Y„ (= ~ff sin0) = lO 8 ^, (49; 

and corresponding expressions on the outer boundary; also, throughout the specimen,
stress-components given by

X x , X v , Y v * ff x (cos 2 6 , \ sin 28, sin* 6) 

10 3 

= - x (*®, *#*)• (60) 

Figures 5 exhibit values of the deducted stresses at nodal points of the chosen net.
The specified boundary tractions are (47) with P replaced by its assumed value 10³;
so the modified boundary tractions can be calculated, and from them acyclic values
of χ, ∂χ/∂x, ∂χ/∂y (that is, of χ at nodal points just inside and just outside the
boundaries) to be imposed as a ‘relative’ specification of the ‘clamp’ displacements in
the flexural analogue of § 14.



[Figures 5 and 6: nodal values recorded on the relaxation net.]


The relaxational solution 



31. The problem now to be solved by Relaxation Methods is to determine χ by a use
of the biharmonic ‘pattern’, permitting such additional (rigid-body) displacements
of the internal boundaries (clamps) that each ‘floats’ freely under zero forces. When
this problem has been solved, values of the three stress-components can be deduced;
and in order to obtain the wanted stress-system it only remains to add the stresses
previously deducted, namely, the stresses which are recorded in figures 5.

It is not necessary to explain the now standardized process whereby the relaxational
solution was obtained. In figures 6, the first diagram (a) records the computed
values of χ (including those which come from displacement of the floating clamps),
the other three diagrams record the derived values of X_x, X_y and Y_y. These latter,
which of course are subject to the unavoidable inaccuracies of a method based on
finite differences, are in general smaller than the deducted stresses (recorded in
figures 5) which of course are exact. Consequently the final solution (obtained by
superposition of the stresses from figures 5 and 6) though not exact may be regarded
as of high accuracy.
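
The biharmonic ‘pattern’ mentioned in § 31 is not written out in this paper. The sketch below assumes the standard 13-point finite-difference approximation to ∇⁴χ on a square net of mesh-side a; the function name and the test function are illustrative only, and the sign convention attached to ‘residuals’ in the earlier papers of the series may differ.

```python
def biharmonic_pattern(chi, i, j, a=1.0):
    """Finite-difference estimate of the biharmonic of chi at node (i, j).

    chi is a 2-D list of nodal values on a square net of mesh-side a.
    Weights (an assumed, standard 13-point pattern): 20 at the node,
    -8 at the four nearest nodes, +2 at the four diagonal nodes,
    +1 at the four nodes two mesh-lengths away along the axes.
    """
    near = chi[i + 1][j] + chi[i - 1][j] + chi[i][j + 1] + chi[i][j - 1]
    diag = (chi[i + 1][j + 1] + chi[i + 1][j - 1]
            + chi[i - 1][j + 1] + chi[i - 1][j - 1])
    far = chi[i + 2][j] + chi[i - 2][j] + chi[i][j + 2] + chi[i][j - 2]
    return (20.0 * chi[i][j] - 8.0 * near + 2.0 * diag + far) / a ** 4


# Check on chi = x^2 y^2, whose biharmonic is 8 everywhere.
# Node (i, j) is placed at x = i - 2, y = j - 2 on a 5 x 5 net with a = 1.
net = [[float((i - 2) ** 2 * (j - 2) ** 2) for j in range(5)] for i in range(5)]
print(biharmonic_pattern(net, 2, 2))
```

In a relaxational computation the quantity returned at each node within the plate would be driven towards its required value (zero for the modified extensional problem) by successive adjustments of χ.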

The final solution 

32. This final solution is presented in figure 7, by nodal values and contours of the
principal stress-difference, a quantity calculable from the formula

$$p_1 - p_2 = \sqrt{(X_x - Y_y)^2 + 4X_y^2}, \tag{51}$$

in which X_x, X_y, Y_y are to be given their values in the final (superposed) solution.
The directions (θ) of the principal stresses, exhibited in their figure 6.182 by
Coker & Filon (1931), could of course be calculated from the formula*

$$\tan 2\theta = 2X_y/(X_x - Y_y), \tag{52}$$

but are not recorded here.

* The formulae (51) and (52) are derived (e.g.) in Southwell 1941, § 136.
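
Formulae (51) and (52) are easily evaluated at each nodal point. The following is a minimal sketch; the function names and the sample stresses are illustrative and are not values from figure 7. The use of `atan2` is a choice made here so that the angle remains defined when X_x = Y_y; it returns one of the two principal directions, the other lying at right angles to it.

```python
import math


def principal_stress_difference(Xx, Xy, Yy):
    """p1 - p2 from formula (51)."""
    return math.sqrt((Xx - Yy) ** 2 + 4.0 * Xy ** 2)


def principal_direction(Xx, Xy, Yy):
    """An angle theta (radians) satisfying formula (52)."""
    return 0.5 * math.atan2(2.0 * Xy, Xx - Yy)


# Illustrative nodal stresses (arbitrary units).
Xx, Xy, Yy = 3.0, 1.0, 1.0
print(principal_stress_difference(Xx, Xy, Yy))
print(math.degrees(principal_direction(Xx, Xy, Yy)))
```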



Conclusion 

33. Though some of its conclusions have been anticipated (e.g.) by Filon (1921),
this general discussion of multiply-connected (perforated) plates is believed to be 
for the most part new. It had its origin in questions put to us by an engineer engaged 
in study of the stresses in the frames of locomotives. For theoretical estimates he 
had relied on Relaxation Methods, and had found these satisfactory; but Part VIIA 
gave no guidance in regard to the treatment of ‘lightening holes’ (cut with the aim
of saving weight), and lacking guidance he had been compelled to neglect the holes. 
He believed that no great error had resulted; but now, having arrived at tentative 
conclusions, he desired to remove this source of uncertainty. 

Initially it was suspected that allowance for holes, though feasible in theory, in 
practice would entail prohibitive labour. But on examination these fears proved 
groundless: so far from being prohibitive, the labour entailed by holes was seen to 
be quite small provided that their edges are not subjected to external tractions, 
fairly small provided that these tractions are self-equilibrating, and serious only in 
such problems as the example treated in this paper. These conclusions seemed 
sufficiently important to warrant publication, also some points remarked in the 
inquiry which appeared to have been generally overlooked: e.g. the possibility that 
solutions found by a use of Airy’s stress-function may, in relation to multiply-connected
(perforated) plates, be inadmissible for the reason that they (tacitly)
presume ‘self-straining’.

34. Those parts of the paper which deal with basic theory had been completed 
by the autumn of 1946, but publication had to wait for the numerical computations 
in Section VI. For these I am indebted to Miss E. D. Lang and to Miss G. Vaisey,
also to the Clothworkers’ Company for grants which made their collaboration 
possible in the summer of 1947. To Mr D. N. de G. Allen my thanks are due for his 
preparation of the diagrams and for much helpful advice. 

Recently a paper has been published by R. D. Mindlin (1946) which also discusses
the analogy between the equations for flexure (‘slabs’) and extension (‘slices’),
with special reference to perforations. It propounds integral relations which are
confirmed by the conclusions of this paper; but I do not think it makes this paper
superfluous, since it is not concerned with numerical computation. Mindlin contemplates
a use of flexural experiments to determine extensional stresses: my own
concern has been to show that the effects of holes can be determined without recourse 
to experiment. 

References 

Christopherson, D. G. & Southwell, R. V. 1938 (‘Part III’) Proc. Roy. Soc. A, 168, 317.
Coker, E. G. & Filon, L. N. G. 1931 A treatise on photoelasticity. Cambridge Univ. Press.
Filon, L. N. G. 1921 Brit. Ass. Rep. (1921), 305-316.
Fox, L. & Southwell, R. V. 1941 (‘Part VIIA’) Phil. Trans. C, 1, 15; A, 239 (1945), 419.
Love, A. E. H. 1927 The mathematical theory of elasticity (4th ed.). Cambridge Univ. Press.
Mindlin, R. D. 1946 Quart. J. Appl. Math. 4, 279.
Southwell, R. V. 1941 Introduction to the theory of elasticity (2nd ed.). Oxford Univ. Press.



The limits of flame propagation at atmospheric pressure 
I. The influence of ‘promoters’

By Sir Alfred Egerton, Sec.R.S. and J. Powling, Ph.D.
(Received 7 November 1947. Read 11 March 1948)


The conditions for accurate determination of the limit mixtures for propagation of flame
having been established, the effect of the addition of 0.5 % of such promoters and inhibitors
as ethyl nitrate, diethyl peroxide, nitrogen peroxide, ozone, ether, acetaldehyde, ethyl
alcohol and methyl iodide on the upper and lower limits of hydrogen-air, methane-air,
n-butane-air (and propagas-air) mixtures was investigated. It is shown that the effects
obtained are accounted for if the heat which the additive can contribute to the total heat
available in the burning of the mixture is included. The only distinct promoting effect was
that of ethyl nitrate on the upper limits of n-butane and propagas. Methyl iodide had a
definite inhibiting effect except with hydrogen mixtures at the lower limit. The appearance of
the flames has been carefully recorded as it gives some indication of the nature of the
combustion process. The general inference is that pre-flame reactions have little influence on the
limit mixtures, and that those mixtures are mainly determined by the heat available and
consequently the temperature of the flame; inflammation being mainly caused by the radicals
provided by the flame.

The limit mixtures of ethyl nitrate and nitrite were measured. The effect of replacement of
oxygen by nitrous oxide on the hydrogen limits, and the effects of small additions of lead
tetraethyl and of 0.5 % ethyl borate on n-butane limits are also noted.


1. Introduction

When mixtures of gases are too ‘weak’ in combustible they cease to propagate
flame indefinitely and the flame dies out; when they are too ‘rich’, again the flame
ceases to be propagated. Many investigations have been made on these lower and
upper ‘limit mixtures’: the results of such investigations have been summarized
by Coward & Jones (1928) and the summary has been revised by them more
recently (1938). One object of the present work was to establish whether or not
the composition of limit mixtures would be considerably affected by the addition
of small quantities of other substances, ‘promoters’ or ‘inhibitors’ of combustion.
When large quantities of fuel have to be burnt completely and rapidly in as small
a space as possible (as, for instance, in the combustion chambers of turbo-jet
engines) the addition of substances hastening the combustion might be beneficial,
and if the limits of combustion were widened, the combustion might be more
complete. A more fundamental object of the work however was to discover more
about the reactions which lead to inflammation in a flame front; at the limits the
flame speeds are slow and if reactions occur prior to inflammation they may be
influenced by ‘promoters’. The current view of flame propagation is not that
ignition occurs in the unburnt gas owing to the heat conduction from the flame, but
that a certain critical number of reaction centres giving rise to a minimum rate of
reaction must be formed in the flame front. Townend (1939), for instance, suggests
that ‘promoters’ might therefore have a considerable influence, particularly for
limit mixtures. The outcome of this investigation, however, has been to show that
the limits of propagation are mainly governed by the temperature of the flame and
not to a marked extent by pre-flame reactions. No real advantages are to be
derived by the addition of so-called ‘promoters’, so far as limit mixtures are
concerned.

The investigation divides itself into two parts: part I deals with the effect of 
small additions of different substances on the limit mixture compositions for 
several combustible gases, and part II with the influence of the physical properties 
of mixtures on the limits. An upper limit will be referred to as ‘u.l.’ and a lower
limit as ‘l.l.’.


2. Experimental

(a) General conditions for limit determinations 

The considerable discrepancies in the recorded data of limit mixtures are due 
in part to the different conditions in which the measurements were made. It is 
well known that below a certain diameter, the cooling effect of the wall of the tube 
influences the composition of the mixture which will propagate flame. The tubes 
must, therefore, be wide enough for this effect to be negligible. They must also be 
long enough to permit the products to cool behind the flame front, and to allow 
the criterion of continued propagation to be ascertained. White ( 1924 ) has shown 
that, for most mixtures, the wall effect is very small above 5 cm. tube diameter
and that a length of about 1.5 to 2 m. is sufficient. The mode of ignition makes
some difference, but a naked coal-gas flame (1 in. jet of flame) drawn across the
open end at the same time as the base plate is carefully slid away is usually
satisfactory: the other end of the tube is kept closed. The direction of propagation
influences the limits and must always be stated, for it is not possible to avoid 
entirely the effects of convexion. In upward propagation the reduction in density 
in the flame front tends to favour propagation, the reverse is true for downward 
propagation, horizontal propagation being intermediate with an unsymmetrical 
flame front. Upward propagation gives better flame stability and consequently 
greater accuracy in limit determinations. Initial temperature influences the 
combustion limits, but room-temperature changes of a few degrees are insufficient 
to affect the results appreciably. The presence of water vapour may also influence 
the results, particularly with mixtures containing carbon monoxide, therefore the 
state of dryness of the mixture should be stated. 

(b) Experimental details

The limits for upward propagation were determined in a vertical pyrex glass
tube approximately 1.5 m. long and internal diameter 5.3 cm. The leads to manometer,
gas storage vessels, oil pump, etc., entered through a cork which closed the
tube at the top. This was rendered vacuum-tight with a coating of picene wax.
The bottom end of the tube was fitted with a brass cap which did not constrict
the mouth of the tube and was closed by a ground brass plate (figure 1).

The gas mixtures were prepared in the tube by evacuating and admitting each
component to the required partial pressure as measured by the manometer. The
small quantities of promoter vapour (0.5 % of the total volume of the mixture)
were first withdrawn from a small reservoir of the liquid into the evacuated tube.
Room temperatures were 18 ± 2° C; and all mixtures were dried over calcium
chloride.



In order to mix the gases rapidly a small high-speed, vacuum-tight circulating
fan of the centrifugal type was connected by 1 cm. bore glass tubes to the
extremities of the combustion tube. The shaft of the fan was tightly packed with
asbestos mixed with a non-volatile graphite grease. Although the top of the tube
was strictly not a ‘closed end’, the limited size of the vent, representing only about
3.5 % of the end area, together with the resistance in the leads and fan, prevented
any appreciable flow of gas from the top of the tube during a combustion.

A small 1 in. jet of burning coal-gas provided the source of ignition at the lower
(open) end. The limiting concentration of a fuel in air for complete propagation
throughout the tube was the mean value of the concentration (in percentage by
volume of the gas in air) of the two mixtures, one which just completed combustion
and one which just failed. On average, about twenty combustions were used to
determine and check each limit. The error given in the tables which follow in the
text represents the reproducibility of the results under these experimental
conditions. The concentration of promoters was maintained at 0.5 % by volume in
all the experiments; no attempt has been made to study the influence of the
variation of their concentration.
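
The rule for fixing a limit from the two bracketing mixtures can be written out in a few lines. The sketch below is illustrative only: the function name and the two concentrations are hypothetical, and the half-width returned is simply half the gap between the two mixtures, not the reproducibility quoted in the tables.

```python
def limit_from_bracket(propagated, failed):
    """Return (limit, half_width) from two bracketing mixtures.

    propagated: % by volume of fuel in the mixture which just completed
                combustion throughout the tube.
    failed:     % by volume of fuel in the mixture which just failed.
    The limit is taken as the mean of the two, as described in the text.
    """
    limit = 0.5 * (propagated + failed)
    half_width = 0.5 * abs(propagated - failed)
    return limit, half_width


# Hypothetical lower-limit determination: 2.57 % just propagates, 2.53 % just fails.
limit, half_width = limit_from_bracket(2.57, 2.53)
print(round(limit, 2), round(half_width, 2))
```

The same function serves for an upper limit, where the failing mixture is the richer of the two.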

(c) Preliminary experiments with propagas-air

It was convenient to start the investigation by making a preliminary survey of 
the effects of various substances upon the limits of inflammability of a sample of 
propagas.* 

The effect of 0.5 % by volume of the following additives was tried:

Diethylperoxide, because of its pronounced ‘proknock’ effect in concentrations
as low as 10⁻⁴ mol. and because of the detection of peroxides in slow combustion
products and even in flames. It was prepared by Baeyer & Villiger’s method,
b.p. 58 to 59.7° C (625 mm.).

Ethyl nitrate (B.D.H. ‘pure’) because of its similar ‘proknock’ effect and its
probable dissociation to C₂H₅O radicals, similarly to ethyl nitrite.

Nitrogen peroxide , because it is a non-combustible peroxide with pronounced 
effect in lowering ignition temperatures. 

Acetaldehyde , because it diminishes the ignition lag of hydrocarbons. 

Ether , for comparison with diethylperoxide. 

Ethyl alcohol , for general comparison. 

(i) Results

The limits of propagation obtained, expressed in percentage by volume of
propagas in propagas-air mixtures, are tabulated in table 1.

All the additives lower the l.l. but, as this is expressed in terms of propagas only,
no conclusions regarding possible catalytic effects should be drawn since the
addition of any combustible material would be expected to reduce the amount of
propagas required in the limit mixture. The nitrogen peroxide does not contribute
in this way and the reduction of the limit from 2.55 to 2.41 % propagas might
therefore be ascribed to a catalytic influence. Ethyl nitrate clearly has an effect on
the u.l., which is raised in spite of the increase in combustible material due to the
promoter itself. Nitrogen peroxide also increases the u.l. slightly.

The alteration of the limits may be due to two separate influences: (a) an increase
in the combustible material, and (b) a further catalytic combustion-promoting
activity. The two influences may be considered to be supporting each other towards
the l.l. and opposing each other at the u.l. Conclusions to be drawn from the
above results will be discussed later in § 3.

* See footnote to table 1.


Table 1

promoter                  l.l.       u.l.
propagas† alone           2.55       9.0
(C₂H₅)₂O₂ (0.7 %)         1.72       9.7
(C₂H₅)₂O (0.5 %)          2.03       9.2
C₂H₅ONO₂ (0.6 %)          2.22       12.4
C₂H₅OH (0.5 %)            2.31       9.3
NO₂ (0.5 %)               2.41       9.8
CH₃CHO (0.5 %)            2.14       9.5
limits of error           ±0.04      ±0.1

† Propagas: C₂ hydrocarbons, trace; C₃ hydrocarbons, 96 %; C₄ hydrocarbons, 4 %; 90 % saturated.


(ii) Description of the appearance of the flames 

There are considerable differences in the appearance of the flames; it is important
to record differences as they provide information about the chemical and physical
changes which occur, but the appearance of the propagas flames was similar to
that of the flames obtained in butane mixtures, which are described later (see p. 181).

(d) Hydrogen-air

The preliminary experiments with propagas mixtures were followed by a careful 
measurement of the effect of the same additives on limit mixtures of hydrogen, 
methane, and butane with air all under similar conditions. 

The normal limits of inflammability of hydrogen in air were determined first 
for comparison with values found in the literature, which are many and varied 
and depend upon the conditions in which the measurements were made. Complete 
darkness was essential for observation of the flames which are only weakly luminous. 
At the l.l. the flame front tends to break into ‘pockets’ of combustion, and these
have to be distinguished from flame caps which rise, diminishing, through the
tube in mixtures of weaker composition than that of the lower limiting value.
The criterion for indefinite propagation was that the ‘pockets’ of flame should all
be contained in an area normally occupied by a uniform flame front (as opposed
to burning individually and irregularly through the gas) and that they should not
diminish as they progressed up the tube.

In tubes of 5 cm. diameter, White (1924) gives the l.l. for upward propagation
as 4.15. Coward & Jones (1928) quote 4.1 ± 0.1; Coward & Brinsley (1914) made
many observations in still wider tubes (limit 4.1) and discussed the above criterion
for certifying the limit. For the u.l. for upward propagation, Coward & Jones
quote 74 ± 0.5; White (1924), 74.5.




In addition to the substances already mentioned, ozone, which has been found 
to be a strong proknock, was included as a promoter, and methyl iodide as an 
inhibitor. Ozone has been supposed to be a chain carrier in CO oxidation by Lewis 
& von Elbe (1937); methyl iodide suppresses explosive and slow oxidation of
hydrocarbons, carbon monoxide and hydrogen, by removing hydrogen atoms and other
free radicals.

For ozone a modification of the apparatus was, of course, necessary. As far as 
possible glass was used throughout and both ends of the tube were protected by 
hard paraffin wax. The metal fan box had to be eliminated, mixing being effected 
by rocking a glass sphere up and down the tube. 

The ozone was prepared by passing oxygen, dried over P₂O₅, through a powerful
silent discharge across the annular space in a Berthelot tube and was stored over
concentrated sulphuric acid in a glass gasholder. Decomposition in the storage
vessel was quite slow, approximately 10 % of the ozone decomposing in 24 hr.
The dilute ozone was analyzed immediately before use and the appropriate amount
of oxygen-ozone mixture introduced into the combustion tube. Oxygen and
nitrogen were added to bring the gases in the tube, apart from the fuel, to the
composition of air, the ozone being considered as oxygen. Experiments were
carried out to determine the maximum amount of ozone decomposed in the gas
leads and combustion tube before ignition. For l.l. concentrations decomposition
was found to be between 6 and 6 % of the ozone. For 1 min. delay in igniting the
mixture, decomposition increased to about 12 %, for 5 min. delay to 17 %. The
concentration of ozone needed to introduce 0.5 % of the total volume of the mixture
into the u.l. hydrogen mixtures was higher than could be conveniently obtained.


(i) Results 


The results for the limits of propagation for hydrogen mixtures are summarized
in table 2.

Table 2

                                  % vol. H₂
promoter                       l.l.       u.l.
hydrogen alone*                4.19       74.6
(C₂H₅)₂O₂   (0.5 %)            3.82       68.5
C₂H₅ONO₂    (0.5 %)            3.92       70.9
NO₂         (0.2 %)            4.15       73.7
NO₂         (0.5 %)            4.10       72.9
NO₂         (0.9 %)            4.18       71.7
CH₃I        (0.5 %)            4.19       61.3
O₃          (0.4 %)            3.92       —
limits of error                ±0.04      ±0.3

* (99.5 % H₂, the remainder for the most part nitrogen.)


Diethyl peroxide and ethyl nitrate reduce both the upper and lower limits, a
behaviour compatible with the mere addition of extra combustible material.
Increasing quantities of NO₂ lower the u.l. proportionally, but a linear relationship
is not obtained at the l.l. These l.l. figures for NO₂ differ by very little from the
normal value. The effect of methyl iodide appeared to be confined to the upper limit.
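
The statement that NO₂ lowers the u.l. proportionally can be checked against the figures of table 2 by fitting a straight line. The sketch below is an illustrative check, not part of the original analysis: the function name is hypothetical, and the NO₂ percentages are taken as 0, 0.2, 0.5 and 0.9 as read from the table.

```python
def fit_line(xs, ys):
    """Ordinary least-squares straight line; returns (slope, intercept)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


# Upper limits of hydrogen (% vol. H2) against % NO2 added, from table 2.
no2_percent = [0.0, 0.2, 0.5, 0.9]
upper_limit = [74.6, 73.7, 72.9, 71.7]

slope, intercept = fit_line(no2_percent, upper_limit)
residuals = [y - (intercept + slope * x) for x, y in zip(no2_percent, upper_limit)]
print(round(slope, 2), round(intercept, 2))
print([round(r, 2) for r in residuals])
```

On these figures every point lies within the quoted ±0.3 of the fitted line, which is consistent with a linear lowering of the u.l.; the corresponding l.l. values (4.19, 4.15, 4.10, 4.18) show no such trend.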

(ii) Description of the appearance of the flames 

The l.l. flames without additives were barely visible and pale blue in colour.
The flames towards the upper limit were considerably faster. They were also
slightly more luminous. The appearance of the flames was not altered radically
by the additives (except methyl iodide), but colour changes were observed and the
flames were more luminous.

Ethyl nitrate and nitrogen peroxide imparted the usual yellow-green coloration
to the flames towards the l.l., but those near the u.l. consisted of a blue flame front
with the addition of a grey-green tail: the absence of oxygen atoms in the flame
of the rich mixture is indicated, cf. Gaydon (1944). It was particularly noticeable
with ethyl nitrate that the l.l. flames no longer burned in ‘pockets’. With a decrease
of hydrogen content from 4.0 to 3.7 % approximately (+0.5 % ethyl nitrate), the
size of the flame decreased proportionally from almost filling the diameter of the
tube to a small hemispherical flame, about J in. in diameter, rising up the centre 
of the tube. The limit for indefinite propagation was rather difficult to judge; 
3.92 % was decided as the likely minimum percentage. A vibratory motion of the
flame attended nitrogen peroxide additions to u.l. mixtures. 

The change of flame character was most pronounced in the case of methyl 
iodide. Above 5 % hydrogen (+0.5 % methyl iodide) a coherent flame front was
observed consisting of multicoloured streaks and elongated pockets of flame, the 
predominant colours being orange and green. These became individual pockets and 
diminished in size and number with a decrease in hydrogen concentration until 
the limit was reached, when they began to burn independently throughout the gas.
The flames near the u.l. in the presence of methyl iodide (the limit was lowered 
considerably) were noisy owing to the resonance of the rapid vibrations of the flame 
front, and consisted of a blue flame front with a long conical orange tail. A long 
tail can be either indicative of a slow after-burning or of a slow cooling of radiating 
products. Hydrogen iodide and iodine were liberated in both the flames. 

(iii) Limits of nitrous oxide-hydrogen mixtures 

Replacement of the oxygen in the air by an equal volume of nitrous oxide was
found to have no effect on the position of the l.l., although the u.l. was reduced far
beyond the point expected as a result of halving the oxygen content of the system
(i.e. 1 mol. N₂O replacing 1 mol. O₂). Doubling the N₂O percentage only brought
the u.l. to 65.8 % hydrogen and narrowed the l.l. to a small extent.


Table 3

                                          l.l.      u.l.
hydrogen and air                          4.19      74.6
hydrogen and air [21% N2O + 79% N2]       4.19      29.0
hydrogen and air [42% N2O + 58% N2]       4.38      65.8



The l.l. flames were very similar in appearance to the normal hydrogen flames.
The flames towards the u.l. had an unusual and very definite structure. The
hemispherical front was blue and followed by a conical orange glowing tail. They were
very much slower than the normal u.l. hydrogen flames. 

(e) Methane-air 

The study of the combustion of limit mixtures commenced in Davy's time with
measurements made on methane mixtures; ever since then numerous measurements
have been made on such mixtures, but the results were very discrepant until
Coward established the conditions necessary for concordance. The mean of the
best values in the literature for the limits in a 5 cm. tube for upward propagation
is 5.3 to 14.3. (Coward & Brinsley (1914) obtained 5.32 for the l.l. in a 10 in.
cylindrical tube 7 ft. long.)

(i) Results 

The effect of additives was similar to that obtained with the hydrogen-air
system with two exceptions: (a) nitrogen peroxide raised the u.l. from 14.3 to 14.9%,
whereas the u.l. of hydrogen was progressively lowered by increasing NO2 additions;
(b) methyl iodide raised the l.l. from 5.26 to 6.29%, whereas the hydrogen l.l. was
unaffected by methyl iodide. The results are given in table 4.

Table 4

promoter                      l.l.       u.l.
methane alone*                5.26       14.3
(C2H5)2O2      (0.5%)         4.16       13.9
(C2H5)ONO2     (0.5%)         4.47       14.1
NO2            (0.5%)         4.93       14.9
O3             (0.4%)         5.07       14.5
(C2H5)2O       (0.5%)         4.02       13.4
C2H5OH         (0.6%)         4.54       -
CH3I           (0.5%)         6.29       12.3
limits of error               ±0.04      ±0.1

* Analysis: 99% CH4, remainder air.


(ii) Description of the appearance of the flames 

The normal methane flames towards the l.l. were blue hemispherical shells, not
very definitely shaped. They did not quite touch the walls of the vessel and no tail 
was visible. 

The u.l. flames were longer and more definite in shape. A faint dark-red glowing 
tail, fairly short, was visible. As the mixture strength approached the theoretical 
value for complete combustion, the flames acquired the usual green tinge, indicating 
Swan bands, and the tail became more obvious. 

One striking difference was noted in the character of the l.l. flames of methane
with ethyl nitrate or nitrogen peroxide. As the l.l. was approached, the flame no
longer assumed its usual form, but became more diffuse and burnt in a long column
which progressed up the tube. The absence of a definite flame front was conspicuous.
This occurred to a lesser degree at the u.l. The characteristic emission from the
reaction $\mathrm{NO + O \rightarrow NO_2} + h\nu$ was apparent in both instances. No secondary reaction
emitting visible light was noticed behind the flame fronts of methane or hydrogen
flames in the presence of ethyl nitrate or nitrogen peroxide, whereas with propane
and butane this was plainly visible. If the glow is due to hot NO2, the flame
temperatures of limit mixtures of both methane and hydrogen must be considerably
less than those of propane and butane. In the presence of methyl iodide the l.l.
flames consisted of a blue-green flame front followed by a long conical orange glow.


(f) n-Butane-air

n-Butane was next chosen for study, being typical of the higher normal
hydrocarbons in its combustion behaviour. For 5 cm. tubes, the limits for upward
propagation (1.86 to 8.41) were given by Coward, Jones, Dunkel & Hess (1926).

A sample of pure n-butane was obtained by fractionation of a commercial material
in a specially constructed column. The n-butane boiling from 0 to 1.5° C was
collected and stored in a small cylinder.


(i) Results 

The limits of combustion of the n-butane with air in the 5.3 cm. tube were found
to be 1.93 to 9.05% (upward propagation).


Table 5

% vol. butane

promoter                      l.l.       u.l.
n-butane alone                1.93       9.05
(C2H5)2O2      (0.5%)         1.45       9.18
C2H5ONO2       (0.5%)         1.62       12.3
NO2            (0.5%)         1.96       9.75
H3CNO2         (0.5%)         1.82       9.86
(C2H5)2O       (0.5%)         1.46       8.88
CH3CHO         (0.5%)         1.73       9.06
CH3I           (0.5%)         2.11       6.20
B(OC2H5)3      (0.5%)         1.47       8.6
Pb(C2H5)4      (trace)        2.00       8.87
Ph.NHCH3       (trace)        1.87       8.91
limits of error               ±0.03      ±0.05


Diethyl peroxide depressed the l.l., but not more than would be expected from the
addition of extra combustible (see later). The u.l. was raised more than is obvious
since the equivalent u.l. of the mixture reckoned as butane would have been 9.63.

Ethyl nitrate did not lower the l.l. quite as much as diethyl peroxide but the rise
in the u.l. was considerably greater. This effect does not appear to be due to nitrogen
peroxide liberated, because addition of the latter has only a small effect on the u.l.
and hardly any on the l.l.




Nitromethane (which probably dissociates into CH3 and NO2) has much less
effect than ethyl nitrate, but a rather greater effect on the u.l. than nitrogen 
peroxide when allowance is made for the addition of combustible. 

Ether has about the same effect as ethyl peroxide on the l.l., but an effect in the
opposite direction on the u.l. 

Acetaldehyde had only a small effect. 

Ethyl borate had no real promoting effects (see p. 188). 

Lead tetraethyl could only be introduced as a trace (about 0.0003%) since its
v.p. at 17° C is only 0.216 mm., but both limits were slightly inhibited.

Summarizing the apparent effects of these additions on the combustion limits
of butane recorded in table 5, ethyl peroxide and ethyl nitrate widen the
combustion range at both limits, ether at the l.l. only, NO2 and nitromethane at the
u.l. only, while methyl iodide narrowed both, particularly the u.l. However, as
will be shown later, allowing for the thermal effect of the additives, catalytic
influences seem to be confined to the u.l.

(ii) Description of the appearance of the flames 

The butane flames towards the l.l. were blue hemispherical shells filling the
diameter of the tube completely. (Methane and hydrogen l.l. flames did not reach
the walls of the tube.) The u.l. flames were of quite a different character being 
longer, less luminous and not touching the walls. Slightly less rich u.l. mixtures 
showed in the centre part of the flame yellow-white luminosity due to incandescent 
carbon particles. 

Diethyl peroxide did not change the l.l. flames near the limit, but mixtures richer
than 1.68% easily acquired a vibratory character ending in mild explosion. In
spite of the greater heat of combustion of butane itself, an equal volume of butane
did not give rise to such a fierce burning mixture. A dark reddish tail to the flame
was noticed in mixtures just above the l.l., but this showed itself even when diethyl
peroxide was not present. The u.l. mixtures with diethyl peroxide were not visibly 
different from those without the addition. 

Ethyl nitrate gave bright cream coloured l.l. flames tinged pale green at the rim
touching the tube walls and followed by an orange glowing tail. The u.l. flames were
increased in length; the definite flame front was lost and a column of flame moved
up the centre part of the tube. Mixtures of about 10.4% butane with 0.5% ethyl
nitrate were indistinguishable from the butane flames alone; the absence of the
characteristic NO2 coloration was conspicuous. As the percentage of butane was
increased, the carbon luminosity in the centre decreased and a faint NO2 coloration
could be observed. Oxygen atoms cannot therefore be playing an important role
in this u.l. combustion. These weak flames continued to burn in mixtures beyond
the normal u.l. in spite of the increased fuel concentration due to the ethyl nitrate.

The change in flame character produced by nitrogen peroxide and nitromethane
was the same as with ethyl nitrate and so this may be attributed entirely to NO2;
no NO oxidation coloration was observed in the u.l. flames, but only at the l.l.



The addition of ether and acetaldehyde did not change the appearance of the flames.

The addition of methyl iodide gave a creamy blue l.l. flame front with a long deep
orange tail. The u.l. flames were unusual in shape, having a green tinged front with
a blue rim tailing off into a reddish glow at the base of the flame.

Ethyl borate caused the l.l. flames to be tinged an intense green; they were otherwise
like those to which methyl iodide had been added. The u.l. flames were unchanged
by the addition.

Lead tetraethyl coloured the l.l. flames grey-blue and provided a long greenish
white tail. The u.l. flames were again unchanged in appearance. 

(g) Lower limits of ethyl nitrate and ethyl nitrite

The l.l. mixtures of these substances by themselves, as quoted from the literature,
are discrepant:

Lower limit of ethyl nitrate 3.8% (Le Chatelier & Boudouard 1898).

Lower limit of ethyl nitrite 3.01% (A. G. White 1922).

The l.l. of these two substances were therefore redetermined and found to be:

4.00% (±0.03) for ethyl nitrate (b.p. 87° C).

4.11% (±0.03) for ethyl nitrite (b.p. 17 to 18° C).

The quite small difference found in the limits is compatible with their minor 
structural differences. 

The flames of both these compounds were similar in appearance to those of lower 
limit butane mixtures containing ethyl nitrate but the colorations were somewhat 
more intense. 

3. Interpretation of the experimental results 

The quotation of limits, in terms of percentage by volume of the fuel only, does
not truly reveal the catalytic influences that may be operative because of the
contribution made by some of the additives to the total fuel content. The
effectiveness of a non-combustible promoter is immediately apparent from the limit figures,
but if it is itself inflammable, then any combustion promoting or inhibiting
influences are masked. At the l.l. all positive effects with combustible promoters
would be confused, and at the u.l. all but the more powerful effects, where the
limit is raised in spite of the increase in fuel concentration, would be concealed.
For instance, the u.l. in presence of the promoter may be lower than the normal
limit, yet a higher percentage of total combustible may be present. A basis for
comparison of the mixtures is required in which the two factors (catalysis and
change of fuel concentration) may be distinguished.

The usual basis for comparison is to establish whether the mixture obeys
Le Chatelier's rule that 'all mixtures of limit mixtures are limit mixtures'. This
rule has been put to the test over a wide variety of substances. Le Chatelier &
Boudouard (1898) showed that it held for mixtures of acetylene and carbon
monoxide which have very different combustion characteristics. Coward, Carpenter
& Payman (1919) using upward propagation showed that it held approximately
for upper and lower limit mixtures of hydrogen, carbon monoxide and methane.
White (1922) made an extensive investigation and showed that the rule held for
many vapour mixtures at both limits, but not always for upward propagation at
the u.l.; also when substances present in the mixture gave rise to cool flames, then
the rule did not hold.
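
As an illustration only, Le Chatelier's rule may be put in its usual arithmetical form: if a mixture contains c_i per cent of each combustible whose limit alone is L_i per cent, the mixture is a limit mixture when the sum of c_i/L_i equals 1. The sketch below assumes that form (the paper states the rule in words only); the function names are hypothetical, and the two lower limits are those quoted in tables 3 and 4.

```python
def le_chatelier_limit(fractions, limits):
    """Limit (% by volume of total fuel in air) of a blend of combustibles.

    fractions: share of each combustible in the fuel blend; must sum to 1.
    limits:    limit of each combustible alone, % by volume
               (all lower limits, or all upper limits).
    """
    if len(fractions) != len(limits):
        raise ValueError("fractions and limits must have the same length")
    if abs(sum(fractions) - 1.0) > 1e-9:
        raise ValueError("fractions must sum to 1")
    return 1.0 / sum(f / lim for f, lim in zip(fractions, limits))


def limit_index(percentages, limits):
    """Sum of c_i / L_i for a mixture; the rule predicts 1 at the limit."""
    return sum(c / lim for c, lim in zip(percentages, limits))


# Lower limits for upward propagation quoted in tables 3 and 4.
LL_HYDROGEN, LL_METHANE = 4.19, 5.26

# Equal volumes of the two limit mixtures: half of each fuel percentage.
blend = [0.5 * LL_HYDROGEN, 0.5 * LL_METHANE]
print(limit_index(blend, [LL_HYDROGEN, LL_METHANE]))
print(round(le_chatelier_limit([0.5, 0.5], [LL_HYDROGEN, LL_METHANE]), 2))
```

The first figure printed shows that a mixture of two limit mixtures again satisfies the condition, which is the rule as quoted above; the second is the predicted lower limit of an equimolar hydrogen-methane fuel, a value that is not measured in this work.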

It is rather remarkable that the rule is so comprehensive: it seems that the
combustion mechanism belonging to a substance which influences the slow
combustion or ignition behaviour of another combustible substance, must be having
very little effect on limit mixtures of such substances. Le Chatelier & Boudouard
(1898), Burgess & Wheeler (1911) and particularly White (1924) have all drawn
attention to the fact that the combustion heat per mol. of limit mixture is nearly
constant for many different mixtures, and this fact is the significant one in seeking
an explanation of the wide application of Le Chatelier's rule.

It would have been possible to test the application of Le Chatelier’s rule in the 
case of the additives here used, but the quantities added were small and the limits 
of the individual additives were not always available. It was, therefore, preferable 
to find the change in the total combustion heat due to the presence of the
compound added. The total heat of combustion of a mixture, although it may not be
fully operative, is the main factor controlling the limits and is taken as a measure 
of the amount of fuel present in any mixture and provides a means for comparison. 
The two limits will be dealt with separately. 
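
The total combustion heat of a mixture can be sketched as the sum, over the combustible constituents, of volume fraction times net heat of combustion. The net heats below are assumed round values in kcal/mol and are not quoted in the paper; the function name and dictionary are likewise hypothetical.

```python
# Net (lower) heats of combustion, kcal/mol. Assumed values for illustration;
# the paper does not list the figures it used.
NET_HEAT = {"hydrogen": 57.8, "methane": 191.8, "n-butane": 635.0}


def calorific_value(composition):
    """Net calorific value in kcal per mol of mixture.

    composition: {constituent: % by volume} for the combustible constituents;
    air and other non-combustibles contribute nothing and are omitted.
    """
    return sum(pct / 100.0 * NET_HEAT[name] for name, pct in composition.items())


# Lower limit mixtures of the fuels alone (tables 3 and 5).
print(round(calorific_value({"hydrogen": 4.19}), 2))
print(round(calorific_value({"n-butane": 1.93}), 2))
```

With these assumed heats the hydrogen and butane lower limit mixtures come out close to the 2.42 and 12.26 kcal./mol. entered in table 6. A combustible additive is handled by adding its own percentage and net heat to the dictionaries.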

(i) Lower limit mixtures 

Table 6 gives the calorific values of all the l.l. mixtures investigated in this work;
a lower figure for the complex mixture than for the fuel alone indicates a promoting 
influence on the part of the additive. 

Table 6. Lower limit mixtures

                     net calorific value                          net calorific value
l.l. mixture*           (kcal./mol.)         l.l. mixture*           (kcal./mol.)
hydrogen                    2.42             propagas                   11.20
  + (C2H5)2O2               5.38               + (C2H5)2O2              11.66
  + C2H5ONO2                3.80               + C2H5ONO2               11.30
  + NO2                     2.40               + NO2                    10.67
  + O3                      2.42               + (C2H5)2O               12.02
  + CH3I                    3.46                                        11.75
                                               + CH3CHO                 10.77
methane                    10.04             butane                     12.26
  + (C2H5)2O2              11.07               + (C2H5)2O2              12.27
  + C2H5ONO2               10.00               + C2H5ONO2               11.77
  + NO2                     9.42               + NO2                    12.44
  + O3                      9.86               + (C2H5)2O               12.32
  + (C2H5)2O               10.86               + H3C.NO2                12.37
  + C2H5OH                 10.30               + CH3CHO                 12.20
  + CH3I                   12.92                                        14.32

* Composition given in previous tables.




Figure 2 is obtained by plotting the values of the heat of combustion of the limit
mixtures against the reciprocals of the l.l. and drawing as a reference line a curve
through 00 and the now fairly well established values of the normal hydrocarbons
up to n-butane (see part II, pp. 193, 202). The graph gives a means of testing the
influence of any promoters or inhibitors, because the 0.5% additions would have
no appreciable effect on the physical properties of the mixture, and any departure
outside the experimental error from the line drawn through the values for the
hydrocarbons would indicate a specific effect on the reaction mechanism. Those
mixtures corresponding to points which lie to the right of the line may be considered
as burning with greater readiness than those to the left, since they propagate
flame in mixtures with less heat available.

Figure 2. Heat of combustion of the limit mixtures plotted against (lower limit)^-1. PRO = propagas, □ = figures from literature.




The graph reveals that there are no important catalytic effects at the l.l. Some
minor positive effects are apparent, notably with nitrogen peroxide, but some of 
the additives, in spite of their positive influences on ignition and knock, tend to 
inhibit slightly rather than promote. This was contrary to expectation, since the 
presence of potentially active centres from organic peroxides, nitrates, etc. in the 
heated gas ahead of the flame might have been expected to increase the number of 
chains started and facilitate combustion. The negative effects are not large except
with methyl iodide and all points lie quite close to the reference curve. It appears 
therefore that the principal factor governing flame propagation towards the lower 
limit of inflammability is the heat available in the mixture. 
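
A rough check of this reading can be made directly from table 6 by subtracting the calorific value of the fuel-alone limit mixture from that of each mixture containing an additive. The sketch below does this for methane; it compares with the fuel alone and not with the reference curve of figure 2, so it indicates only the sign and rough size of each effect. The helper name is hypothetical.

```python
# Net calorific values (kcal./mol. of mixture) of methane l.l. mixtures, table 6.
METHANE_ALONE = 10.04
WITH_ADDITIVE = {
    "(C2H5)2O2": 11.07,
    "C2H5ONO2": 10.00,
    "NO2": 9.42,
    "O3": 9.86,
    "(C2H5)2O": 10.86,
    "C2H5OH": 10.30,
    "CH3I": 12.92,
}


def heat_shifts(fuel_alone, mixtures):
    """Heat of each limit mixture minus that of the fuel-alone limit mixture."""
    return {name: round(value - fuel_alone, 2) for name, value in mixtures.items()}


shifts = heat_shifts(METHANE_ALONE, WITH_ADDITIVE)
for name, shift in sorted(shifts.items(), key=lambda item: item[1]):
    if shift < 0:
        tendency = "propagates with less heat (promoting)"
    elif shift > 0:
        tendency = "needs more heat (inhibiting)"
    else:
        tendency = "no change"
    print(f"{name:10s} {shift:+.2f}  {tendency}")
```

On this crude basis nitrogen peroxide shows the largest negative shift and methyl iodide much the largest positive one, in line with the discussion of the methane mixtures that follows.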

The effect of ozone is also interpreted thermally. It is assumed to decompose
in the flames, which already contain excess oxygen and a high proportion of diluent
nitrogen, according to

$$\mathrm{O_3 \rightarrow 1.5\,O_2} + 34.5\ \text{kcal}.$$

The heats of combustion of the mixtures containing O3 were calculated on this
basis, and for the methane mixture the point lies directly on the reference curve.

Minor influences of the promoters on the l.l. as detailed in the following three
paragraphs are noticeable, although in the main there is little effect. 

In the methane mixtures the peroxide behaved very similarly to ether, both 
slightly inhibiting the reaction. Ethyl nitrate appeared to have a very slight 
promoting influence, but when it is considered that nitrogen peroxide alone had 
a greater effect, no positive activity could be attributed to the ester itself. 

With propagas as the fuel, the behaviour of ether and diethyl peroxide was no 
longer similar; whereas the ether seemed to inhibit the reaction slightly, the 
peroxide had but little effect. The ethyl nitrate again appeared to have but a slight 
positive influence, which was small in comparison with that of nitrogen peroxide. 

In the butane system only the ethyl nitrate had a small promoting influence. 
Nitrogen peroxide appeared to have a small opposite effect here, so the activity
of the ethyl nitrate cannot in this instance be attributed to this part of the 
molecule. 

The figures obtained with hydrogen mixtures are not included on the graph. 
The behaviour of hydrogen in upward propagation is quite exceptional; the flame 
is propagated very readily. Goldman (1929) has explained the peculiar mode of
the upward flame propagation in l.l. hydrogen flames by showing that the
combustion is maintained in small rising regions of burning gas by diffusion of the
hydrogen, in preference to other gases, into those centres of activity; thus the flame
is maintained with quite a low percentage of hydrogen in the mixture. For
downward propagation, the limit is 9.0%, and this figure would be more in keeping
with the figures used for other gases, as there is only a small difference between
upward and downward limit compositions for the hydrocarbons. Nevertheless,
though nearer, this would also be somewhat off the curve (figure 2). The figures in
table 6, however, show that the effect of the additives is mainly thermal, and that no
promoting effects are obtained, except a very slight effect with nitrogen peroxide.
There is no inhibiting effect of methyl iodide on the l.l. of hydrogen, although it has
a marked effect on the appearance of the flame.

(ii) Upper limit mixtures 

There is much more uncertainty in ascertaining the real effect of the additives 
on the upper limits; although all the oxygen is consumed, it is not found in the 
products entirely as carbon dioxide and water. A rather crude estimate of the 
effect of promoters on u.l. mixtures may be made by finding the heat theoretically 
available from the mixtures, compared with that from the fuel-air mixtures without 
additives. A promoting influence is then detected by a figure smaller than that 
for the fuel alone, indicating that flame is propagated in the mixture although 
less heat is liberated per unit volume. 

The heat of reaction in kcal./mol. of the limit mixture is obtained assuming 
that all the oxygen in the very rich mixture is consumed. 

The fraction

$$\frac{(\%\ \mathrm{O_2}) \times (\text{net heat of combustion of fuel/mol.})}{(\text{no. of } \mathrm{O_2} \text{ mol. for complete combustion of 1 fuel mol.}) \times 100}$$

does not represent the actual amount of heat evolved, for all the oxygen is not
burnt to CO2 + H2O, but it can be used for comparison of effect when the quantity
of additive is small. The two columns of figures in table 7 give the available heat
estimated in two ways. The first is based upon the free oxygen content of the 
mixtures; the second column shows the heats allowing for the dissociation of the 
promoter to simple products stable in rich mixtures. Any oxygen liberated during 
the decomposition is considered to be consumed by the hydrocarbon. The amount 
of heat summated in this way gives a rough figure for comparison with other 
mixtures. The assumed decompositions are: 

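$$\begin{aligned}
\mathrm{(C_2H_5)_2O_2} &\rightarrow \mathrm{C_2H_5OH + CH_3CHO} + 33\ \text{kcal},\\
\mathrm{C_2H_5ONO_2} &\rightarrow \mathrm{CH_3CHO + \tfrac{1}{2}H_2O + NO + \tfrac{1}{4}O_2} - 15.8\ \text{kcal},\\
\mathrm{NO_2} &\rightarrow \mathrm{NO + \tfrac{1}{2}O_2} - 13.6\ \text{kcal},\\
\mathrm{CH_3NO_2} &\rightarrow \mathrm{\tfrac{1}{2}C_2H_6 + NO + \tfrac{1}{2}O_2} - 29.7\ \text{kcal}.
\end{aligned}$$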
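
The two estimates of available heat can be sketched for hydrogen with nitrogen peroxide as follows. Air is taken as 21% oxygen, as in the heading of table 3; the net heat of hydrogen (57.8 kcal/mol) and the heat absorbed when NO2 dissociates to NO and half a mol. of oxygen (13.6 kcal/mol) are assumed values, and the function names are hypothetical.

```python
O2_FRACTION_OF_AIR = 0.21   # air taken as 21% O2 + 79% N2, as in table 3
H2_NET_HEAT = 57.8          # kcal/mol; assumed, not quoted in the paper
H2_O2_DEMAND = 0.5          # mol O2 per mol H2 for complete combustion
NO2_DISSOCIATION = 13.6     # kcal/mol absorbed by NO2 -> NO + 1/2 O2; assumed


def heat_free_o2(fuel_pct, additive_pct, net_heat, o2_demand):
    """Available heat (kcal/mol of mixture), counting only the free oxygen."""
    o2_pct = (100.0 - fuel_pct - additive_pct) * O2_FRACTION_OF_AIR
    return o2_pct * net_heat / (o2_demand * 100.0)


def heat_with_no2_decomposed(fuel_pct, no2_pct, net_heat, o2_demand):
    """As above, but each NO2 gives 1/2 O2 and absorbs its dissociation heat."""
    o2_pct = (100.0 - fuel_pct - no2_pct) * O2_FRACTION_OF_AIR + 0.5 * no2_pct
    burnt = o2_pct * net_heat / (o2_demand * 100.0)
    return burnt - no2_pct / 100.0 * NO2_DISSOCIATION


# Hydrogen upper limit alone, and with 0.5% NO2 (mixtures listed in table 7).
print(round(heat_free_o2(74.6, 0.0, H2_NET_HEAT, H2_O2_DEMAND), 2))
print(round(heat_free_o2(72.9, 0.5, H2_NET_HEAT, H2_O2_DEMAND), 2))
print(round(heat_with_no2_decomposed(72.9, 0.5, H2_NET_HEAT, H2_O2_DEMAND), 2))
```

Under these assumptions the free-oxygen figures fall within a few hundredths of the 6.18 and 6.45 kcal./mol. entered for these mixtures in table 7. The other promoters could be treated in the same way by substituting the oxygen yield and heat of the corresponding decomposition.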

The upper limit mixtures of hydrogen have shown no instance of positive 
catalysis; in fact, all the additions appear to inhibit the reaction. Increasing 
additions of nitrogen peroxide progressively lower the limiting concentration, 
although considerably more heat than would be required to dissociate the nitrogen 
peroxide is available in the mixture. The absence of the characteristic emission of 

$$\mathrm{NO + O \rightarrow NO_2} + h\nu$$

appears to indicate that inhibition is not due to this reaction. The figures in 
table 7 show that the reduction of hydrogen in the rich mixture is greater than that 
required to compensate for the reduction in free oxygen content due to the additive. 



The effects of the promoters on the methane upper limit are small and indefinite.
The heat available in mixtures containing diethyl peroxide and ethyl nitrate is
less than normal if calculated on a free oxygen basis, but taking into account
decomposition of the promoter, a similar or greater amount of heat is required for
flame propagation in the mixture compared with the normal limit mixture. No
significant positive catalysis appears to be present. The small increases due to
nitrogen peroxide and ozone may be explained by the extra oxygen available from
the promoter.

Table 7

                                        net heat available (kcal./mol. of mixture)
                                                                 assuming
                                        on free O2               decomposition of
mixture                                 basis                    the promoter
hydrogen
  74.6%                                  6.18
  73.7 + 0.2% NO2                        6.31                     6.42
  72.9 + 0.5% NO2                        6.45                     6.68
  71.7 + 0.9% NO2                        6.65                     7.00
  68.5 + 0.5% (C2H5)2O2                  7.53                     7.69
  70.9 + 0.5% (C2H5)ONO2                 6.95                     7.00
  61.3 + 0.5% CH3I                       9.28                     -
methane
  14.3%                                 17.2
  13.9 + 0.5% (C2H5)2O2                 17.17                    17.34
  14.1 + 0.5% C2H5ONO2                  17.12                    17.10
  14.9 + 0.5% NO2                       16.96                    17.14
  14.5 + 0.4% O3                         -                       17.40
  13.4 + 0.5% (C2H5)2O                  17.20                     -
  12.3 + 0.5% CH3I                      17.5                      -
propagas
  9.0%                                  16.8
  9.7 + 0.6% (C2H5)2O2                  16.67                    16.98
  12.4 + 0.5% C2H5ONO2                  16.18                    16.21
  9.8 + 0.5% NO2                        16.08                    16.34
  9.2 + 0.5% (C2H5)2O                   16.78                     -
  9.3 + 0.5% C2H5OH                     16.76                     -
butane
  9.05%                                 18.60
  9.18 + 0.5% (C2H5)2O2                 18.52                    18.69
  12.3 + 0.5% C2H5ONO2                  18.10                    18.56
  9.76 + 0.5% NO2                       18.41                    18.6
  9.86 + 0.5% H3C.NO2                   18.39                    18.33
  8.88 + 0.5% (C2H5)2O                  18.59                     -
  9.06 + 0.5% CH3CHO                    18.53                     -
  6.20 + 0.5% CH3I                      19.16                     -
  8.5 + 0.5% B(OC2H5)3                  18.67                     -

The propagas and butane systems are almost identical in their response to the
promoters. Table 7 indicates that ethyl nitrate exerts a definite positive catalytic
influence on the u.l.'s, and that NO2 and nitromethane have a somewhat similar
effect. Diethyl peroxide and ether have no appreciable effect.

It is curious that ethyl nitrate appears to have an effect and the peroxide little,
if any. Its activity is probably not due to oxides of nitrogen, as NO2 has been
shown to have but a small promoting influence. The flames with the ethyl nitrate
and peroxide additions did not have the same appearance.

An attempt to find out whether the activity of ethyl nitrate was due to ethoxy 
radicals was made by testing the effect of additions of triethyl borate. There was 
a lowering of the u.l., indicating inhibition if anything, but this might have been
due to the boron-containing radicals overcompensating for any effects of the ethoxy 
radicals. 

The observations of White & Price (1919) are relevant; they investigated the 
effect of diethyl peroxide and ethyl hydrogen peroxide on the limits of ether-air 
mixtures, and found that the l.l. was unaffected, but the u.l. (for downward
propagation) was slightly raised when a considerable quantity of the peroxides 
was added. (The peroxides were counted as ether in the mixtures.) 

Although large positive catalytic effects observed in this work have been limited 
to the effect of ethyl nitrate on the u.l. of the higher hydrocarbons, methyl iodide 
acted as an inhibitor with all the fuels. Both upper and lower limits were affected, 
except the hydrogen lower limit which, in terms of hydrogen, remained unchanged. 
The iodide was dissociated and no doubt the combustible part of the molecule was 
burnt, so even here the l.l. mixture was richer in fuel than usual. The inhibition
is not entirely explained on the grounds that it abstracts heat from the system for
its dissociation, or by its high specific heat. If this were so, the difference in the
heat available in the limit mixtures with and without methyl iodide would be
constant and independent of the fuel; table 7 shows that this is not so. The
inhibiting action of methyl iodide is generally due to the removal of chain carriers
such as alkyl radicals by the halogen which is the active part of the molecule.
Flame speeds remain slow as the theoretical mixture is approached if methyl
iodide is present, but the visible radiation increases in intensity and the length of
the 'tail' radiating after passage of the flame front also increases. West & Miller
(1940) have found evidence for a resonance transfer of energy from fluorescent
hydrocarbons in solution to alkyl iodides, which suggests a similar deactivation
in the gas phase of energy-rich hydrocarbon molecules essential to chain
propagation.
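
The argument that a purely thermal action of methyl iodide would give the same excess heat with every fuel can be checked in a few lines from the free-oxygen column of table 7. This is only a sketch of that comparison; the dictionary and function names are hypothetical.

```python
# Net heat available at the upper limit on the free-oxygen basis
# (kcal./mol. of mixture), table 7: (fuel alone, fuel + 0.5% CH3I).
UPPER_LIMIT_HEAT = {
    "hydrogen": (6.18, 9.28),
    "methane": (17.2, 17.5),
    "butane": (18.60, 19.16),
}


def methyl_iodide_excess(table):
    """Extra heat required at the limit when methyl iodide is present."""
    return {
        fuel: round(with_iodide - alone, 2)
        for fuel, (alone, with_iodide) in table.items()
    }


excess = methyl_iodide_excess(UPPER_LIMIT_HEAT)
for fuel, value in excess.items():
    print(f"{fuel:9s} {value:+.2f} kcal./mol.")
print("spread:", round(max(excess.values()) - min(excess.values()), 2))
```

The excess differs by a factor of about ten between hydrogen and methane, so a constant heat abstraction by the iodide cannot account for the inhibition, which is the point made in the text.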

Although such promoting effects as have been found on lower and upper limits 
have been small, the response to additives has tended to increase in the direction 
of the higher hydrocarbons. 

(iii) Discussion of the appearance of the flames 

The characteristics of the flames have been described carefully in §2. 

All l.l. flames for upward propagation were of the same structure, consisting
solely of thin blue hemispherical shells which remained unchanged by any of the
additives except methyl iodide and those compounds giving rise to nitrogen
peroxide. The stability and tenacity of shape of the l.l. flames increased from
hydrogen to butane.

It is clear that with hydrogen and methane when there is excess of oxygen, the 
reactions in the flame are not continued far behind the flame front, and that there 
is appreciable heat loss in the neighbourhood of the walls. With propane and butane, 
the flames are hotter, so there is no gap and the products of the additives are 
excited by the hotter products behind the flame. 

The u.l. flames showed much more variation. Hydrogen flames were almost 
flat, and frequently vibratory. They completely filled the tube. Methane u.l. 
flames showed a tendency to elongate, the blue flame front becoming a paraboloid, 
followed by a short reddish conical tail. Propane and butane flames were quite 
unlike the l.l. flames; they were very long and the flame front was only faintly
luminous, although the centre of the flames, especially just beneath the dome of
the flame, showed the smoky yellow luminosity of incandescent carbon. The
structures of the u.l. flames were on the whole more sensitive to the additives than
the l.l. flames. It is significant that although there may be no influence on the
limits, the presence of the additive often alters the flame characteristics (e.g. ethyl
nitrate on hydrogen flames, see p. 178). The relation between the upper and lower
limit flames is considered in part II. 

4. Conclusion 

It has been found that the presence of promoters in the gas ahead of the flame 
has little specific influence on the propagation limits. It is therefore probable 
that the main criterion for inflammation is the maintenance of a sufficient rate of 
reaction and heat release in the flame, providing thereby sufficient active radicals 
to inflame the entering gases. The appearance of the upper and lower limit flames
shows that, after inflammation occurs, the additives may influence the flame
characteristics, particularly in the upper limit region. Only with methyl iodide 
(inhibitor) and ethyl nitrate (promoter) was there any indication of action prior 
to entry and even in those cases the action may be due to an influence on the rate 
of reaction in the flame and therefore on the flame boundary temperature. The 
conclusions of this work are in agreement with earlier work, particularly that of 
White (loc. cit.) who showed that there was no relation between limits for
propagation of flame and ignition temperature as ordinarily determined. The particular
reaction mechanisms determining ignition seem to have little connexion with the 
power to propagate flame. 

Since the heat of combustion is the main factor determining the composition of
the limit mixture, it was of importance to study the physical properties of limit
mixtures; an extension of the investigation in this direction is described in part II,
at the end of which acknowledgements are made.




References 

Burgess, M. J. & Wheeler, R. V. 1911 J. Chem. Soc. p. 2013.

Coward, H. F. & Brinsley, F. 1914 J. Chem. Soc. p. 1869.

Coward, H. F., Carpenter, C. W. & Payman, W. 1919 J. Chem. Soc. p. 27.

Coward, H. F. & Jones, G. W. 1928 Bull. U.S. Bur. Min. 279 (revised and enlarged 1930
and 1938).

Coward, H. F., Jones, G. W., Dunkel, C. G. & Hess, B. E. 1926 Min. Met. Invest. Bull. 30,
Carnegie Inst. Tech.

Gaydon, A. G. 1944 Proc. Roy. Soc. A, 183, 111.

Goldman, F. 1929 Z. phys. Chem. B, 5, 307.

Le Chatelier, H. & Boudouard, O. 1898 C.R. Acad. Sci., Paris, 126, 1344.

Lewis, B. & von Elbe, G. 1937 J. Amer. Chem. Soc. 59, 2026.

Townend, D. T. A. 1939 Trans. Inst. Gas Engrs Comm. 221.

West, W. & Miller, W. E. 1940 J. Chem. Phys. 8, 849.

White, A. G. 1922 J. Chem. Soc. pp. 1244, 2661.

White, A. G. 1924 J. Chem. Soc. p. 2387.

White, A. G. & Price, T. W. 1919 J. Chem. Soc. p. 1462.


The limits of flame propagation at atmospheric pressure 
II. The influence of changes in the physical properties 
By Sir Alfred Egerton, Sec.R.S. and J. Powling, Ph.D.

(Received 7 November 1947. Read 11 March 1948)

The effect of replacing the nitrogen of the air by other diluents, such as CO2, A and He, on the
limits of propagation of flame in hydrogen and methane mixtures has been determined.

The results indicate that the flame is only propagated if a certain minimum temperature can
be maintained in the flame boundary, which is approximately constant for a given
combustible. The specific heat of the mixture and heat of combustion of the combustible
constituents determine whether the flame is propagated; differences of thermal conductivity are
only great enough in hydrogen mixtures or in mixtures containing helium to have a
pronounced influence.

It is possible to estimate the upper limits in oxygen from the value of the lower limit in
air for mixtures which are completely burnt, but with the hydrocarbons which are not
completely burnt to CO, such estimate cannot easily be made. The lower limits for n-pentane,
iso-pentane, n-hexane, n-heptane, n-octane and 2,2,4-trimethyl pentane have been
carefully redetermined. The heat of combustion multiplied by the percentage of hydrocarbons
in the lower limit mixture is not quite constant, but more heat is required to maintain the
combustion of the higher hydrocarbons.

Other evidence is mentioned in support of the view that flame limits are determined by a 
certain minimum temperature representing a certain minimum reaction rate at the flame 
boundary, the flame being maintained, not by the pre-heating and consequent reaction of the 
mixture, but chiefly by radicals provided by the flame. 

1 . Introduction 

It is obviously an important problem in combustion chemistry to ascertain the
particular properties of a gaseous mixture which determine whether or not it will
propagate flame. Le Chatelier & Boudouard (1898) found that the heat of
combustion per unit volume of limit mixtures with air for a number of different gases
was approximately a constant, an observation which Burgess & Wheeler (1911),
White and others have confirmed, but there are many exceptions and the values for
upper and lower limits are dissimilar. There seems to have been little further success
in elucidating the problem and in predicting whether a given mixture will
propagate flame. In order to develop the conclusions of the previous paper, the influence
of the physical properties of the gases comprising the limit mixtures has been further
investigated.

In using the determinations of limit mixtures which are to be found in the 
literature, it is necessary to eliminate those which are not made under comparable 
conditions, so the guidance they give as to the factors which govern whether flame 
is propagated or not is rather restricted. 

In spite of a considerable number of investigations on the effect of different inert
gases on the limit mixtures, it was found necessary to carry out a few additional
measurements. Coward & Hartwell (1926) have made a fairly extensive study of the
extinction of methane-air flames by carbon dioxide, nitrogen, helium and argon.
Measurements have been carried out by others in tubes of rather smaller diameter,
wherein the cooling and deactivation at the walls became important factors.
Consequently the agreement in the literature between the limits obtained with a given
amount of diluent is unsatisfactory and even the relative order of extinguishing
power of the various diluents does not agree. Coward & Hartwell (loc. cit.), in
experiments with methane in 5 cm. tubes, found the order of extinguishing power for
nitrogen, carbon dioxide, helium and argon to be CO2 > N2 > He > A, whereas
Posthumus (1930) found the order He > CO2 > N2 > A. Posthumus, however, used 1.6 cm.
tubes, and the system studied was H2-N2O. Booy (1932) found a similar order
with methane-air mixtures in 1.7 cm. tubes. Heiningen (1936), however, with
H2-, CH4-, C4H10-air systems in 2.2 cm. tubes, arrived at the order CO2 > He > N2 > A.
All the above workers agree upon the order CO2 > N2 > A: it is the position of He
that varies. Therefore, the effects of CO2, He and A upon the limits of inflammability
of methane and hydrogen were redetermined. Measurements of the lower limits
(l.l.'s) of hydrocarbons have also been made.

2. Experimental 

It is useless to compare one propagation limit with another unless the
determinations are carried out under similar conditions and unless the diameter of the
tube is sufficient to eliminate any differences due to effects at the walls. The influence
of convexion and flame shape on the results obtained experimentally cannot be
eliminated entirely, but for any given combustible the results will be sufficiently
comparable under the chosen conditions (except for helium as diluent, see p. 201)
(cf. Coward & Hartwell 1932).

(a) Limit mixtures with various diluents 

The same apparatus and procedure as described in part I were employed. The
diluent gases were introduced into the tube with their ratio by volume to oxygen
equal to the nitrogen plus argon ratio to oxygen in air. Upward propagation was
used as before (except for some experiments with hydrogen mixtures where
downward propagation was employed).

The lower limit flames for upward propagation in hydrogen mixtures with argon
and helium as diluents were barely visible. They were detected at the top of the
tube by a thermocouple of thin copper-constantan connected to a galvanometer.
With experience it was possible to correlate the size of flame (as distinct from
individual pockets of flame) with the kick of the galvanometer, and to determine the
true limit mixture.

The limits found are given in table 1. In the experiments with methane the gases 
in the flame tube were analyzed after the passage of the flame, using a Bone and 
Newitt gas analysis apparatus (see table 2). 

Table 1

atmosphere                    l.l.        u.l.

Upward propagation

  hydrogen
    He + 21% O2               7.72        75.7
    CO2 + 21% O2              5.31        69.8
    N2 + 21% O2 (air)         4.19        74.6
    A + 21% O2                3.17        76.4
    limits of error          ±0.03        ±0.3

  methane
    CO2 + 21% O2              9.0          9.0
    N2 + 21% O2 (air)         5.26        14.3
    He + 21% O2               4.83        16.1
    A + 21% O2                4.01        17.3
    limits of error          ±0.03        ±0.1

Downward propagation

  hydrogen
    CO2 + 21% O2             13.1          —
    N2 + 21% O2 (air)         9.0          —
    He + 21% O2               8.7          —
    A + 21% O2                7.0          —
    limits of error          ±0.05         —

Table 2. Gas analysis of combustion products of methane
mixtures in per cent. by volume

        diluent    CH4     CO2     CO      H2      O2      H2O     inert

l.l.    A          0.4     3.1     0.7     0.2     13.6     7.4    74.6
l.l.    He         0.3     3.3     0       1.0     12.0     5.8    77.6
l.l.    N2         0.5     4.2     0.2     0       10.5     8.8    76.8

        CO2        0.4     8.6*    0.5     0.8      3.0    13.7    75.0†

u.l.    N2         1.2     4.2     6.7     7.0      0.9    14.7    68.3
u.l.    He         2.4     2.7     9.0     9.8      1.6    13.3    61.2
u.l.    A          2.6     2.5     9.1    12.0      1.4    11.1    61.3

* CO2 as reaction product.    † CO2 as diluent.



For discussion of these results and influence of tube diameter see p. 198. The
relative order of extinguishing power of the diluents in upward propagating methane
mixtures and downward propagating hydrogen is CO2 > N2 > He > A.
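
The order quoted above can be read directly off the lower limits of table 1. The short sketch below is only an illustration, and it rests on one assumption that is ours rather than the authors': a diluent is ranked as the stronger extinguisher when a larger percentage of combustible is needed at the lower limit. The dictionary and function names are hypothetical.

```python
# Lower limits (per cent. combustible by volume) transcribed from table 1.
LOWER_LIMITS = {
    "methane, upward": {"CO2": 9.0, "N2": 5.26, "He": 4.83, "A": 4.01},
    "hydrogen, downward": {"CO2": 13.1, "N2": 9.0, "He": 8.7, "A": 7.0},
}


def extinguishing_order(limits):
    """Sort diluents from highest to lowest lower limit.

    Assumption: a higher lower limit is taken to mean a greater
    extinguishing power of the diluent.
    """
    return sorted(limits, key=limits.get, reverse=True)


for system, limits in LOWER_LIMITS.items():
    print(system + ": " + " > ".join(extinguishing_order(limits)))
```

Both systems give the same sequence, with helium falling between nitrogen and argon in the 5 cm. tube.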

For downward propagation, the flames were not hemispherical but flat and rather 
ragged, tending to set up vibratory effects. Upper limit flames were thin and faster 
in argon and helium mixtures. The flames with carbon dioxide were similar to 
those with nitrogen. With methane, those in argon and helium were thinner and 
rather faster than they were in nitrogen and carbon dioxide. 


(b) Hydrocarbon-air mixtures

The values recorded in the literature for the limits of propagation of flame in
mixtures of the higher hydrocarbons with air are not in good agreement. The
proportion of combustible in the l.l. mixture is quite small and the vapour pressure
low, rendering measurements open to considerable error. While it has been shown
(part I, p. 184) that a graph of the heats of combustion against the reciprocals of
the l.l.'s for the lower members of the paraffin series up to butane gives a smooth
curve, the values for the higher members as taken from the literature show no such
agreement. Careful determinations of the limits for upward propagation at room
temperature and atmospheric pressure have therefore been made on a few pure
hydrocarbons up to octane (see table 3). The complete curve obtained with these
results is shown in figure 1. The vapour pressures of the normal paraffins above
octane were insufficient to provide a l.l. mixture at room temperature. Pearson's
(1931) adaption of the Rayleigh manometer was used to measure accurately the
small pressures involved. Table 4 provides the determinations relating to the purity


Table 3

hydrocarbon        l.l.              other values

methane            5.26 ± 0.03       5.24 to 5.45
n-butane           1.93 ± 0.03       1.86
n-pentane          1.62 ± 0.01       1.3 to 1.5
iso-pentane        1.61 ± 0.01       —
n-hexane           1.40 ± 0.01       1.25 to 1.41
n-heptane          1.26 ± 0.01       —
n-octane           1.12 ± 0.01       —
iso-octane         1.15 ± 0.01       —


Table 4

substance          boiling point (°C)      refractive index

n-butane           0 to 1.5                —
n-pentane          36.0                    1.3577
iso-pentane        28.0                    —
n-hexane           67.7 to 68.0            1.3822
n-heptane          98.0 to 98.5            1.3877
n-octane           125.0 to 125.5          1.3976
iso-octane         98.5                    1.3915



of the hydrocarbons used. The range of other values from the literature for the l.l.'s
(only those determined under similar conditions) are also given in table 3 for
comparison. The results are discussed on p. 202.



3. Discussion

(a) General considerations of flame propagation in limit mixtures
When flame travels through a stationary inflammable gas mixture, or when a
similar unburnt mixture flows through a stationary zone of flame, the heat in the
flame products is the net heat evolved in the resultant chemical reactions of
decomposition and combustion at constant (or nearly constant) pressure of the cold gas,
less the heat radiated and conducted away at the walls of the containing vessel or
converted to mass movement of the gas, plus heat from any condensation of water
on the walls. The flame consists of (1) a region in which the gas mixture, initially
cold, is being heated by conduction, (2) a region in which combustion is occurring,
and (3) a region in which the products are cooling. The first is assumed to be the
region prior to ignition, the combustion being completed at the commencement of
stage (3); but some 'slow' combustion without inflammation may occur in stage (1)
and some combustion may continue in the cooling stage (3). A criterion for flame
propagation is obviously that T_m > T', where T' is a temperature reached at the
commencement of or early in stage (2), such that inflammation must occur once the
temperature T' is attained. T_m is the maximum temperature of the flame. It used
to be held that the gases are heated by conduction of heat from layer to layer, and
at a certain temperature T' the combustion reactions in the heated gas become so
fast and self-accelerating that inflammation occurs. This view of flame propagation
assumed that the unburnt gas mixture itself generated reaction centres and that
ignition occurred when the chain reactions self-accelerated, as in the ordinary
passage from slow combustion to ignition. On this basis a theory of flame
propagation was suggested by Mallard & Le Chatelier (1883). Applications of this theory
have not met with much success. White (1922, 1925) pointed out that the ignition
temperatures as determined by ordinary methods are generally much lower than the
temperatures which he estimated for the flames of limit mixtures, and that the
velocity of limit flames cannot be determined from such ignition temperatures.

The experiments described in § 2 show that: 

(a) A greater amount of total available heat is required to maintain flame in the
l.l. mixtures of higher hydrocarbons than in those of lower hydrocarbons, although
the higher members are more readily ignited.

(b) There is no appreciable difference between the combustion of l.l. mixtures of
the isomers, whereas their ignition characteristics are different. Both these
observations provide further evidence against the inflammability of these limit mixtures
being dependent upon the reactions which lead to self-ignition of the mixture.

Lower limit flames have speeds generally less than 20 cm./sec. and the preflame
zone at atmospheric pressure is very thin (probably less than 0.05 mm.), so that the
heating up and reaction period is only some 0.0005 to 0.0001 sec. Ignition delays
are usually much greater than this, for instance, Coward & Hartwell (1926) gave for
6 % methane in 30 % argon-air 0.19 sec. at 950° C, and the shortest ignition delay
times were obtained with argon as diluent; the conditions for ignition were, however,
different from those which exist in free flames. The delay period for auto-ignition of
the unburnt gas ahead of the flame would thus have to be far shorter than
experimental work under ordinary ignition conditions would seem to indicate as
likely, and the period would have to be little affected by promoters, etc. Further
experiments on ignition delays at high temperatures are indeed needed (cf. Lloyd
1946).
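
The order of magnitude quoted for the heating-up period follows from dividing the thickness of the preflame zone by the flame speed. The sketch below repeats that arithmetic with the limiting figures given in the text (0.05 mm. and 20 cm./sec.) and compares the result with the 0.19 sec. ignition delay cited from Coward & Hartwell; the function and variable names are illustrative only.

```python
def preflame_residence_time(zone_thickness_cm, flame_speed_cm_per_sec):
    """Time (sec.) the unburnt gas spends crossing the preflame zone."""
    return zone_thickness_cm / flame_speed_cm_per_sec


# Limiting figures quoted in the text: 0.05 mm. = 0.005 cm., 20 cm./sec.
residence_time = preflame_residence_time(0.005, 20.0)

# Ignition delay quoted for 6 % methane in 30 % argon-air at 950 deg. C.
ignition_delay = 0.19

ratio = ignition_delay / residence_time
print(f"residence time  = {residence_time:.5f} sec.")
print(f"delay/residence = {ratio:.0f}")
```

With these figures the residence time falls inside the range 0.0005 to 0.0001 sec., and the measured ignition delay is several hundred times longer.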

The above considerations and the results of the experiments described in part I
of this paper indicate that the influence of reactions in the preflame zone is not of
major importance, but that the unburnt gas has first to come into contiguity with
the flame. The preflame reactions have far less to do with the initiation of the flame
in the layer entering the flame zone, than active radicals and molecules with high
energy from the flame itself which are met on entering the flame zone. When once
the gases encounter a region of high enough temperature containing sufficient
concentration of these radicals the combustion is initiated and then proceeds rapidly.



This is not a new conception of flame propagation, for Lewis & von Elbe (1934)
developed a theory and applied it to the propagation of flame in ozone, in which
they assume that radicals were diffusing into the unburnt from the burning gas
(cf. Jost & von Müffling 1938).

The question arises as to whether the maximum flame temperature as suggested by
White (1925) and Tanaka & Nagai (1926, 1927) must be the temperature which
determines whether the reactions necessary for flame propagation are initiated. Although
the maximum flame temperature no doubt is related to the temperature necessary
for propagation, with some mixtures the attainment of the maximum flame
temperature may take longer than with others and the position will be farther within
the inflammation zone. Because of heat loss there will probably be a lower
temperature at the forward flame boundary. The attainment of this temperature in the
boundary region is the criterion for flame propagation at the limit.

For limit flames, the conditions in the flame front will be discussed with the help
of figure 2. T_0 is the initial temperature of the unburnt mixture and the final
temperature of the burnt mixture after cooling has taken place behind the flame.

Figure 2 (abscissa: x)

T' is a temperature between T_0 and T_m which the unburnt gas has to reach if
reaction proceeds and the mixture continues to burn. If this average temperature
is not maintained at the flame boundary, flame is not propagated. The temperature
at which luminosity commences may be at a slightly higher temperature within the
x_1 x_2 zone for it will inevitably take a small fraction of time to develop explosive
reaction. T' governs the concentration of radicals at the flame boundary.

T_m is the maximum flame temperature given by h/C_p, where h is the net heat of
reaction and C_p the mean specific heat of the burnt gas between T_m and T_0, assuming
complete combustion. If T_0 is increased the heat needed to maintain the
temperatures T' and T_m is less by the amount supplied (viz.: C_p(T − T_0), where C_p is the mean
specific heat of the reactants, assuming C_p varies directly with T), as has been shown
by White (1925) to agree closely with experiment. These measurements were not
made exactly under the same conditions as accepted in this communication, the
tubes being 2.5 cm. diameter, and so the actual limit mixtures and flame
temperatures are slightly different, but using the more recent calorific values and
specific heats (see table 5) the conclusion drawn by White is definite. Agreement
is not so close with hydrogen probably because of the effect of tube diameter and
convexion.

Table 5

                calorific values
                of limit mixtures
mixture         at 17°     at 400°                  h_3 =               100(h_1 − h_2 − h_3)/h_1
with air         h_1        h_2       h_1 − h_2     C_p(400° − 17°)

H2               543.5      364.0      179.5         272.5               −17
CO              1103        770.5      332.5         273                   5.5
CH4             1202        917        285           282                   0
C2H4            1088        789        299           283                   0
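
The comparison made in table 5 can be retraced numerically: the reduction in calorific value of the limit mixture on pre-heating, h_1 − h_2, is set against the sensible heat supplied, h_3 = C_p(400° − 17°). The sketch below assumes the table entries read as h_1, h_2 and h_3 in that order, and it assumes that the last column expresses the discrepancy as a percentage of h_1; that reading is inferred from the figures and is not stated in the text. All names are illustrative.

```python
# (h_1 at 17 deg., h_2 at 400 deg., h_3 = C_p(400 - 17)) as read from table 5.
TABLE5 = {
    "H2": (543.5, 364.0, 272.5),
    "CO": (1103.0, 770.5, 273.0),
    "CH4": (1202.0, 917.0, 282.0),
    "C2H4": (1088.0, 789.0, 283.0),
}


def preheat_balance(h1, h2, h3):
    """Return (h_1 - h_2, percentage discrepancy referred to h_1).

    The percentage definition is an assumed reading of the last column.
    """
    saved = h1 - h2
    discrepancy = 100.0 * (saved - h3) / h1
    return saved, discrepancy


for gas, (h1, h2, h3) in TABLE5.items():
    saved, pct = preheat_balance(h1, h2, h3)
    print(f"{gas:5s} h1-h2 = {saved:6.1f}   h3 = {h3:6.1f}   diff = {pct:6.1f} %")
```

On this reading methane and ethylene agree to within about one or two per cent., carbon monoxide to about five per cent., while hydrogen departs widely, as the text remarks.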


At the limits the rise in temperature (T_m − T') will be comparatively small, the
available heat having been used mainly in heating the gas mixture from T_0 to T'.
The position where the unburnt gas reaches T' will be dependent on T_m and on
the gradient between x_3 and x_2; the latter will be dependent on the heat loss and on
the reaction rate. The value of T' for a given combustible will depend on the nature
of the reaction and the diffusion of radicals at T', though the physical properties of
the mixture, specific heat and conductivity, may so affect the energy distribution
that T' will appear to be altered. At the limit, any small reduction in composition
which lowers T_m, or any change causing a small increase in heat loss reduces the
temperature gradient between x_3 and x_2 to zero and T' is no longer maintained;
change of conductivity will affect the heat loss from the flame.

The temperature T_m as given by h/C_p is not necessarily attained because of loss
of heat from the system by radiation; furthermore, as already stated, reaction may
not be completed and the composition of the products may not be fixed until
zone (3) has been reached. Nevertheless, there will be a position, or zone from x_3
onwards, in which T will be a maximum for a limit mixture and it will bear close
relation to h/C_p. The energy converted from chemical to thermal energy in the flame
is h_0, since any small element of the unburnt mixture entering the flame must possess energy





by an equivalent amount of energy

    \int_{T_0}^{T'} C_p \, dT

having to flow from the flame into the incoming gas in order to heat it to T'.
The process of distribution of the energy
within the zone x_1 to x_2 will depend on the rate of reaction and the diffusivity of the
products as pointed out by Lewis & von Elbe (1934). Much has to be learnt about
the structure of the flame front; T', even if it can be considered as an average
condition of the unburnt gas at the flame boundary, cannot be readily estimated.
For the consideration of the behaviour of limit mixtures, particularly lower limit
mixtures in which the reaction is probably complete in quite a narrow region x_1 to x_3,
the question whether T_m is a sufficient criterion for flame propagation will be
examined, bearing in mind that the average value of T' will be a somewhat lower
temperature. The difference between T_m and T' will be at a minimum in limit
mixtures. For an estimate of T' a knowledge of the maximum temperature, the
temperature gradient and the heat loss would be needed, or direct spectroscopic
determination of the temperatures throughout the flame zone and boundary
region.

This view does not preclude that with some mixtures, reaction occurs in the
preflame zone and renders inflammation more rapid, or even that a dual process exists;
but the experimental evidence indicates that the building up of reaction centres in
the preflame zone of the unburnt gas is not usually important in the short time
available. Nevertheless, as White (1922), Prettre (1931) and others found, many
substances can give rise to cool flames at quite low temperatures in certain ranges
of pressure and Townend & Hsieh (1939) have shown that under certain conditions
with certain mixtures, cool flame propagation accompanies ordinary flame
propagation. With cool flames, the process of flame propagation is more closely related
to ordinary ignition processes, and the criterion for propagation may then be
different.

(b) Discussion of experimental results for l.l. mixtures with various diluents


These considerations can now be applied to the experimental data in § 2. 

The flame temperatures T m are given in column 7 of table 6 for all the mixtures 
investigated. 


Table 6

     1               2        3        4        5        6       7        8
                                                                T_m      T_m
  mixture           l.l.      H        h       C_p       K      (°C)   (uncorr.)
                                                                         (°C)
(downward)
  H2-O2-CO2         13.1     8.95     7.57     9.93     231      780      —
  H2-O2-N2           9.0     6.15     5.21     7.19     202      743      —
  H2-O2-He           8.7     5.94     5.03     5.50     562      923      —
  H2-O2-A            7.0     4.78     4.05     5.50     163      754      —

(upward)
  CH4-O2-CO2         9.0    19.18    15.31    11.94     107     1301     1450
  CH4-O2-N2          5.26   11.21     8.90     8.11     150     1123     1250
  CH4-O2-He          4.83   10.29     7.83     5.96     532     1329     1515
  CH4-O2-A           4.01    8.60     6.39     6.01     127     1081     1280

Column 1. Mixture: combustible with diluent and oxygen in same proportion as N2 (+A):O2
in air.

Column 2. Percentage of combustible in the l.l. mixture.

Column 3. Gross heat of combustion in kcal./mol.

Column 4. Net heat of combustion (corrected) kcal./mol.

Column 5. Mean molecular heat of products (corrected) to T_m.

Column 6. Conductivity of mixture at 600° C in kcal./cm.^2 sec. cm. °C. × 10^9.

Column 7. Maximum temperature T_m = T_0 + h/C_p, T_0 = 18° C.

Column 8. Maximum temperature (uncorrected for incomplete combustion).
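
The relation of column 7, T_m = T_0 + h/C_p, can be checked against the tabulated figures. The sketch below does so for those rows whose entries are unambiguous in this copy; it assumes h is expressed in kcal. and C_p in cal. per °C for the same quantity of mixture, which is why a factor of 1000 appears. That unit convention, and the names used, are assumptions made for illustration.

```python
T0_C = 18.0  # initial temperature T_0 quoted under column 7

# mixture: (h from column 4, C_p from column 5, T_m from column 7)
TABLE6_ROWS = {
    "H2-O2-CO2": (7.57, 9.93, 780),
    "H2-O2-N2": (5.21, 7.19, 743),
    "H2-O2-A": (4.05, 5.50, 754),
    "CH4-O2-CO2": (15.31, 11.94, 1301),
    "CH4-O2-A": (6.39, 6.01, 1081),
}


def max_flame_temperature(h_kcal, cp_cal_per_deg, t0_c=T0_C):
    """T_m = T_0 + h/C_p, with h converted from kcal. to cal. (assumed units)."""
    return t0_c + 1000.0 * h_kcal / cp_cal_per_deg


for mixture, (h, cp, tabulated) in TABLE6_ROWS.items():
    calculated = max_flame_temperature(h, cp)
    print(f"{mixture:11s} calculated {calculated:7.1f}   tabulated {tabulated}")
```

For these rows the calculated and tabulated temperatures agree to within a degree or two.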



Although the composition of the cold burnt gases does not provide precise
information for the estimation of T_m, the figures in columns 4 and 5 for methane
combustion have been corrected to allow for the results of the analyses given in table 2,
which indicate that the methane is not quite completely burnt. For the downward
propagating hydrogen mixtures, the combustion is known to be nearly complete
and no correction has been made. No doubt, for methane mixtures, some of the gas
escapes combustion in the space between the flame boundary and the walls; it has
not been attempted to make separate allowance for this, but the allowance is
included in the overall correction.

Figures are inserted in column 6 to indicate approximate values for the
conductivity of the unburnt gases; where experimental values are not available at high
temperatures, the Sutherland formula for the temperature coefficient has been
assumed. The influence of conductivity is quite definite for the helium mixtures,
but the differences in thermal conductivity are not sufficient to account entirely for
differences in T_m for the other mixtures (cf. Coward & Hartwell (1926)).

In connexion with the neglect of heat losses, during the experiments on the
upward propagation of hydrogen l.l. and u.l. flames some experiments were made to
test to what extent a change in the heat received by radiation to the walls affected
the results. The walls were blackened in one set of experiments and silvered in the
other; there was very little difference in the results.

hydrogen (upward propagation)      l.l.       u.l.

normal (glass)                     4.19       74.6
blackened                          4.12       74.3
silvered                           4.17       74.8


Leaving aside discussion of the results for helium mixtures, in table 6 the indication
is that the maximum temperatures are not very different for any l.l. mixture of
a given combustible. The value obtained for the CO2 mixture with methane is high;
but the mixture is just at the bend of the l.l. and u.l. curve (see figure 3 and p. 201).
The results agree quite satisfactorily with the values (see table 7) calculated from
the measurements of Coward & Jones (1938) using mixtures of methane and air
with the same diluents. The helium mixture is exceptional as before and the 33 %
nitrogen mixture is also rather close to the bend of the curve.

The value for the l.l. methane-air mixture is the same as the value obtained by
the authors and if similar allowances had been made for unburnt combustible no
doubt the other values would also agree.

The results obtained for upward propagation in hydrogen mixtures, in which the
combustion is incomplete, are so influenced by preferential diffusion of hydrogen into
pockets of flame that they do not serve for comparison; they are given in table 8
because they show that the effect of helium as diluent is to counteract by dissipation
of heat the diffusion of the hydrogen into combustion centres. The limits for
downward and upward propagation in helium mixtures are far closer than for argon
mixtures. (A nominal temperature, 600° C, is used for estimation of C_p and K.)



The measurements of the l.l. mixtures of hydrogen and of deuterium with air or
with 20 % O2 and 80 % helium, neon or argon by Clusius & Gutschmid (1936) show
that in helium mixtures the effect of convexion is least, but the dissipation of energy
from the flame front greatest. In argon mixtures, on the other hand, the convexion
effect is greatest giving a wide difference between downward and upward propagating
mixtures, but the dissipation is least and therefore the lowest percentage of hydrogen
or deuterium is required for a l.l. mixture.

Figure 3

Table 7

                                                                T_m
mixture                        l.l.       h         C_p        (°C)

19 % CO2 + air + methane       6.0       11.47      9.00       1282
19 % N2 + air + methane        5.4       10.31      8.14       1282
19 % He + air + methane        5.3       10.13      7.00       1338
19 % A + air + methane         4.85       9.26      7.66       1241
33 % A + air + methane         4.57       8.74      7.14       1241
33 % N2 + air + methane        5.65      10.80      8.19       1338

Table 8. Upward propagation

                                               K                 h/C_p
                            h           (for reactants     (C_p for reactants
mixture       l.l.     (uncorrected)      at 600° C)          to 600° C)

H2-CO2        5.31        3.08              178                  300
H2-N2         4.19        2.42              170                  330
H2-He         7.72        4.47              669                  790
H2-A          3.17        1.83              132                  330
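
The uncorrected h of table 8 appears to be simply the lower limit multiplied by the net heat of combustion of hydrogen and divided by 100. The sketch below tests that reading, taking the table 8 limits as 5.31, 4.19, 7.72 and 3.17 per cent. The net heat of 57.8 kcal./mol. is an assumed handbook figure which the paper does not quote, and the names are illustrative.

```python
# ASSUMPTION: net (lower) heat of combustion of hydrogen, kcal./mol.
NET_HEAT_H2 = 57.8

# mixture: (l.l. in per cent., uncorrected h as tabulated in table 8)
TABLE8 = {
    "H2-CO2": (5.31, 3.08),
    "H2-N2": (4.19, 2.42),
    "H2-He": (7.72, 4.47),
    "H2-A": (3.17, 1.83),
}


def heat_per_mol_mixture(lower_limit_pct, net_heat=NET_HEAT_H2):
    """Heat available per mol. of limit mixture, kcal."""
    return lower_limit_pct * net_heat / 100.0


for mixture, (ll, h_tab) in TABLE8.items():
    print(f"{mixture:7s} calculated h = {heat_per_mol_mixture(ll):.2f}   tabulated {h_tab}")
```

On this assumption the calculated values fall within about 0.01 kcal. of the tabulated ones.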



The flame temperatures of the helium mixtures are high in comparison with
those for other diluents; this is presumably due to the high conductivity of the
helium which dissipates the energy from the flame front and increases the proportion
of heat lost. In agreement with this, the authors have found that the lower limit
of methane with helium diluent is decreased from 4.83 to 4.5 when the tube diameter
is increased from 5 to 10 cm., showing that with helium mixtures in 5 cm. tubes
the effect of the loss of heat to the walls is not yet negligible, as it nearly is with
the other diluents. With argon as the diluent, the l.l. was unaltered by increasing
the diameter of the tube from 5 to 10 cm.

The conductivity factor explains the different relative extinguishing power of
helium in tubes of different diameter which was pointed out in the introduction. In
tubes of diameter less than 1.7 cm., helium has a greater extinguishing effect than
carbon dioxide, nitrogen or argon but as the diameter increases its relative effect
becomes progressively smaller. In 2.2 cm. tubes it is less efficient than carbon dioxide
but is still better than nitrogen and argon; when the tube diameter reaches 5 cm.,
it is superior only to argon.

The curves obtained by Coward & Hartwell (1926) for the extinction of flame by
the addition of nitrogen or carbon dioxide to mixtures of methane and air are given
in figure 3. The values of T_m along the lower limit boundary are as follows:

Table 9

N2 %
added to        CH4                              T_m
the air         (%)         h         C_p        (°C)

 0              5.3        10.12      8.10       1260
10              5.4        10.31      8.11       1290
20              5.5        10.50      8.12       1310
30              5.6        10.70      8.15       1330

CO2 %
added to        CH4                              T_m
the air         (%)         h         C_p        (°C)

 0              5.3        10.12      8.10       1260
 5              5.44       10.40      8.34       1260
10              5.61       10.72      8.56       1270
15              5.8        11.08      8.79       1280
20              6.07       11.59      9.04       1300
23              6.57       12.54      9.28       1370


No correction has been made for incomplete combustion as no analyses of end 
products were made. 

The upward trend with increase in diluent is probably due to increasing
incompleteness of the combustion in the flame boundary; for the addition of 20 % of the
two diluents, the temperatures are almost the same, but in the neighbourhood of
the bend of the curve the maximum temperature becomes considerably higher (see
above). Similarly with l.l. carbon monoxide mixtures diluted with nitrogen or CO2,
the temperature remains fairly constant at about 1180° C rising towards the bend
of the curve. For ethane-air mixtures (Jones & Kennedy 1932), and butane-air
mixtures (Jones & Kennedy 1933), diluted with nitrogen and CO2, some figures are
given in table 10.

For butane, the intermediate values can be interpolated in accordance with the
amount of diluent added. The accuracy of these measurements is insufficient to
establish whether the increase of T_m is real; where there is an apparent increase, it
is likely to be due to incomplete combustion in the flame front. The apparent
increase is greater with carbon dioxide as diluent; CO2 and H2O being products,
addition of either of these gases would decrease the amounts produced in reaction
and raise the apparent value of T_m, unless allowance is made. The same cause may
account for the rather high value for T_m (about 1350° C) from the limit values for
methane mixtures diluted with water (Coward & Gleadall 1930).

Though far from complete, the experimental evidence indicates that the maximum
temperature, and therefore the boundary temperature, for l.l. mixtures of a given
combustible tends to be constant, apart from minor influences.


Table 10

N2 %
added to        ethane                           T_m
the air         (%)         h         C_p        (°C)

 0              3.1        10.25      8.26       1260
10              3.1        10.25      8.26       1260
20              3.1        10.25      8.25       1260
30              3.1        10.25      8.23       1260

                butane
                (%)

 0              1.86       11.82      8.52       1410
35              2.1        13.10      8.61       1540

CO2 %
added to        ethane                           T_m
the air         (%)         h         C_p        (°C)

 0              —           —          —         1260
10              3.32       11.33      8.76       1310
20              3.58       12.21      9.18       1350
30              4.03       13.70      9.88       1400

                butane
                (%)

 0              —           —          —         1410
20              2.2        13.98      9.61       1470


(c) Discussion of limits for propagation in hydrocarbon mixtures 

Burgess & Wheeler (1911) found that the relationship LC = k (where L is the
percentage of combustible in the l.l. mixture, C is the net heat of combustion of the
fuel and k, constant = 1059) held for the paraffins up to pentane. However, the
values for the limits were not the same as those generally accepted to-day; they
were determined in a glass sphere with central spark ignition and therefore under
different conditions. The results of the new measurements given on p. 193 show, on
the contrary, that the value of k increases as the paraffin series is ascended and there
is therefore an increase in T_m, for C_p shows only a relatively small change. The values
increase asymptotically to about 1600° C where the effect of dissociation only
begins to become appreciable. The increasing values of T_m indicate that the higher
hydrocarbons are progressively more difficult to inflame than the lower, for the
mixture has to be richer for flame propagation to occur.
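
The departure from the Burgess & Wheeler relation LC = k can be illustrated with the lower limits of table 3. The sketch below multiplies each limit by a net heat of combustion; the heats are approximate handbook values for the gaseous fuels in kcal./mol. and are assumptions, since the paper does not tabulate the figures it used. The names are illustrative.

```python
# Lower limits L (per cent. by volume) from table 3.
LOWER_LIMIT = {
    "methane": 5.26,
    "n-butane": 1.93,
    "n-pentane": 1.62,
    "n-hexane": 1.40,
    "n-heptane": 1.26,
    "n-octane": 1.12,
}

# ASSUMED net heats of combustion C of the gaseous fuels, kcal./mol.
# (approximate handbook figures, not taken from the paper).
NET_HEAT = {
    "methane": 191.8,
    "n-butane": 635.4,
    "n-pentane": 782.0,
    "n-hexane": 928.9,
    "n-heptane": 1075.9,
    "n-octane": 1222.8,
}


def burgess_wheeler_k(fuel):
    """k = L x C for one fuel."""
    return LOWER_LIMIT[fuel] * NET_HEAT[fuel]


k_values = [burgess_wheeler_k(fuel) for fuel in LOWER_LIMIT]
for fuel, k in zip(LOWER_LIMIT, k_values):
    print(f"{fuel:10s} k = {k:6.0f}")
```

With these assumed heats k rises steadily from methane to n-octane instead of remaining at a single constant value.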

Since the ignition temperatures of the n-paraffins decrease with increase of the
chain length, the conditions for flame propagation do not seem to depend upon
the stability or on the ignition characteristics of the compounds but, as indicated
in the earlier discussion, inflammation occurs when the number of reaction centres
derived from the flame are sufficient to inflame the unburnt mixture. Table 11
compares the values of T_m for those hydrocarbons the limits of which have been
determined with their ignition temperatures at 1 atmosphere as measured by
Townend & Maccormac (1939).



It is quite probable that the minimum flame boundary temperature T' for the
inflammation of all the hydrocarbons has about the same value but a slightly higher
concentration of the larger hydrocarbon molecules is needed to maintain the
reaction rate in the flame front and therefore T_m and T' appear to be higher. The
temperature distribution in the flame no doubt changes with the complexity of
the hydrocarbon as is indicated by the character of the flames. Apart from the
decrease in concentration and its effect on the reaction velocity as the complexity
increases, molecules of the higher hydrocarbons presumably take longer to be
broken down and therefore the energy released is not shared so quickly with
neighbouring molecules. There is indication of this in the characteristics of upper limit
flames; some of the hydrocarbon molecules penetrate behind the flame boundary
before being burnt and so the boundary becomes less pronounced.

Table 11

                              T_m     ignition temperature
hydrocarbon        l.l.      (° C)           (° C)

methane            5.26      1255             650
n-butane           1.93      1445             480
n-pentane          1.02      1485             470
iso-pentane        1.61      1481              -
n-hexane           1.40      1508             275
n-heptane          3.20      1509             255
n-octane           1.12      1575             245
iso-octane*        1.15      1000             530

* 2,2,4-trimethylpentane.


The oxidation of the hydrocarbons (RH) is a complicated process and the
mechanism has not been fully unravelled, but the experimental facts seem to
indicate that it is governed by the reactions:

    RH → R– + H
    R– + O2 → ROO–
    ROO– + RH → ROOH + R–

the ROOH either spontaneously breaking to simple products (CH2O, CO and OH,
with CH at higher temperature), or stabilizing under certain conditions, or
providing OR or OH in others.

A flame, since it provides OH, CH and C2 radicals, etc., is able to initiate
inflammation if there are sufficient of such radicals to bring about reaction in a certain
minimum number of molecules entering the flame boundary, by such reactions as
OH + RH → H2O + R–. When inflammation is thus occasioned there is a rapid
release of energy, the temperature of the flame boundary, which is losing heat to
the unburnt gas, being thereby maintained.

The values of the l.l.'s for the olefines lie slightly below the curve for the saturated
hydrocarbons; the point for toluene lies on the curve but that of benzene is much
lower. Acetylene also gives a lower value. It is quite possible that there is some
slight preflame dissociation with the unsaturated hydrocarbons which lowers the
flame boundary temperature, as is indicated by C-C bands and the appearance
of carbon, and also by large departures from the Le Chatelier mixture rule for
mixtures of acetylene and hydrogen or hydrocarbons. The values for most other
carbon compounds lie about the curve for the hydrocarbons, but other combustible
substances which readily give rise to cool flame phenomena may have quite different
values for T_m at the limits; for instance, hydrogen sulphide seems to give a value of
about 705° C and carbon disulphide 420° C.

Le Chatelier’s Rule, that mixtures of limit mixtures are limit mixtures, is a useful 
approximation and implies that the several combustibles do not influence each 
other's behaviour in a limit mixture. It holds well for combustibles of the same 
character (e.g. hydrocarbon vapours) and even for gases with such different physical 
properties as hydrogen and methane (in downward propagating mixtures). For 
mixtures which contain combustibles readily exhibiting preflame or cool flame 
combustion, e.g. H2S, CS2, etc., the law does not hold, cf. White (1925).
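
Le Chatelier's rule is commonly written in the equivalent form L = 100 / Σ(p_i / L_i), where p_i is the percentage of each combustible in the fuel blend and L_i is the limit of that combustible alone. The Python sketch below assumes that form; the function name is illustrative only, and the 50 : 50 blend of methane and n-butane is a hypothetical example that borrows the l.l. values of table 11.

```python
def le_chatelier_limit(blend_percentages, individual_limits):
    """Limit (per cent of fuel blend in the mixture) by Le Chatelier's rule.

    blend_percentages: share of each combustible in the fuel blend (any
        consistent scale; normalised to 100 internally).
    individual_limits: limit of each combustible on its own, per cent.
    """
    if len(blend_percentages) != len(individual_limits):
        raise ValueError("one limit is needed for each combustible")
    total = sum(blend_percentages)
    # Normalise the blend to 100 parts, then apply L = 100 / sum(p_i / L_i).
    shares = [100.0 * p / total for p in blend_percentages]
    return 100.0 / sum(p / limit for p, limit in zip(shares, individual_limits))


# Hypothetical 50 : 50 blend of methane (l.l. 5.26) and n-butane (l.l. 1.93).
blend_limit = le_chatelier_limit([50.0, 50.0], [5.26, 1.93])
print(f"estimated l.l. of the blend: {blend_limit:.2f} per cent")  # about 2.82
```

The rule is only an approximation, and, as the text notes, it should not be expected to hold for combustibles that show preflame or cool flame combustion.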


(d) Relation between upper and lower limits 

Another test of the conclusion that a temperature T', depending on the specific
heat and heat of reaction, is the criterion for flame propagation can be made. In
the u.l. mixtures there is excess of combustible, the latter itself acting as diluent,
so if no factor is influencing the limit other than the heat of reaction and C_p, and the
excess combustible remains largely unchanged after the passage of flame, it should be
possible to estimate the value of the u.l. in oxygen by allowing for the change in the
molecular heat of the mixture due to the substitution of nitrogen by the combustible vapour.
If x, y and z are the percentages of fuel, oxygen and nitrogen respectively and
C_p(N2) and C_p(x) are the molecular heats of nitrogen and fuel from 18° C to T_m (assuming
T_m to be the same for the u.l. mixture as for the l.l.) then:

\[
\text{u.l. (O}_2\text{)} = \frac{\left[\,x + z\,\dfrac{C_{p(\mathrm{N_2})}}{C_{p(x)}}\right] 100}{x + z\,\dfrac{C_{p(\mathrm{N_2})}}{C_{p(x)}} + y}
\]

Table 12

                          u.l. (O2)
         u.l. (air)     calc.     obs.

H2         74.0         94.8      94
CO         74.0         94.5      94
NH3        27.0         79.9      79
CH4        14.3         73.5      00
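
The estimate of the u.l. in oxygen can be sketched in Python as follows. The nitrogen of the air mixture is replaced by the quantity of fuel with the same heat capacity, z·C_p(N2)/C_p(x), and the fuel percentage of the resulting fuel and oxygen mixture is taken. The function name, the 20.9 % oxygen content of air and the equal molecular heats of carbon monoxide and nitrogen are assumptions made for illustration, not values taken from the paper.

```python
def upper_limit_in_oxygen(x, y, z, cp_nitrogen, cp_fuel):
    """Estimate the upper limit in oxygen from the upper-limit mixture in air.

    x, y, z: percentages of fuel, oxygen and nitrogen in the u.l. mixture in air.
    cp_nitrogen, cp_fuel: mean molecular heats of nitrogen and fuel between
        18 deg C and T_m (any common unit, since only their ratio is used).
    """
    # Fuel that takes up the same heat as the nitrogen it replaces.
    equivalent_fuel = x + z * cp_nitrogen / cp_fuel
    return 100.0 * equivalent_fuel / (equivalent_fuel + y)


# Illustrative case: CO at an upper limit of 74.0 % in air.
fuel = 74.0
air = 100.0 - fuel
oxygen = 0.209 * air          # assumed oxygen fraction of air
nitrogen = air - oxygen
# Assumption: CO and N2 have practically the same mean molecular heat.
estimate = upper_limit_in_oxygen(fuel, oxygen, nitrogen, cp_nitrogen=1.0, cp_fuel=1.0)
print(f"estimated u.l. in oxygen: {estimate:.1f} per cent")  # about 94.6
```

With these assumptions the result is close to the calculated value of 94.5 given for CO in table 12; a fuel with a larger molecular heat than nitrogen gives a correspondingly lower estimate.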

At the u.l. for hydrocarbons, combustion to CO2 is not complete and, without
a knowledge of the extent of combustion in the flame front and correction, the
method cannot give values in agreement with observed values, but where
combustion is complete in the flame, the u.l. values estimated from the l.l.'s agree quite
well. The relationship between the two limits can be seen by reference to the values
obtained by Jones (1929) for CO mixtures diluted with nitrogen and carbon dioxide
(table 13). Within the error of the experiments the values of T_m are fairly constant,
though with a tendency for lower values of T_m for the u.l.'s. The higher values
for T_m at the l.l.'s are partly due to the rather high value (13.3) for the l.l. mixture:
the l.l. for CO is usually given as 12.5, which would make T_m = 1110° C. Complete
combustion has been assumed. The position of the bend of the curve plotting the
inflammability limits (see figure 3) is slightly on the rich side of the stoichiometric
mixture; excess of oxygen would be unfavourable to this reaction. For hydrocarbon
vapours higher than methane a similar tendency is shown, the displacement
being greater the higher the hydrocarbon in the series. This is presumably
due to the increase in the molecular proportion of oxygen to hydrocarbon required
for complete combustion, which has the effect of decreasing encounters between
the two molecules. A richer mixture is therefore favourable to reaction because,
although the extent of oxidation decreases with increasing richness of mixture,
the number of collisions between fuel and oxygen molecules increases. A higher
proportion of incompletely burnt fuel remains in the products and an apparently
higher value of T_m is obtained.


Table 13

  CO      O2      N2     T_m (° C)       CO      O2      N2      CO2     T_m (° C)

 72.0     5.9    22.1      1020         72.0     5.9    22.1     0         1020
 45.4     5.6    49.0      1020         47.9     6.5    24.8    20.8       1050
 29.5     5.7    64.8      1030         39.0     6.8    25.8    28.4       1050
 24.8     5.7    69.5      1030         27.3     7.2    27.3    38.2       1000
 20.3     5.7    74.0      1030         21.8     7.8    29.5    40.9       1128
 16.8     5.6    77.6      1020         18.6     8.6    32.5    40.3       1230
 14.3     5.9    79.8      1070         16.5    10.3    39.2    34.0       1220
 13.9     6.0    80.1      1080         15.5    12.0    45.4    27.3       1180
 13.7     9.8    76.5      1230         15.0    13.2    49.8    22.0      (1220)
 13.4    13.2    73.4      1190         14.3    14.4    55.1    16.2       1160
 13.3    18.1    68.6      1180         14.0    16.6    62.7     6.7       1190


(e) Discussion of experimental results for upper limit mixtures with various diluents 

The values for the u.l. for upward propagation of the mixtures investigated are
given in table 14. The u.l.'s for upward propagation in hydrogen mixtures are very
closely the same as for downward.

The products of combustion of the methane mixtures were analyzed after the
flame had travelled through the tube; the results were given in table 2. With hydrogen
at the upper limit, the combustion of the oxygen to water is complete.

The values of T_m for hydrogen are in approximate agreement with the values
obtained from the measurements by Jones (1929) for a range of limit mixtures in
which the hydrogen : nitrogen proportion is changed (table 15). The measurements
were made in 5 cm. tubes with upward propagation (above 11 % the flame front is
normal and little influenced by convexion). The value of the u.l. mixture in air was
slightly lower than that given in table 14, which makes T_m rather greater.


Table 14

      1              2        3        4        5        6         7
   mixture          u.l.      H        h       C_p       K     T_m (° C)

H2-O2-CO2           69.8     8.67     7.35     8.02     609       934
H2-O2-N2            74.6     7.29     6.18     7.00     642       902
H2-O2-He            75.7     6.96     5.90     6.52     745       909
H2-O2-A             76.4     6.78     5.75     6.53     647       898
CH4-O2-CO2           9.0    19.18    15.31    11.93     167      1301
CH4-O2-N2           14.3    19.28    14.40     8.60     180      1581
CH4-O2-He           16.1    18.76    11.93     7.41     516      1628
CH4-O2-A            17.3    18.50    10.94     7.00     165      1458
H2-N2O-N2 (a)       29.0    13.11    11.53     8.01     333      1458
H2-N2O-N2 (b)       65.8    12.64    11.12     7.84     581      1438


Column 1. Mixture: combustible with diluent and oxygen in same proportion as (N2 + A) : O2
in air.

Column 2. Percentage of combustible in u.l. mixture.

Column 3. Gross heat of combustion of the limit mixture in kcal./mol. (for complete
combustion to CO2 and water).

Column 4. Net heat of combustion (corrected for analysis) in kcal./mol.

Column 5. Mean molecular heat of products between 18° C and T_m.

Column 6. Conductivity of reacting mixture at 600° C in kcal./tan.* sec. cm. °C. x 10*. 

Column 7. Maximum temperature (uncorrected for heat loss) T_m = T_0 + h/C_p.
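
The relation of column 7 can be checked against individual rows of table 14 with a short Python sketch. It assumes T_0 = 18° C and that the mean molecular heat of column 5 is expressed in cal./mol. °C (hence the factor of 1000 applied to the heat in kcal./mol.); the unit of column 5 is not stated in the notes above, so this is an inference, and the function name is illustrative.

```python
def maximum_temperature(net_heat_kcal, mean_molecular_heat_cal, initial_temp_c=18.0):
    """T_m = T_0 + h / C_p, uncorrected for heat loss.

    net_heat_kcal: net heat of combustion of the limit mixture, kcal./mol.
    mean_molecular_heat_cal: mean molecular heat of the products,
        assumed here to be in cal./mol. deg C.
    initial_temp_c: initial temperature T_0 in deg C (assumed 18).
    """
    return initial_temp_c + 1000.0 * net_heat_kcal / mean_molecular_heat_cal


# Two rows of table 14 as (h, C_p, tabulated T_m).
rows = {
    "H2-O2-CO2": (7.35, 8.02, 934),
    "CH4-O2-CO2": (15.31, 11.93, 1301),
}
for mixture, (h, cp, tabulated) in rows.items():
    computed = maximum_temperature(h, cp)
    print(f"{mixture}: computed {computed:.0f} deg C, tabulated {tabulated} deg C")
```

Both rows reproduce the tabulated T_m to within a degree under these assumptions, which supports the reading of the units adopted here.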



Table 15

       limit mixture
  H2       O2       N2         K       T_m (° C)

 72.0      5.9     22.1       620         970
 51.6      5.6     42.8       480         930
 29.0      5.3     65.1       340         870
 15.2      5.1     79.7       240         860
 11.5      5.1     83.4       220         850


The trend in these figures is no doubt due to the effect of increasing conductivity
as the proportion of H2 increases.

The values of T_m for the u.l. for hydrogen (upward propagation) lie close to the
value obtained for the l.l. for both downward and upward propagation with helium
as diluent; the conductivities of the mixtures are comparable and the temperatures
of the flame front controlling propagation have much the same value in each case.
One reason for the lower values of T_m for the l.l.'s is to be found in the lower heat loss
with these mixtures, which have a much lower conductivity than helium diluted
mixtures or u.l. mixtures with excess of hydrogen.

The bend of the curve expressing the inflammability limits for hydrogen mixtures
is also in an exceptional position on the weak side of the position for the
stoichiometric mixture. This is probably due to the high diffusivity of hydrogen causing
heat to be removed from the flame front; thus a smaller proportion of hydrogen
favours reaction.

The values of T_m in table 14 for mixtures of N2O, hydrogen and nitrogen at the
u.l. are interesting because, as with other mixtures in which combustion is continued
well behind the flame front, the estimated values are high. From the character of
these flames with nitrous oxide it is quite obvious that oxidation is slower than with
mixtures containing oxygen; the flames themselves are slower and the products
luminous for some distance behind the flame front. The reaction in the flame front
cannot be well represented by N2O + H2 = N2 + H2O as assumed and the flame
temperature is probably considerably lower than is estimated.

The values of T_m in table 14 obtained for the u.l. methane mixtures are higher
than those for the l.l. mixtures (except for the CO2 mixture, where T_m is the
same for both, the point being on the bend of the limit curve). This is probably
due to slower reactions and to incomplete combustion in the flame front. Although
corrected, using the results of the analyses of the products, the actual values do not
mean much; the temperature cannot be determined without knowledge of the
initial reactions in the flame front, and even so, in the first encounter of unburnt
gas with the flame boundary, the time for reaction may be short compared with the
time for temperature equilibrium to be established and the local temperature can
be far higher than the average.

The flames of the u.l. methane mixtures have a rather different structure from
those at the l.l.: the flame is shaped more like a paraboloid and has a faint, short
conical tail, whereas the l.l. flame is nearly hemispherical with a rather ragged lower
edge and no tail. This appearance indicates that reactions continue to take place
after the first region of contact of the unreacted gases with the flame. These
characteristics are still more noticeable in the flames of higher hydrocarbons. The products
of reaction at the flame front are both CO2 and CO, resulting from reactions such
as CH4 + O2 → (CH3O2) → (CO + H2 + CO2 + H2O). If the reaction produced CO2
from CH4 + 2O2 → CO2 + 2H2O, then with the N2 mixture T_m should be about
1850° C, whereas for CH4 + O2 → CO + H2 + H2O, T_m would be about 1080° C. The
real value probably lies nearer the latter figure. Reactions which are continued
behind the flame front give rise to high values of T_m. At the u.l., where there is a
considerable deficiency of oxygen, any molecule of the combustible which is not
consumed at the flame front may travel some distance before it reaches its final
degree of oxidation; hence the difference between the structure and temperature
gradient of upper and lower limit flames.

This difference is also indicated by the behaviour of inhibitors like methyl iodide
or bromide, which influence particularly the u.l.'s. These substances considerably
increase the total thickness of the flame and raise the heat loss to the walls by
slowing down the reaction in the flame and decreasing the number of active radicals
available in the flame front to start inflammation. The calculated values of T_m
would consequently be very high. The value of T' would no doubt be somewhat
higher than it is for an uninhibited mixture, because propagation is not only
dependent on the temperature and therefore on the average energy of the molecules
in the flame front, but also on the radicals in the flame front suitable for initiation
of the reactions in the unburnt gas. If the necessary radicals are reduced in number
by an inhibitor a higher temperature is needed to induce flame propagation.

It is curious that the character of the flames of limit mixtures of hydrocarbons in
the presence of ‘promoters’ is often considerably changed without alteration of
the limit. This is because for the small quantity of ‘promoter’ added, the
inflammation temperature of the combustible is little changed: the effect of the ‘promoter’
only comes later in the body of the flame after the first stages of reaction. In one case,
however, that of ethyl nitrate, a definite effect on the upper limit of the combustion of
butane was observed. If the change of the reaction rate within the flame due to the
additive is sufficient, it will have its effect on the boundary temperature. Such
experiments as have been made indicate that inhibitors such as lead tetraethyl, tin
tetramethyl, selenium diethyl (Tanaka & Nagai 1927) and iron carbonyl (Thompson 1932)
have only a small effect on l.l.'s, but very definitely narrow the upper limit, as would
be consistent with the conclusion that their inhibitory action is exerted on the
reactions within the flame boundary. Dufraisse & Le Bras (1936) ascribed the
extinctive effect of carbon tetrachloride partly to hindrance of the oxidation
of the combustible owing to its physical properties and partly to the inhibition
of the chemical process of oxidation. The diffusion of radicals into the unburnt gas
being an important feature of the process of flame propagation, the presence of large
molecules with slow average velocity will tend to raise T'; the presence of such
substances as carbon tetrachloride, and even carbon dioxide, will produce such
a tendency. The fact that the values for T_m are low for mixtures containing hydrogen
can in the opposite way be ascribed to the easier diffusion of radicals.

It is not possible without further knowledge of the initial reaction in the flame
front to estimate the true values for T_m, much less for T', for u.l. mixtures of
hydrocarbons and other organic compounds with air, but, as already pointed out, when the
reaction in the flame front is simple it can be shown that there is little difference
between the value for the u.l.'s and l.l.'s.

4. Conclusion 

A review of the effects of various diluents on the upper and lower limit mixtures
for the propagation of flame with various combustibles has led to the conclusion
that whether combustion takes place or not is dependent upon the temperature of
the flame being sufficient to maintain a certain boundary flame temperature which
provides a certain concentration of active radicals in the boundary region;
inflammation is produced in the unburnt gas mainly from the radicals derived from the
flame. This does not preclude the possibility that in some cases a double process
may be occurring, as indicated by Townend's researches, the one due to cool flame
reactions in the unburnt gas and the other due to the above process of inflammation;
but for limit mixtures this is exceptional. It should also not be assumed that,
because for mixtures near the limits little reaction takes place in the unburnt gas
before entering the flame, this is necessarily so for fast burning mixtures.

It is shown to be possible on this view to estimate approximately whether a given
mixture of combustibles and diluents should propagate flame or not, provided the
specific heats are known. This, however, is not possible for u.l. mixtures of
hydrocarbons and organic substances without more knowledge of the reactions in the
flame front.

Flame structures, particularly those at low pressures, are being investigated by
spectroscopic methods in this laboratory. Experiments have also been made with
promoters on the velocity of flames and the bearing of the above conclusions on the
mechanism of flame propagation is being further studied. Preliminary results with
butane-air mixtures have shown that the presence of ethyl nitrate, diethyl peroxide
and nitrogen peroxide has little influence on the flame speed. The additives,
particularly nitrogen peroxide, appear to retard rather than increase the rate of
propagation.

The authors are grateful for discussion with Mr P. Lloyd, to the Gas Light and 
Coke Co. for their grant of the Fellowship which enabled this work to be carried 
out and also to Messrs Rolls Royce Ltd., for further assistance at a later stage. 

References 

Booy, J. 1932 Rec. Trav. chim. Pays-Bas, 49, 309.
Burgess, M. J. & Wheeler, R. V. 1911 J. Chem. Soc. p. 2013.
Clusius, K. & Gutschmidt, H. 1936 Z. Electrochem. 42, 498.
Coward, H. F. & Gleadall, J. J. 1930 J. Chem. Soc. p. 243.
Coward, H. F. & Hartwell, F. J. 1926 J. Chem. Soc. p. 1522.
Coward, H. F. & Hartwell, F. J. 1932 J. Chem. Soc. pp. 1996, 2676.
Coward, H. F. & Jones, G. W. 1938 Bull. U.S. Bur. Min. 279.
Dufraisse, C. & Le Bras, J. 1936 C.R. Acad. Sci., Paris, 202, 227.
von Heiningen, J. 1936 Rec. Trav. chim. Pays-Bas, 55, 65.
Jones, G. W. 1929 Tech. Pap., Bur. Min., Wash., no. 450.
Jones, G. W. & Kennedy, R. E. 1932 Rep. U.S. Bur. Min. Invest., p. 3172.
Jones, G. W. & Kennedy, R. E. 1933 Rep. U.S. Bur. Min. Invest., p. 3216.
Jost, W. & von Muffling, L. 1938 Z. phys. Chem. A, 181, 208.
Le Chatelier, H. & Boudouard, O. 1898 C.R. Acad. Sci., Paris, 126, 1344.
Lewis, B. & von Elbe, G. 1934 J. Chem. Phys. 2, 537.
Lloyd, P. 1946 6th Int. Congr. Appl. Mech., Paris, Sept. 1946.
Mallard, E. & Le Chatelier, H. 1883 Ann. Mines, 8 (4), 289.
Pearson, T. G. 1931 Z. phys. Chem. A, 156, 86.
Posthumus, K. 1930 Rec. Trav. chim. Pays-Bas, 49, 309.
Prettre, M. 1931 Ann. Office Nat. Combust. liq. 6, 269; 7, 533.
Tanaka, Y. & Nagai, Y. 1926 Proc. Imp. Acad. Japan, 2, 280, 494.
Tanaka, Y. & Nagai, Y. 1927 Rep. Aeronaut. Res. Inst. Tokyo Imp. Univ. 2, 235, 275.
Tanaka, Y. & Nagai, Y. 1927 Proc. Imp. Acad. Japan, 3, 434.
Thompson, H. W. 1932 Z. phys. Chem. B, 18, 219.
Townend, D. T. A. & Hsieh, M. S. 1939 J. Chem. Soc. p. 337.
Townend, D. T. A. & Maccormac, M. 1939 J. Inst. Petrol. Tech. 25, 459.
White, A. G. 1922 J. Chem. Soc. p. 1244.
White, A. G. 1925 J. Chem. Soc. pp. 49, 672.



Bakerian Lecture


Some problems in connexion with the development of 
a high speed Diesel engine 

By H. R. Ricardo, F.R.S.*

(Delivered 6 November 1947 —Received 12 November 1947) 

[Plate 4] 

When invited to deliver the Bakerian Lecture, I felt very diffident about accepting, 
for I am a mechanic rather than a scientist, and the tale I am going to tell is mostly 
one of mere plumbing and ironmongery. I have chosen, as a theme, my own personal 
experiences over the development of high speed Diesel engines, but I would like to 
emphasize, at the outset, that this was no single-handed effort for, not only had 
I the enthusiastic support of my small but very capable staff, but also the most 
unstinted help and advice from the technical staffs of manufacturing firms, from 
such establishments as the Royal Aircraft Establishment and The National Physical 
Laboratory, combined with financial help and encouragement from the Air Ministry 
and the Shell Company; so that nobody could have had happier or more favourable 
circumstances than I. 

I think perhaps I should try and explain, as briefly as possible, the circumstances. 

We established our experimental laboratory at Shoreham during the closing 
phases of the 1914 War, our intention being to carry out research and development 
work on internal combustion engines, with a view to providing manufacturers with 
advice and up-to-date designs. 

Our first large scale undertaking was a research on volatile fuels for spark ignition 
engines carried out on behalf of the Shell Company and, in parallel, a research into 
the possibilities of the sleeve valve engine for aircraft under a research contract for 
the Air Ministry.

The research on volatile liquid fuels revealed, among other interesting matters, 
that the tendency of any fuel to detonate was the dominating factor controlling the 
performance of spark ignition engines—this tendency we now define as the octane 
number of the fuel—and also that such engines could not be expected to operate 
satisfactorily on a fuel the final boiling point of which exceeded about 180° C. 

At that date, the heavy demand for petrol of low boiling point had left the oil 
companies with an embarrassingly large proportion of heavier distillates, which 
were becoming a drug in the market. 

The Diesel engine had, of course, long since been evolved to digest these heavier
fractions, but it had been developed only in the very heavy and low-speed form and
was, therefore, suitable only for stationary or marine use, but by far the largest
consumer of fuel was road transport, which was limited to the use of petrol. It
became apparent, therefore, that there would soon be a crying need for a light high
speed Diesel engine with an all-round performance comparable with that of the
petrol engine. Again the Air Ministry were anxious to explore the possibilities of
a Diesel engine for aircraft.

* Now Sir Harry Ricardo.

And now, perhaps, I had better attempt to define what I mean by the term 
‘high-speed engine’. High speed and low speed are, of course, relative terms and
there is no obvious dividing line; clearly a high speed engine can run slowly, and 
why should not a low speed engine be made to run fast? The dividing line is to be 
found in an entirely different conception of design. In the low speed engine, the 
dynamic forces set up by the inertia of the pistons and other moving parts are 
relatively small, hence these parts can be made very massive, but dynamic forces 
increase as the square of the speed and, in a high speed engine, they become the 
dominating factor. The essence, therefore, of high speed engine design is the use of 
as rigid a structure as possible containing the lightest possible moving parts, while 
stiffness rather than strength becomes the goal to aim at. 
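
The square-law growth of the dynamic forces can be illustrated with a short Python sketch. It uses the elementary estimate F = m r ω² for the peak primary inertia force of a reciprocating mass m on a crank of radius r, neglecting the connecting-rod correction; the mass, crank radius and speeds below are hypothetical values chosen only to show the scaling, not figures from the lecture.

```python
import math


def peak_primary_inertia_force(mass_kg, crank_radius_m, speed_rpm):
    """Peak primary inertia force (newtons) of a reciprocating mass.

    Uses F = m * r * omega**2 and ignores the secondary (connecting-rod) term.
    """
    omega = 2.0 * math.pi * speed_rpm / 60.0  # crank angular velocity, rad/s
    return mass_kg * crank_radius_m * omega ** 2


# Hypothetical piston assembly: 2 kg reciprocating mass, 90 mm crank radius.
slow = peak_primary_inertia_force(2.0, 0.09, 500.0)
fast = peak_primary_inertia_force(2.0, 0.09, 2500.0)
print(f"force at  500 r.p.m.: {slow:8.0f} N")
print(f"force at 2500 r.p.m.: {fast:8.0f} N")
print(f"ratio: {fast / slow:.0f}")  # a fivefold rise in speed gives 25 times the force
```

The ratio does not depend on the mass or the crank radius assumed, which is why lightness of the moving parts is the only remedy open to the designer at high speed.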

We realized fully that to compete successfully with the petrol engine on the road 
or in the air, we should have to aim at the development of a Diesel engine which 
could run nearly, if not quite, as fast, would be equally as versatile and flexible and, 
if at all possible, equally simple and economical to maintain. Inevitably the Diesel 
engine would be more expensive to produce but this, we hoped, would be offset by 
its lower fuel consumption. 

In the case of the spark ignition engine, control of power is effected solely by 
varying the quantity, not the quality, of externally carburetted air, hence only the 
pressure scale is varied and the thermal cycle, apart from direct heat losses and 
other secondary factors, is unaffected by variations of power. In that of the Diesel 
engine, the power is varied by cutting off the supply of fuel earlier or later in the 
expansion stroke, thus the smaller the quantity of fuel admitted per cycle, the 
larger the effective expansion ratio and the higher the thermal efficiency. Again, 
as the fuel quantity is reduced, so the mean temperature falls and with it both 
the direct heat losses and those due to change of specific heat and dissociation. 

In the Diesel cycle, therefore, the thermal efficiency increases progressively as 
the power is reduced; this is a very important consideration as applied to road
transport where the average operating load is only 30 to 40 %. 
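
The rise of efficiency at reduced load can be illustrated with the ideal air-standard Diesel cycle, for which η = 1 − (ρ^γ − 1) / [γ (ρ − 1) r^(γ−1)], where r is the compression ratio and ρ the cut-off ratio (volume at the end of fuel admission divided by the clearance volume). This is a textbook idealization rather than anything given in the lecture; the compression ratio of 16, γ = 1.4 and the cut-off ratios in the Python sketch below are assumed values.

```python
def diesel_cycle_efficiency(compression_ratio, cutoff_ratio, gamma=1.4):
    """Thermal efficiency of the ideal air-standard Diesel cycle.

    compression_ratio: r, total cylinder volume over clearance volume.
    cutoff_ratio: rho > 1, volume at end of heat addition over clearance volume.
    gamma: ratio of specific heats of the working fluid (assumed constant).
    """
    if cutoff_ratio <= 1.0:
        raise ValueError("cut-off ratio must exceed 1")
    r, rho = compression_ratio, cutoff_ratio
    return 1.0 - (rho ** gamma - 1.0) / (gamma * (rho - 1.0) * r ** (gamma - 1.0))


# Earlier cut-off (smaller rho) corresponds to less fuel per cycle, i.e. lower load.
for rho in (3.0, 2.0, 1.5, 1.1):
    eta = diesel_cycle_efficiency(16.0, rho)
    print(f"cut-off ratio {rho:3.1f}: ideal efficiency {100.0 * eta:5.1f} per cent")
```

In this idealization the efficiency climbs steadily as the cut-off ratio falls towards unity, approaching the constant-volume limit 1 − 1/r^(γ−1). Real engines depart from it through the heat losses, variable specific heats and dissociation mentioned in the text, all of which also diminish at light load.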

As I have always visualized it, the process of combustion in a petrol engine
cylinder is somewhat as follows: The spark passes, leaving behind it a thin thread
of flame; this minute nucleus builds up, slowly at first, somewhat after the manner
of a soap bubble and, if left undisturbed, would spread thus throughout the whole
of the combustion chamber, a relatively slow process. It is not left undisturbed,
for the bulk of the mixture is in a high degree of turbulence so that the flame front
is torn to shreds which are distributed rapidly throughout the whole combustion
space, as it were by handing round the fiery torch, and the rate at which general
inflammation then takes place is controlled almost solely by the intensity of
turbulence. There will thus be first a delay period during the building up of the
initial nucleus which will be more or less constant in time, followed by a pressure
rise due to the spread of flame more or less constant in terms of crank angle; see
figure 1, which shows two typical indicator diagrams taken from a petrol engine
cylinder.



Figure 1. Diagrams showing increase of ‘delay’ period with increase of speed.

In the Diesel engine, the process of combustion is necessarily somewhat different.
In this case, air alone is compressed in the cylinder to a pressure and temperature
well above the self ignition temperature of the fuel. Into this highly heated air, the
fuel is injected as a finely divided liquid spray. As we visualized it, each minute
droplet would be very quickly surrounded by an envelope of its own vapour which,
in turn, would soon become inflamed and, thereafter, the rate of burning would
depend largely upon the rate at which the now flaming droplet could both find the
oxygen it required and be cleansed of its own combustion products; in other words,
on its motion relative to the air. It would, of course, depend also on the size of the
liquid droplet. We expected, therefore, to find a condition somewhat analogous to
that in a petrol engine; first a delay period while the envelope of vapour was being
formed, during which a substantial quantity of fuel had entered the cylinder without
any appreciable rise of pressure or temperature; this would be followed by a rapid
rise in pressure when the accumulated droplets became fully inflamed and this, in
turn, would be followed by a less rapid and more controllable rate of rise when the
temperature was so high that the droplets would burn as they entered with very
little delay indeed. We expected, therefore, to realize a pressure time diagram much
as shown in figure 2, somewhere about half-way between the constant volume and
the constant pressure cycle. Figure 3 shows an actual diagram taken later from
a high speed sleeve valve Diesel engine.

It was clear at the outset that the two major problems before us were an
investigation into the process of combustion and the design and development of a fuel
injection system which would meter and time accurately the delivery of very minute 
quantities of liquid fuel, for it was quite obvious that the systems then in use on 
large slow running engines would not be applicable. Our first step was clearly the 
design and development of a fuel injection system which would be reasonably easy 
and cheap to produce in very small sizes, and which could be relied upon both to 
meter and time accurately the injection, at very high pressure, of minute quantities 
of liquid fuel of the order of 10 cu.mm., or even less, per cycle and that at speeds of
at least 2500 r.p.m., and higher if possible. This, though it looked a formidable
problem, in fact proved to be very much easier than we had expected. I thought that
the best way to tackle it would be to use an ordinary plunger pump with no packing 
but with very long sealing surfaces to prevent leakage, to give this pump relatively 
a very long stroke but to pick out for delivery only a small fraction from 0 to 10 % 
of the mid-portion of the stroke when the plunger was moving at its maximum 
velocity. This was effected by using two cam operated inlet valves and a single 
spring loaded delivery valve. Throughout the whole of the suction stroke of the 
pump, both inlet valves were held wide open. At the end of the suction stroke, one 
valve closed but the other was held open until nearly half-way up the delivery stroke. 
This was then also closed and the entrapped fuel delivered to the injector until, 
a little further up the stroke, the first valve was opened again, thus allowing the 
rest of the fuel to return to the suction side. Thus the time of start of injection was 
determined by the closing of one valve and the period and quantity delivered by 
the re-opening of the other. The timing of both valves was controlled by moveable 
taper wedges between the ends of the valve stems and the tappets. This pump 
worked well from the start and, although we had many teething troubles, due 
mostly to surging and aeration of the fuel, it remained practically unaltered for 
many years. Figure 4 shows this injection pump as fitted to a sleeve valve engine.

As would be expected with a pump of this type, the delivery per stroke tended 
to increase with increase of speed. This rising characteristic was very marked at low 
speeds but tended to flatten out at the higher speeds, owing to the elasticity of the 
fluid under very high pressure. We realized, of course, that this rising characteristic 


14-2 



214 


H. R. Ricardo 


would be a nuisance, in that, when idling, the engine would be unstable and, at any 
fixed setting of the pump, would tend either to run away or to peter out and stop. 
To cope with this instability, we developed a two-speed centrifugal, and later 



25 TDC 25° 

crank angle 


cma 


75° 100 


Figure 2. Diagram showing three phases of combustion process in 
compression ignition engine. 


kmmm 

■HUB 


100 ' 75° 


ERT9 


25° TDC 25° 50° 75° 100° 

crank angle 


Figuhe 3. Single oylinder bore 5J in., stroke 7 in.; speed 2200 r.p.m.; b.m.e.p. 00 lb./sq. in., 
i.m.e.p. 122 Ib./eq. in.; fuel consumption per b.h.p, hr. 0*39 lb., fuel consumption per 
i.h.p. hr. 0*285 lb.; piston speed 2570 ft./min. 




5 



















216 


H. R. Ricardo 


a vacuum governor, the functions of which were to increase automatically the 
delivery of the pump whenever the engine speed fell below a certain pre-determined 
minimum, and to cut it off altogether when the limit of safe maximum speed was 
reached; between these two wide limits, the engine would be operated under direct 
hand or foot control. 
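
The governor behaviour just described amounts to a simple piecewise rule. The Python sketch below is purely illustrative: the function name, the two speed limits, the linear boost law and its gain are all assumptions chosen for the example, and none of them is a figure given in the lecture.

```python
def governed_delivery(speed_rpm, hand_setting,
                      min_rpm=400.0, max_rpm=2300.0, idle_gain=0.002):
    """Return pump delivery as a fraction of full delivery (0.0 to 1.0).

    hand_setting is the driver's demand (0.0 to 1.0). All numerical
    defaults are hypothetical and serve only to illustrate the logic.
    """
    if speed_rpm >= max_rpm:
        # Safe maximum speed reached: cut the fuel off altogether.
        return 0.0
    if speed_rpm < min_rpm:
        # Below the pre-determined minimum: add fuel automatically,
        # the more so the further the speed has fallen.
        boost = idle_gain * (min_rpm - speed_rpm)
        return min(1.0, hand_setting + boost)
    # Between the two limits the engine is under direct hand or foot control.
    return hand_setting
```

With a rule of this kind the rising delivery characteristic of the pump can no longer make the engine run away or stall at a fixed setting, since the correction always acts against the change of speed outside the working range.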

Some years later, but quite independently, the Bosch Company of Stuttgart 
developed and, with prophetic foresight, went boldly into large scale production 
of a small high speed fuel injection pump employing the same principle, but of 
much neater and more compact form. Their pump, which employed also an almost 
identical governor mechanism, had only very short sealing surfaces and therefore 
required a far higher standard of workmanship and material to prevent leakage, 
but to this, they were accustomed. They certainly made a superb job of it and their 
injection pump, like their magneto before it, became a standard piece of equipment 
for all high speed Diesel engines. Had it been available five or six years earlier, it 
would have saved us a lot of time and trouble. 

During this preparatory period, we had been experimenting with and trying to 
develop the single sleeve valve engine for high power aircraft use and had built 
several single cylinder experimental units which, as spark ignition engines, had 
given an excellent performance on petrol. We decided, therefore, to carry out our 
first running trials on the most robust of these units, which had already been 
calibrated very fully. The use of the sleeve valve was particularly attractive for 
our immediate purpose for several reasons, chief of which was that, since the 
cylinder head was unencumbered with valves, we had complete freedom of manoeuvre 
as to the shape or form of combustion chamber. 

Yet another important advantage, from an experimental point of view, was that 
with the admission of air to the cylinder through circumferential ports, we could, 
by means of external guide vanes, control the direction of entry and subsequent 
movement of the air within the cylinder. We had, in fact, already investigated this 
very fully during our previous research on petrol and found that we could, at will, 
obtain any condition ranging from indiscriminate turbulence to orderly rotational swirl. 

For our first trial, we used the pump we had developed combined with an injector 
nozzle giving a very finely atomized cone shaped spray in a combustion chamber 
of conical form, and a compression ratio of 15:1, for our theories and in fact, all 
theories as to the process of combustion, pointed to the necessity of dividing the 
liquid fuel up into the smallest possible droplets, if we were to hope to get 
combustion completed in the very short time available. The engine started beautifully 
and ran very smoothly and steadily up to quite a high speed, but developed 
practically no power at all, in fact, only just enough to run itself round. Any 
attempt to increase the power output by delivering more fuel, resulted merely in the 
emission of dense clouds of black smoke from the exhaust and the deposition of 
masses of carbon in the cylinder. 

Investigation revealed that, with very small fuel deliveries, combustion was both 
rapid and complete but that, at best, only about 15 to 20 % of the available 



Bakerian Lecture 




oxygen was combined; if more fuel was injected, all that happened was that the 
excess could not find the necessary oxygen. We concluded, therefore, that we were 
suffering from lack of penetration and that the spray was concentrated into a small 
local zone near the apex of the combustion chamber. We rang all the changes of air 
movement with but little effect and we varied the injection pressure and the rate of 
injection, also with very little effect, but, do what we would, we could not burn 
more than about 20 % of the available oxygen. We therefore decided to abandon 
the pulverizing type of injector and substitute one of the watering-can type; here 
we were faced with the difficulty that, in order to maintain the pressure needed for 
high penetration at low speeds, the holes, in a small engine, would have to be both 
few and very small. The smallest hole we could drill was about 0.2 mm. and the 
greatest number of this size we could use was about 6 or 8; these we spaced 
out at various angles designed to give as uniform a distribution of the fuel jets 
as possible. 

With this new injector we at once obtained better results. We were able to burn 
twice as much fuel per cycle and to consume about 40 % of the available oxygen 
with at least a reasonably clean exhaust and that at a very high thermal efficiency. 
We found also, that the introduction of a moderate degree of orderly rotational air 
swirl was very effective in bringing more air into contact with the fuel jets, and so 
increasing the quantity of oxygen consumed. This was encouraging but still a long 
way off practical politics, for, to compete in power with the petrol engine, which 
can and does consume the whole of its oxygen, we, even with our much higher 
thermal efficiency, should have to be able to consume at least 75 %, and we were 
a long way off that. Also, we had incessant trouble with choking of the very small 
jets by minute particles of metal or scale, which somehow found their way through 
the best filters we could provide, and once a 0.2 mm. hole was blocked under 
a pressure of about 400 atmospheres, it was virtually impossible to clear it. 

Although the change from a finely pulverizing to a multiple jet injector must 
have increased enormously the droplet size, yet there was no indication that we 
were suffering from unduly prolonged combustion even at speeds up to 2000 r.p.m. 

These experiments, unpromising as they appeared, taught us much, for we had 
learned that rate of burning of the fuel droplets was not a limiting factor, that large 
variations in the coarseness of the spray particles had surprisingly little effect, and 
in short, that the more urgent problem was how to bring the fuel and air into 
contact with each other in the short time available. 

We decided, therefore, as a next step, to go right to the other extreme, to employ 
a single hole injector of the plain fire hose type giving the maximum possible 
penetration quite regardless of pulverization. We had started with a scent spray, 
had done better with a watering can; the next step was to try a fire hose. At the 
same time it was, of course, obvious that to bring the air into contact with a single 
column of liquid, we should require a controlled but very intense air movement; 
the ideal being that the whole of the air within the combustion chamber should be 
made to sweep once past the jet of fuel during the injection period. 





We therefore constructed a new combustion chamber of cylindrical form, the 
diameter being about half that of the piston and the height rather less, and we 
arranged our air entry vanes to give a rotational swirl about the axis of the cylinder. 
We also brought our piston into very close contact with the flat annular portion of 
the cylinder head in order both to reduce as far as possible the volume external to 
the main combustion chamber and at the same time to produce, by ‘squish’, 
a forced vortex in the chamber. In the top and to one side of this cylindrical 
chamber and at about the radius of gyration, we fitted our single hole injector 
squirting straight downwards (see figure 5). The result was an immediate and 
striking improvement; at once we obtained nearly double the power we had reached 
before and we were able to run to the maximum speed of which the engine was 
capable. Indicator diagrams of the pressure changes in the cylinder showed that 
combustion was extremely rapid and this was confirmed by the observation that 
the indicated thermal efficiency actually increased with increase of speed up to the 
highest speed of which the engine was capable—about 2300 r.p.m. 



Figure 5 


Next we found that by bringing the injector nearer and nearer to the side of the 
combustion chamber, we gained progressively more power, until a critical point was 
reached when the spray actually impinged against the side wall. 

We had, at last, achieved some really encouraging results and felt that the next 
step was to explore more thoroughly the air movement in the combustion chamber. 
To this end we developed a small anemometer which could be fitted inside the 
combustion chamber to record, at least in relative terms, the speed of rotation of the 
air within the chamber. This, of course, could be used only when the engine was 
motored without any fuel. 









We found also that by painting the inside of the combustion chamber and the 
crown of the piston with a white enamel paint, allowing it nearly to dry and then 
running the engine for a few seconds under power, we could get clear-cut markings 
both of the air movement, of the impingement, if any, of the fuel jet against the 
crown of the piston and also some indication of the temperature distribution. Thus, 
we learned that even with a plain fire hose jet and a fuel injection pressure of about 
300 atmospheres, the penetration in an air density of 15 atmospheres was only 
just sufficient to reach the piston crown, a matter of less than 2 in. It appeared 
also that no matter how intense the air swirl, the main core of the jet was but little 
deflected while the whiskers of spray were carried inwards towards the centre of the 
chamber. Our anemometer now proved most valuable, in that it enabled us to 
calibrate our external guide vanes in terms of air swirl, which latter we defined as 
the ratio of air to crankshaft revolutions. It proved also that for any given setting, 
this ratio remained practically constant over the whole speed range. 



Figure 6. Ricardo swirl meter. 


Figure 7 shows the striking effect on power and efficiency made by varying the air 
swirl and it will be seen that the optimum performance was obtained when the 
ratio was between 9 and 10:1, that is to say, when the anemometer within the 
combustion chamber made nine revolutions during one of the crankshaft. The true 
air speed must, of course, have been considerably higher than this, owing to the 
friction, inertia and slip of the anemometer. 
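
The swirl ratio defined above lends itself to a short worked example. The sketch below, with hypothetical function names, converts a swirl ratio into the rotational speed of the air and into the crank angle occupied by one complete revolution of the air. It assumes that the air rotates as a solid body and that the anemometer follows it exactly, which, as just noted, understates the true air speed. The crank speed of 2000 r.p.m. is taken only as a representative figure.

```python
def air_rpm(crank_rpm, swirl_ratio):
    """Rotational speed of the air for a given crankshaft speed."""
    return crank_rpm * swirl_ratio


def crank_degrees_per_air_revolution(swirl_ratio):
    """Crank angle (degrees) swept while the air makes one revolution."""
    return 360.0 / swirl_ratio


for ratio in (9.0, 10.0):
    print(ratio,
          air_rpm(2000.0, ratio),
          crank_degrees_per_air_revolution(ratio))
```

On these assumptions a ratio of 9 to 10:1 means that the air makes one revolution in 36 to 40 degrees of crank angle, which is the period over which the whole charge would sweep once past a single jet.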

Under the optimum conditions as to swirl, we were able to combine over 70 % of 
the available oxygen and so, with our higher thermal efficiency, to develop about 
85 % of the power obtained from the same engine on petrol, and that with an 
indicated thermal efficiency of approximately 52 %, quite an encouraging performance. 
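
The relation between oxygen consumed and power can be put into rough figures. The sketch below assumes a deliberately crude model that is not stated in the lecture: that both engines draw the same air charge, that the petrol engine consumes the whole of its oxygen, and that power is proportional to the oxygen consumed multiplied by the thermal efficiency. The function names are hypothetical.

```python
def relative_power(oxygen_fraction, efficiency_ratio):
    """Diesel power as a fraction of petrol power from the same engine."""
    return oxygen_fraction * efficiency_ratio


def implied_efficiency_ratio(oxygen_fraction, power_fraction):
    """Efficiency advantage implied by an observed oxygen and power fraction."""
    return power_fraction / oxygen_fraction


# Figures quoted in the text: 70 % of the oxygen gave 85 % of the petrol power.
ratio = implied_efficiency_ratio(0.70, 0.85)   # about 1.21
oxygen_for_parity = 1.0 / ratio                # about 0.82
print(round(ratio, 3), round(oxygen_for_parity, 3))
```

Under this model the quoted figures imply an efficiency advantage of about 1.21 to 1, so that equal power with petrol would call for rather more than 80 % of the oxygen to be consumed.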







We had now satisfied ourselves on several important points, viz. that engine 
speed need not be limited by rate of burning, that penetration of the fuel jet was 
of far more importance than fine pulverization, and lastly, that it was far easier and 
more effective to bring the air to meet the fuel than the fuel to meet the air. I had 
become wedded, too, to the practical advantages both of a single hole orifice with 
its freedom from choking, and to the use of an intense air swirl not only as a means 
of bringing the air into contact with the fuel stream, but also for the no less 
important function of wiping the dew drop from the nose of a dribbling injector; 
it was clear, however, that so long as we stuck to the single hole injector, we should 
have to employ a very intense air swirl. This was easy enough to obtain in a sleeve 
valve engine but not so easy in a poppet valve. 



Figure 7. (Axis labels: air r.p.m.; crank r.p.m.) 

As I have said earlier, our ultimate goal was the development of an engine for 
road transport, but for various reasons, we did not consider the sleeve valve suitable 
for this service and the one form of combustion system we had developed so far was 
applicable only to a sleeve valve engine. I had, however, pinned my faith to the 
sleeve valve where high sustained power was required, as in the case of aircraft and 
heavy duty industrial engines. It was decided, therefore, to proceed with the sleeve 
valve Diesel both as an aero-engine and as a heavy duty industrial engine and at 
the same time to try and evolve from the lessons we had learned, a poppet valve 






version for road transport. This latter looked like being a far more difficult problem 
for, with the very high ratio of compression and consequent small clearance space, 
the only possible position for the valves was vertically in the cylinder head, and, 
with this geographical layout, there was no hope of getting adequate air swirl during 
the air admission stroke. The obvious alternative, therefore, was to try and create 
an intense swirl during the compression stroke, by compressing the air into a 
separate chamber communicating by a tangential passage and to do this with the 
minimum possible loss of heat during transfer in or out. For road service, there was 
also another very important factor to be taken into account, namely, the smell of the 
exhaust. With our sleeve valve engines, we had succeeded in getting a completely 
smokeless and invisible exhaust, but it had a very pungent smell due mainly to the 
presence of aldehydes. During our research on fuels, we had found from sampling 
tests, that aldehydes and other products of partial combustion, were stabilized when 
the flame impinged upon a relatively cool surface. With these considerations in 
mind, we decided to experiment with a separate spherical combustion chamber 
communicating with the cylinder via a tangential passage and, in order to reduce 
both heat losses and smell, we made the lower half of this chamber, containing the 
passageway, as a separate member, heat insulated from the rest of the cylinder. 
Thus it attained a very high temperature somewhere about midway between the 
two extremes of the cycle, while the temperature of its surface against which the 
flame impinged, was too high to permit of the stabilization of the products of partial 
combustion. 

For our earlier experiments in a poppet valve engine, we mounted the fuel 
injector in a spherical trunnion so arranged that we could site the spray in any 
direction within the sphere. 

This arrangement promised well right from the start. We spent a lot of time 
exploring the optimum area of passageway to give the swirl ratio we needed and in 
finding the best direction of the spray which proved to be not straight across the 
diameter but somewhat tangentially in a downstream injection. Once we had found 
the optimum conditions, we obtained an extremely good power output, and were 
able to combine up to 80 % of the oxygen content and so obtain a power output very 
nearly equal to that of the same engine on petrol. Compared with our sleeve valve 
engine results, we were getting about 10 to 15 % more power but, owing to the higher 
heat losses, our specific consumption was about 5 to 10 % greater, though still little 
more than half that of a contemporary petrol engine, when both were running 
under road service conditions. Figure 8 shows an experimental poppet valve 
engine with separate swirl chamber. 

We found also, as we had hoped, that the heat insulated member served another 
and very useful purpose in that its rise of temperature with increase of engine speed 
or load, by raising the compression temperature, compensated for the varying 
delay period of the start of burning and so allowed us to work at all speeds or 
loads with a fixed injection timing, a great advantage from a mechanical point 
of view. 






Figure 8. Cross section arrangement of 6 cylinder compression ignition 
engine 4£ in. bore, 5J in. stroke. 





By this time, we felt that we had gained sufficient experience to justify our 
issuing designs and working drawings of a sleeve valve version for uses other than 
road service, and also to attempt the design of a full scale Diesel aero-engine. 
Messrs Brotherhood were the first to manufacture and market a range of industrial 
engines to our designs followed by Messrs Mirrlees Bickerton and Day. These were 
of comparatively large size, viz. 60 h.p. per cylinder with numbers of cylinders 
ranging from one to eight and developing up to 400 h.p. These proved successful 
and became very popular, more especially for cases where the duty was very 
arduous, such, for example, as electric generating plants, pumping stations, and 
as auxiliary engines on board ship. 

Figure 9, plate 4 illustrates a six cylinder sleeve valve engine as made by 
Messrs Mirrlees Bickerton and Day designed for direct coupling to either a 
dynamo or centrifugal pump. 

Our next venture was the design and construction of a 300 h.p. Diesel aero-engine. 
In this, both Rolls Royce and the Royal Aircraft Establishment co-operated most 
generously; in fact, we built simultaneously, two identical sleeve valve aero-engines, 
one petrol and one Diesel. 

On test, the Diesel (see figure 10, plate 4) developed 330 h.p. at 2300 r.p.m. 
and the petrol version 370 h.p. while both engines weighed a fraction over 700 lb. 
The specific consumption of the Diesel was 0.41 lb. per b.h.p. hour and of the 
petrol 0*0 lb., a very favourable comparison, but about this time, vast improvements 
were taking place in the processing and octane number of aviation spirit. 
By taking immediate advantage of these we were able, by supercharging, to step 
up the power of the petrol version from 370 to very nearly 700 h.p. at a cost of 
only 50 lb. additional weight, but owing to the high gas pressures, our slender 
margin of safety did not permit of the Diesel version being supercharged and the 
comparison thus soon became an odious one. Since then, synthetic aviation spirit 
has further improved, thus permitting both of higher ratios of compression with 
correspondingly lower fuel consumption and the use of a still heavier supercharge, 
with the result that the Diesel has now been left far behind in the race for the air. 
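
The comparison can be made concrete by working out the weight carried per horse-power. The sketch below uses only the powers quoted above; the weights are rounded to 700 lb. for the unsupercharged engines ("a fraction over 700 lb.") and 750 lb. for the supercharged petrol version, and the names are illustrative.

```python
# (power in h.p., weight in lb.); weights are rounded from the text.
ENGINES = {
    "Diesel, unsupercharged": (330.0, 700.0),
    "petrol, unsupercharged": (370.0, 700.0),
    "petrol, supercharged": (700.0, 750.0),
}


def weight_per_hp(power_hp, weight_lb):
    """Specific weight of an engine in lb. per h.p."""
    return weight_lb / power_hp


for name, (power_hp, weight_lb) in ENGINES.items():
    print(f"{name}: {weight_per_hp(power_hp, weight_lb):.2f} lb. per h.p.")
```

On these rounded figures the Diesel carried about 2.1 lb. per h.p. against about 1.9 for the unsupercharged petrol engine, but against only about 1.1 once the petrol engine was supercharged.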

While our experimental Diesel aero-engine came to nothing as such, yet the effort 
was by no means wasted, for we learned a great deal from the attempt to make an 
ultra-light and, for that date, an ultra-high-speed Diesel engine, while the engine 
itself, though it never flew, later performed yeoman service in a racing car driven 
by Capt. Eyston and captured a number of world records. 

Although by this time we were getting a very satisfactory performance from the 
poppet valve version, we still had many mechanical problems to cope with before 
we could put it forward as a vehicle engine. 

When at last we did feel ready, its immediate success was rather unexpected 
and embarrassing. The new engines, of which figure 11, plate 4 shows the first 
prototype, did just double the mileage per gallon in London service and proved 
about 00 % better than the petrol engines on long hauls, and, perhaps more 
important still, their drivers were enthusiastic about their ease of handling and 





general behaviour, with the result that, in a very short time, we were besieged 
by nearly forty different manufacturing firms all wanting data and designs 
and our thoughts and hands for the next few years were completely occupied in 
getting out designs and in struggling, against time, to solve the outstanding 
mechanical problems. We did make some further improvements on the combustion 
side, which gave us a further increase in power and efficiency, but our main efforts 
were directed to locating and strengthening the weak links in the mechanical 
chain. 

Meanwhile, others, and notably Messrs Gardner, had, in friendly rivalry, 
taken up the same line of development and, starting with relatively stagnant air 
and multi-hole orifices in an open combustion chamber, had succeeded, by heroic 
efforts, in mastering the filtration problem to the extent that they no longer suffered 
seriously from choked jets. They, too, had found that, by introducing a modest 
degree of air swirl, which could be obtained by masking a portion of the inlet valve 
circumference, they could use fewer and larger orifices. By such means they were 
able to reduce the convection heat losses and so obtain a higher overall efficiency 
but at a considerable cost in power. Compared with the single jet swirl chamber 
type, these open chamber engines were able to show from 5 to 10 % lower fuel 
consumption but at a cost of from 15 to 20 % in power output, from the same size 
and weight of engine. To-day, broadly speaking, all high speed Diesel engines built 
in this country, are either of the swirl or open chamber type. We prefer the former 
when the accent is on power, and the latter when it is on fuel economy. 
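
The choice between the two types can be illustrated with the percentages quoted above. The sketch below assumes, as a simplification not made in the lecture, that power is proportional to the size and weight of the engine, so that a loss of power per unit size must be made good by a proportionately larger engine. The function name is hypothetical.

```python
def open_chamber_tradeoff(fuel_saving, power_loss):
    """Compare an open chamber engine with a swirl chamber engine of equal power.

    fuel_saving and power_loss are fractions (e.g. 0.05 for 5 %).
    Returns (size_ratio, fuel_ratio) relative to the swirl chamber engine.
    """
    size_ratio = 1.0 / (1.0 - power_loss)   # larger engine for the same power
    fuel_ratio = 1.0 - fuel_saving          # fuel used per unit of work done
    return size_ratio, fuel_ratio


for fuel_saving, power_loss in ((0.05, 0.15), (0.10, 0.20)):
    print(open_chamber_tradeoff(fuel_saving, power_loss))
```

On this assumption the open chamber engine must be some 18 to 25 % larger to do the same work, in return for using 5 to 10 % less fuel.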

So rapid was the turnover from petrol, that during the last six years before the 
1939-1945 War, over 20,000,000 h.p. of Diesel engines were placed on the road in 
this country alone. This changed completely the economic picture so far as the oil 
companies were concerned, for Diesel fuel became in shorter supply than petrol, 
and our efforts to-day are directed towards using the still heavier distillates and 
residue oils. 

Thus far, I have spoken only of our problems in connexion with the fuel admission 
and combustion which proved fairly simple and straightforward. Much more difficult 
were the purely mechanical and plumbing problems which grew in magnitude 
and diversity as our power output improved. 

Of these, the chief headaches were excessive cylinder wear, overheated pistons, 
cracking of cylinder heads, bearing failures, etc. 

The problem of cylinder wear in internal combustion engines had long been 
a baffling one, on account both of its inconsistency and of the peculiar form it took, 
namely, that of a deep groove in the bore at the point where the top piston ring 
came to rest at the top of its stroke. The popular belief that wear was caused by 
abrasion due to the introduction of dust, etc. with the entering air, just wouldn’t 
hold water. It would neither explain the extremely localized nature of the wear nor 
the fact that marine engines operating in a dust free atmosphere suffered at least as 
severely, nor would it explain the puzzling observation that single sleeve valves, 
though made of the same material, showed negligible wear. 





For long I had suspected that chemical corrosion was at least a contributory 
cause. Our earlier sampling tests from the combustion chamber of an engine had 
revealed the presence in and ahead of the actual flame front, of traces of various 
organic acids, such as formic acid and, though they appeared to be but the transient 
products of partial combustion, I was deeply suspicious of them. In an attempt to 
fit the observed facts, I formed the theory that, at the moment when the piston ring 
came to rest at the top of its stroke with the full gas pressure behind it, only 
boundary, as opposed to fluid, lubrication separated it from the cylinder wall and 
that, under these conditions, the protecting film of oil would be of no more than 
molecular thickness and might locally be broken down altogether; as the piston 
ring gathered way on its downward stroke, it would soon ride up again on a fluid 
film, but would leave behind it a narrow band of unprotected metal exposed to 
attack just at the moment when the concentration of partial products was at its 
maximum, and that the attack would be most virulent if the surface temperature 
was below the dew point of the products in question. This theory might explain the 
extremely localized nature of the wear and the absence of similar wear in the sleeve 
valve engine since, with the combined movement of the sleeve and piston, relative 
motion between the two never ceased and full fluid lubrication might be maintained 
at all times. We tried to obtain some positive evidence to support this theory by 
attempts to stop an engine suddenly in mid-career, whip off the cylinder head and 
find, as I hoped to find, a dry and perhaps rusty band, but, do what we would, we 
could not stop an engine quickly enough to prevent the evidence being wiped away 
by a subsequent stroke of the piston. Some time later, an accident produced just 
the evidence I was in search of. When testing a single cylinder engine, the base of 
the cylinder fractured suddenly, with the result that the whole cylinder flew off and 
landed on the concrete floor where it broke into several pieces. On examining these 
pieces within a couple of minutes of the failure, we found that the whole length of 
the liner was well lubricated with the exception of a narrow, dry and already rusty 
band at the point where the top piston ring came to rest. 

Meanwhile, acting on the corrosion theory, both the Automobile Research Association 
and ourselves experimented with corrosion resisting materials such as stainless 
steels and irons for the cylinder liner, but we found that none of these would mate 
with the piston rings to form a mutual bearing surface; we did, however, have 
some success with a high phosphorus cast iron which showed about half the rate 
of wear of a normal iron. Later, we found that certain stainless steels could be 
surface treated by nitrogen hardening to form a good bearing surface and these 
were almost immune from wear, but both the material and the treatment were very 
costly. We tried, of course, all manner of electro-deposited linings, but only chromium 
appeared at all hopeful; the snag with this was that the surface soon attained a very 
high polish upon which the oil would not spread, with the result that piston seizures 
were liable to occur, with disastrous consequences. Later, Van der Horst, a Dutch 
expert on electro-plating, suggested the possibility of a thick deposit of porous 
chromium obtained by very rapid deposition. This, when tried, proved very promising 





at first, but we found that after several hundred hours running, the surface was 
sometimes liable to acquire a high polish when piston seizure followed. Later, 
a new technique was evolved by Van der Horst which seems, so far, to be entirely 
satisfactory. This consists in first depositing a thick layer of chromium as before, 
machining this to its final dimension by grinding and honing, and then replacing 
the finished liner in the plating bath but with the polarity reversed, thus stripping 
from the finished surface a minute quantity of metal. This leaves the surface 
covered with very minute but deep pits which serve to give the oil film a 
foothold. 

I cannot pretend that even to-day the problem of cylinder wear is solved, but it 
has been reduced to tolerable proportions by fitting thermostats in the water 
circulation in order as quickly as possible to bring the temperature of the liner 
above the dew point, and by the use of high phosphorus cast iron liners which are 
cheap and easy to renew or regrind, or by the use, when the price will run to it, of 
either nitrided stainless steel or Van der Horst chromium plated liners, both of 
which are expensive initially, but almost everlasting. 

From my earliest days I had been taught that owing both to its higher thermal 
efficiency and lower mean cycle temperature, the flow of heat to the pistons, 
cylinders and exhaust valves in a Diesel engine was much less than in a petrol 
engine. It was a bitter disappointment therefore to find that while the exhaust 
valves and cylinder barrels were no longer a source of worry, the pistons and cylinder 
heads, on the other hand, ran very much hotter, and we at once ran headlong into 
all the troubles such as gummed up piston rings, cracked or seized pistons and 
cracked cylinder heads, which we associate with overheating—indeed, it was rather 
a shock to find that the piston of a Diesel engine reached the same temperature as 
that of a petrol engine when it was developing only about half the power. 

Although the mean temperature of combustion is lower and the mean cycle 
temperature very much lower, yet in the Diesel the actual temperature is both 
very high and very localized, while the rate of heat transfer to the piston and cylinder 
head, due both to the high density and intense convection is much greater than in 
a petrol engine. 

The mechanism of heat transfer from the crown of the piston to the cylinder 
walls is still a little obscure, but all temperature gradient measurements indicate 
that the bulk of the heat is transmitted via the piston rings. In large Diesel engines 
the pistons are hollow and water or oil cooled as a matter of course, but in a small 
high speed engine the problem of liquid cooling the pistons is a very formidable 
one. The only alternative was to increase the heat path by thickening greatly the 
aluminium in both the crown and the upper part of the skirt and by adding more 
piston rings merely for purposes of heat transfer. By so doing, we increased 
substantially both the dynamic forces due to the inertia of pistons and the internal 
friction. To cope with the dynamic forces, as also with the somewhat higher gas 
pressures, we had both to increase the rigidity of the engine structure and to 
enlarge and stiffen all the bearings. 





Other tiresome difficulties have been caused by cracking of cylinder heads due to 
localized and unequal heat flow. By plotting temperature gradients through the 
metal, we were able to identify the zones of intense heat flow and, in most cases, to 
cope with them either by appropriate distribution of the metal or by directing the 
flow of the cooling circulation, or both. 

Again, crankshaft torsional vibration proved very troublesome. This worry has 
been with us always, but was very much more acute in the Diesel engine, due to the 
high compression and to the heavier pistons. The well-known Lanchester damper 
will cope with most cases of torsional vibration in the petrol engine, but will not 
stand up to the Diesel; we had, therefore, to employ much larger diameter and 
stiffer crankshafts and, in some cases, to resort to the use of pendulum dampers 
and other methods of de-tuning. 

Yet again, owing to the large diameter crankshaft bearings we were compelled 
to use, the rubbing velocity became so high and the heat generated by friction so 
great, that the ordinary anti-friction linings broke down. To combat this, we had to 
increase greatly the flow of cooling oil through the bearings but this, in turn, led us 
into difficulty with excessive oil passing the pistons. We came to the conclusion, 
therefore, that since only a small amount of oil is required for lubrication as such, 
and a very large quantity for cooling, it would be better to separate the two 
functions and to cool the crankshaft internally by flushing it through from end to 
end with a copious flow of oil. This proved very easy and simple to apply in 
practice (see figure 12). 



Figure 12. Lubricating arrangement of 6 cylinder compression ignition engine. 


All these and many other mechanical problems proved far more elusive and 
difficult than any connected with the combustion system; moreover, they were not 
so soon apparent, but reared their ugly heads progressively as the combustion 
system and therefore the power output improved. To mechanical problems such 















as these, there is never any end for, so soon as they have been mastered, a new set 
of conditions arises, a higher speed, or a lower weight is demanded and a new crop 
at once appears. 

We had never been under any illusion that a petrol engine could satisfactorily be 
converted into a Diesel by merely changing over a few parts, except for purely 
experimental purposes; in fact, when fully developed, they share scarcely one 
single part in common. 

Despite these and many other mechanical difficulties, the high speed Diesel 
engine of to-day is at least as reliable as the petrol engine, and in some respects 
much more so, and is now in almost universal use for all forms of heavy road 
transport both here and on the Continent. 

I have tried, as briefly as possible, to describe a piece of research and development 
work which has extended over nearly 30 years, but have only been able to deal with 
the more prominent features of it. The combustion problems, which looked so 
formidable at the start, presented really surprisingly little difficulty and, once 
overcome, they did not recur. Far otherwise was it with the mechanical problems 
which cropped up anew at every turn, while the solution of one problem frequently 
gave birth to another. 

In conclusion, I would like to emphasize once again how much we owed to the 
valuable advice and generous help which was always forthcoming from all quarters, 
in our hours of need. In particular, we owe a deep debt of gratitude to the metal¬ 
lurgical industry in general, who not only gave us invaluable advice, but went to 
endless trouble to produce for us the materials we needed. 








Three-dimensional stress systems in isotropic plates. II 

By A. E. Green and T. J. Willmore 

(Communicated by G. R. Goldsbrough, F.R.S.; received 9 September 1947) 

The problem of an isolated force uniformly distributed through a plate of finite thickness 
and acting parallel to the faces of the plate, which are free from applied stress, is solved in 
integral form. Numerical work is carried out for the average values of the stresses in order to 
compare the results with those found from the approximate theory of generalized plane 
stress. In addition, numerical values of the stresses are found on the faces of the plate and on 
the plane midway between the faces. 

1. Introduction 

It was pointed out in a previous paper (Green 1948 ), referred to subsequently as 
paper I, that the accuracy of the usually accepted theory of generalized plane stress 
in a plane plate can only be tested satisfactorily by comparing the results obtained 
from the theory with those found from three-dimensional solutions of the elastic 
equations when no simplifying physical assumptions are made, such solutions having 
also, of course, considerable interest in themselves. The problem solved in paper I 
was one of stress concentration at a circular cylindrical hole in a plate under uniform 
tension. In the present paper the problem discussed is that of an isolated force 
uniformly distributed through a plate of finite thickness and acting parallel to the 
faces of the plate which are free from applied stress. This problem has only limited 
physical interest, but the mathematical analysis is much simpler than for the stress 
concentration problem and a fairly comprehensive picture of the stress distribution 
can be obtained with relatively little numerical work. 


2. Fundamental equations 

In rectangular co-ordinates the equations of equilibrium of a homogeneous 
isotropic elastic solid, when body forces are absent, are 


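$$
\begin{aligned}
\frac{\partial \widehat{xx}}{\partial x} + \frac{\partial \widehat{xy}}{\partial y} + \frac{\partial \widehat{xz}}{\partial z} &= 0, \\
\frac{\partial \widehat{xy}}{\partial x} + \frac{\partial \widehat{yy}}{\partial y} + \frac{\partial \widehat{yz}}{\partial z} &= 0, \\
\frac{\partial \widehat{xz}}{\partial x} + \frac{\partial \widehat{yz}}{\partial y} + \frac{\partial \widehat{zz}}{\partial z} &= 0.
\end{aligned}
\tag{2·1}
$$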


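The components of stress $\widehat{xx}, \ldots, \widehat{yz}$ are given in terms of the displacements 
$(u_x, u_y, u_z)$ by the equations 

$$
\begin{aligned}
\widehat{xx} &= \lambda\Delta + 2\mu e_{xx}, & \widehat{yz} &= \mu e_{yz}, \\
\widehat{yy} &= \lambda\Delta + 2\mu e_{yy}, & \widehat{zx} &= \mu e_{zx}, \\
\widehat{zz} &= \lambda\Delta + 2\mu e_{zz}, & \widehat{xy} &= \mu e_{xy},
\end{aligned}
\tag{2·2}
$$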



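where 

$$
\Delta = \frac{\partial u_x}{\partial x} + \frac{\partial u_y}{\partial y} + \frac{\partial u_z}{\partial z}, \tag{2·3}
$$

and the strain components $e_{xx}$, $e_{yz}$ are given by equations of the type 

$$
e_{xx} = \frac{\partial u_x}{\partial x}, \qquad e_{yz} = \frac{\partial u_z}{\partial y} + \frac{\partial u_y}{\partial z}. \tag{2·4}
$$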

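If the expressions (2·2) are substituted in (2·1) it is found that the displacements 
satisfy the equations 

$$
\mu\nabla^2 (u_x, u_y, u_z) + (\lambda + \mu)\left(\frac{\partial}{\partial x}, \frac{\partial}{\partial y}, \frac{\partial}{\partial z}\right)\Delta = 0, \qquad \nabla^2\Delta = 0. \tag{2·5}
$$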

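In discussing stress distributions in plane plates Dougall (1904) used certain types 
of solutions of (2·5) of which some use will be made here. A possible set of 
displacements which satisfy (2·5) is 

$$
\begin{aligned}
u_x &= \frac{\partial\omega}{\partial x} + \alpha\frac{\partial\phi}{\partial x} + 2z\frac{\partial^2\phi}{\partial x\,\partial z}, \\
u_y &= \frac{\partial\omega}{\partial y} + \alpha\frac{\partial\phi}{\partial y} + 2z\frac{\partial^2\phi}{\partial y\,\partial z}, \\
u_z &= \frac{\partial\omega}{\partial z} - \alpha\frac{\partial\phi}{\partial z} + 2z\frac{\partial^2\phi}{\partial z^2}, \\
\Delta &= 2(1-\alpha)\frac{\partial^2\phi}{\partial z^2},
\end{aligned}
\tag{2·6}
$$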

where $\omega$, $\phi$ are harmonic functions, i.e. they satisfy the equation 

$$
\nabla^2\omega = 0, \qquad \nabla^2\phi = 0. \tag{2·7}
$$

The constant $\alpha$ in equation (2·6) is defined by the relations 

$$
\alpha = (\lambda + 3\mu)/(\lambda + \mu) = 3 - 4\eta, \tag{2·8}
$$

where $\eta$ is Poisson’s ratio. 
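
The second equality in (2·8) can be checked numerically. The following is a minimal sketch only: the function names and the use of Young's modulus as an input are assumptions of this example, and the standard isotropic relations between the Lamé constants, Young's modulus and Poisson's ratio are taken as known.

```python
def lame_parameters(young_modulus, poisson_ratio):
    """Return (lambda, mu) for an isotropic solid (standard relations)."""
    lam = young_modulus * poisson_ratio / (
        (1.0 + poisson_ratio) * (1.0 - 2.0 * poisson_ratio)
    )
    mu = young_modulus / (2.0 * (1.0 + poisson_ratio))
    return lam, mu


def alpha_from_lame(lam, mu):
    """First form of (2.8): alpha = (lambda + 3 mu) / (lambda + mu)."""
    return (lam + 3.0 * mu) / (lam + mu)


def alpha_from_poisson(eta):
    """Second form of (2.8): alpha = 3 - 4 eta."""
    return 3.0 - 4.0 * eta
```

The incompressible limit (Poisson's ratio 0·5) is excluded from the first form because the constant $\lambda$ is then infinite; the second form still gives $\alpha = 1$.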

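In the present paper it is more convenient to use cylindrical polar co-ordinates 
$(r, \theta, z)$, and the displacements $(u_r, u_\theta, u_z)$ can be found from (2·6). Thus 

$$
\begin{aligned}
u_r &= \frac{\partial\omega}{\partial r} + \alpha\frac{\partial\phi}{\partial r} + 2z\frac{\partial^2\phi}{\partial r\,\partial z}, \\
u_\theta &= \frac{1}{r}\frac{\partial\omega}{\partial\theta} + \frac{\alpha}{r}\frac{\partial\phi}{\partial\theta} + \frac{2z}{r}\frac{\partial^2\phi}{\partial\theta\,\partial z}.
\end{aligned}
\tag{2·9}
$$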

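In terms of $(u_r, u_\theta, u_z)$ the strains are 

$$
\begin{aligned}
e_{rr} &= \frac{\partial u_r}{\partial r}, &
e_{\theta\theta} &= \frac{1}{r}\frac{\partial u_\theta}{\partial\theta} + \frac{u_r}{r}, &
e_{zz} &= \frac{\partial u_z}{\partial z}, \\
e_{\theta z} &= \frac{1}{r}\frac{\partial u_z}{\partial\theta} + \frac{\partial u_\theta}{\partial z}, &
e_{zr} &= \frac{\partial u_r}{\partial z} + \frac{\partial u_z}{\partial r}, &
e_{r\theta} &= \frac{\partial u_\theta}{\partial r} - \frac{u_\theta}{r} + \frac{1}{r}\frac{\partial u_r}{\partial\theta},
\end{aligned}
\tag{2·10}
$$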



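and the stresses are related to the strains by the equations 

$$
\begin{aligned}
\widehat{rr} &= \lambda\Delta + 2\mu e_{rr}, & \widehat{\theta z} &= \mu e_{\theta z}, \\
\widehat{\theta\theta} &= \lambda\Delta + 2\mu e_{\theta\theta}, & \widehat{zr} &= \mu e_{zr}, \\
\widehat{zz} &= \lambda\Delta + 2\mu e_{zz}, & \widehat{r\theta} &= \mu e_{r\theta}.
\end{aligned}
\tag{2·11}
$$

From (2·6), (2·9), (2·10) and (2·11) it is found that 

$$
\begin{aligned}
\frac{\widehat{rr}}{2\mu} &= \frac{\partial^2\omega}{\partial r^2} + \alpha\frac{\partial^2\phi}{\partial r^2} + 2z\frac{\partial^3\phi}{\partial z\,\partial r^2} + (\alpha - 3)\frac{\partial^2\phi}{\partial z^2}, \\
\frac{\widehat{\theta\theta}}{2\mu} &= \frac{1}{r}\frac{\partial\omega}{\partial r} + \frac{1}{r^2}\frac{\partial^2\omega}{\partial\theta^2} + \frac{\alpha}{r}\frac{\partial\phi}{\partial r} + \frac{\alpha}{r^2}\frac{\partial^2\phi}{\partial\theta^2} + \frac{2z}{r}\frac{\partial^2\phi}{\partial z\,\partial r} + \frac{2z}{r^2}\frac{\partial^3\phi}{\partial z\,\partial\theta^2} + (\alpha - 3)\frac{\partial^2\phi}{\partial z^2}, \\
\frac{\widehat{zz}}{2\mu} &= \frac{\partial^2\omega}{\partial z^2} - \frac{\partial^2\phi}{\partial z^2} + 2z\frac{\partial^3\phi}{\partial z^3}, \\
\frac{\widehat{\theta z}}{2\mu} &= \frac{1}{r}\frac{\partial^2\omega}{\partial\theta\,\partial z} + \frac{1}{r}\frac{\partial^2\phi}{\partial\theta\,\partial z} + \frac{2z}{r}\frac{\partial^3\phi}{\partial\theta\,\partial z^2}, \\
\frac{\widehat{zr}}{2\mu} &= \frac{\partial^2\omega}{\partial r\,\partial z} + \frac{\partial^2\phi}{\partial r\,\partial z} + 2z\frac{\partial^3\phi}{\partial r\,\partial z^2}, \\
\frac{\widehat{r\theta}}{2\mu} &= \frac{1}{r}\frac{\partial^2\omega}{\partial r\,\partial\theta} - \frac{1}{r^2}\frac{\partial\omega}{\partial\theta} - \frac{2z}{r^2}\frac{\partial^2\phi}{\partial z\,\partial\theta} + \frac{2z}{r}\frac{\partial^3\phi}{\partial z\,\partial r\,\partial\theta} + \frac{\alpha}{r}\frac{\partial^2\phi}{\partial r\,\partial\theta} - \frac{\alpha}{r^2}\frac{\partial\phi}{\partial\theta}.
\end{aligned}
\tag{2·12}
$$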


3. The isolated force problem 


Consider now a plate of infinite extent and of uniform thickness $2h$, bounded by 
the planes $z = \pm h$. An isolated force of intensity $P$ per unit length is directed along 
the axis of $x$ and is situated along the $z$ axis from $z = -h$ to $z = +h$. 

If h is very large the problem is the ordinary one of plane strain and the solution 
is well known (Love 1927 , p. 209). The displacements in this case are* 


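$$
u_r = -\frac{P(3 - 4\eta)}{8\pi\mu(1-\eta)}\log r\,\cos\theta, \qquad
u_\theta = \frac{P}{8\pi\mu(1-\eta)}\{(3 - 4\eta)\log r + 1\}\sin\theta, \qquad
u_z = 0, \tag{3·1}
$$

and the stresses are 

$$
\begin{aligned}
\widehat{rr} &= -\frac{P}{4\pi}\,\frac{(3 - 2\eta)\cos\theta}{(1-\eta)r}, &
\widehat{zz} &= -\frac{P\eta\cos\theta}{2\pi(1-\eta)r}, \\
\widehat{\theta\theta} &= \frac{P}{4\pi}\,\frac{(1 - 2\eta)\cos\theta}{(1-\eta)r}, &
\widehat{r\theta} &= \frac{P}{4\pi}\,\frac{(1 - 2\eta)\sin\theta}{(1-\eta)r},
\end{aligned}
\tag{3·2}
$$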


with $\widehat{rz}$ and $\widehat{\theta z}$ both zero. The corresponding ‘generalized plane stress’ solution for 
the average values of the stresses is obtained from the plane strain solution by 
replacing $\eta$ by $\sigma = \eta/(1+\eta)$ with the exception that the stress $\widehat{zz}$ is omitted. 
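
The substitution rule just stated can be illustrated with a short sketch. It assumes the plane-strain stresses (3·2) are read as $2\pi r\,\widehat{rr}/(P\cos\theta) = -(3-2\eta)/\{2(1-\eta)\}$, $2\pi r\,\widehat{\theta\theta}/(P\cos\theta) = (1-2\eta)/\{2(1-\eta)\}$, $2\pi r\,\widehat{zz}/(P\cos\theta) = -\eta/(1-\eta)$ and $2\pi r\,\widehat{r\theta}/(P\sin\theta) = (1-2\eta)/\{2(1-\eta)\}$; the function and key names are hypothetical and chosen for this example only.

```python
def plane_strain_coefficients(eta):
    """Dimensionless plane-strain stresses, i.e. 2*pi*r*stress/(P cos(theta))
    for rr, tt (theta-theta), zz and 2*pi*r*stress/(P sin(theta)) for rt."""
    denom = 2.0 * (1.0 - eta)
    return {
        "rr": -(3.0 - 2.0 * eta) / denom,
        "tt": (1.0 - 2.0 * eta) / denom,
        "zz": -2.0 * eta / denom,
        "rt": (1.0 - 2.0 * eta) / denom,
    }


def generalized_plane_stress_coefficients(eta):
    """Replace eta by sigma = eta/(1 + eta) and omit the stress zz."""
    sigma = eta / (1.0 + eta)
    coefficients = plane_strain_coefficients(sigma)
    coefficients["zz"] = 0.0
    return coefficients
```

For $\eta$ = 0·25 the two functions give the limiting columns $\rho = 0$ and $\rho = \infty$ of the table of average stresses, under the reading of (3·2) assumed above.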

* These differ by a rigid body displacement from those given by Love. 




4. It is convenient to introduce non-dimensional co-ordinates $(\rho, \zeta)$ defined by 

$$
\rho = r/h, \qquad \zeta = z/h. \tag{4·1}
$$

When the plate is of finite thickness $2h$ the boundary conditions to be satisfied on 
the faces $\zeta = \pm 1$ of the plate are* 

$$
\widehat{zz} = \widehat{rz} = \widehat{\theta z} = 0, \tag{4·2}
$$


and the plane strain solution does not give a zero value of the normal stress $\widehat{zz}$ on 
the faces. It is therefore necessary to find a suitable stress system which has no 
singularities in the plate, and which cancels the stresses $\widehat{zz}$ on the faces $\zeta = \pm 1$ 
without introducing shear stresses there. The additional stress system must also 
vanish at infinity. 

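It is therefore necessary to find functions $\omega$, $\phi$ which are even in the co-ordinate $\zeta$, 
and are such that when $\zeta = \pm 1$ 

$$
\frac{\partial\omega}{\partial z} + \frac{\partial\phi}{\partial z} + 2z\frac{\partial^2\phi}{\partial z^2} = 0, \qquad
\frac{\partial^2\omega}{\partial z^2} - \frac{\partial^2\phi}{\partial z^2} + 2z\frac{\partial^3\phi}{\partial z^3} = \frac{P\eta\cos\theta}{4\pi\mu(1-\eta)r}. \tag{4·3}
$$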


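The first of these conditions ensures that $\widehat{rz} = \widehat{\theta z} = 0$ at $\zeta = \pm 1$, and the second 
condition cancels the stress $\widehat{zz}$ on these faces. Suitable forms for $\omega$, $\phi$ are 

$$
\omega = \cos\theta\int_0^\infty f(u)\,J_1(u\rho)\cosh u\zeta\,du, \qquad
\phi = \cos\theta\int_0^\infty g(u)\,J_1(u\rho)\cosh u\zeta\,du. \tag{4·4}
$$

The conditions (4·3) are now satisfied at $\zeta = \pm 1$ if 

$$
\begin{aligned}
f(u)\sinh u + g(u)(\sinh u + 2u\cosh u) &= 0, \\
\int_0^\infty \{f(u)\cosh u - g(u)(\cosh u - 2u\sinh u)\}\,u^2 J_1(u\rho)\,du &= \frac{P\eta h}{4\pi\mu\rho(1-\eta)}.
\end{aligned}
\tag{4·5}
$$

The second equation of (4·5) is satisfied if 

$$
f(u)\cosh u - g(u)(\cosh u - 2u\sinh u) = \frac{P\eta h}{4\pi\mu u^2(1-\eta)}, \tag{4·6}
$$

so that, solving for $f(u)$ and $g(u)$, 

$$
f(u) = \frac{P\eta h(\sinh u + 2u\cosh u)}{4\pi\mu(1-\eta)u^2\Sigma}, \qquad
g(u) = -\frac{P\eta h\sinh u}{4\pi\mu(1-\eta)u^2\Sigma}, \tag{4·7}
$$

where 

$$
\Sigma = 2u + \sinh 2u. \tag{4·8}
$$

* Except possibly at $\rho = 0$, $\zeta = \pm 1$. 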
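
As a check on the algebra, the expressions for $f(u)$ and $g(u)$ can be substituted back into the two conditions they are meant to satisfy. The sketch below is an illustration under stated assumptions: the constant `c` stands for $P\eta h/\{4\pi\mu(1-\eta)\}$, the conditions are read as $f\sinh u + g(\sinh u + 2u\cosh u) = 0$ and $f\cosh u - g(\cosh u - 2u\sinh u) = c/u^2$, and all function names are hypothetical.

```python
import math


def sigma(u):
    """Sigma = 2u + sinh 2u, as in (4.8)."""
    return 2.0 * u + math.sinh(2.0 * u)


def f_and_g(u, c=1.0):
    """Candidate solutions of (4.7); c stands for P*eta*h / (4*pi*mu*(1 - eta))."""
    s = sigma(u)
    f = c * (math.sinh(u) + 2.0 * u * math.cosh(u)) / (u * u * s)
    g = -c * math.sinh(u) / (u * u * s)
    return f, g


def residuals(u, c=1.0):
    """Residuals of the shear condition and of the normal-stress condition."""
    f, g = f_and_g(u, c)
    shear = f * math.sinh(u) + g * (math.sinh(u) + 2.0 * u * math.cosh(u))
    normal = (
        f * math.cosh(u)
        - g * (math.cosh(u) - 2.0 * u * math.sinh(u))
        - c / (u * u)
    )
    return shear, normal
```

Both residuals vanish to rounding error for any positive `u`, because the bracket multiplying $g$ reduces to $(2u + \sinh 2u)/\sinh u$.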



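Thus the required values for $\omega$ and $\phi$ are 

$$
\begin{aligned}
\omega &= \frac{P\eta h\cos\theta}{4\pi\mu(1-\eta)}\int_0^\infty \frac{(\sinh u + 2u\cosh u)\cosh u\zeta\,J_1(u\rho)}{u^2\Sigma}\,du, \\
\phi &= -\frac{P\eta h\cos\theta}{4\pi\mu(1-\eta)}\int_0^\infty \frac{\sinh u\cosh u\zeta\,J_1(u\rho)}{u^2\Sigma}\,du.
\end{aligned}
\tag{4·9}
$$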


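These integrals are not convergent at the lower limit, but the divergent terms 
(which represent a rigid body displacement) are trivial and may be removed by 
adding suitable terms which do not affect the boundary conditions (4·2). Thus 
satisfactory solutions for $\omega$ and $\phi$, which converge at the lower limit, are 

$$
\begin{aligned}
\omega &= \frac{P\eta h\cos\theta}{4\pi\mu(1-\eta)}\int_0^\infty \left\{\frac{(\sinh u + 2u\cosh u)\cosh u\zeta\,J_1(u\rho)}{u^2\Sigma} - \frac{3\rho e^{-u}}{8u}\right\}du, \\
\phi &= -\frac{P\eta h\cos\theta}{4\pi\mu(1-\eta)}\int_0^\infty \left\{\frac{\sinh u\cosh u\zeta\,J_1(u\rho)}{u^2\Sigma} - \frac{\rho e^{-u}}{8u}\right\}du.
\end{aligned}
\tag{4·10}
$$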


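5. By substituting the potential functions found above in equations (2·12), the 
stresses from the combined system which satisfies the boundary conditions (4·2) 
are found to be 

$$
\frac{2\pi r\,\widehat{rr}}{P\cos\theta} = -\frac{3 - 2\eta}{2(1-\eta)} + \frac{2\eta\rho}{1-\eta}\int_0^\infty \big[\{(2\eta - 1)\sinh u\cosh u\zeta + u\cosh u\cosh u\zeta - u\zeta\sinh u\sinh u\zeta\}\,J_1''(u\rho) + 2\eta\sinh u\cosh u\zeta\,J_1(u\rho)\big]\,\Sigma^{-1}\,du, \tag{5·1}
$$

$$
\frac{2\pi r\,\widehat{\theta\theta}}{P\cos\theta} = \frac{1 - 2\eta}{2(1-\eta)} - \frac{2\eta\rho}{1-\eta}\int_0^\infty \big[\{(2\eta - 1)\sinh u\cosh u\zeta + u\cosh u\cosh u\zeta - u\zeta\sinh u\sinh u\zeta\}\{J_1''(u\rho) + J_1(u\rho)\} - 2\eta\sinh u\cosh u\zeta\,J_1(u\rho)\big]\,\Sigma^{-1}\,du, \tag{5·2}
$$

$$
\frac{2\pi r\,\widehat{zz}}{P\cos\theta} = -\frac{\eta}{1-\eta} + \frac{2\eta\rho}{1-\eta}\int_0^\infty \{\sinh u\cosh u\zeta + u\cosh u\cosh u\zeta - u\zeta\sinh u\sinh u\zeta\}\,J_1(u\rho)\,\Sigma^{-1}\,du, \tag{5·3}
$$

$$
\frac{2\pi r\,\widehat{\theta z}}{P\sin\theta} = -\frac{2\eta}{1-\eta}\int_0^\infty \{\cosh u\sinh u\zeta - \zeta\sinh u\cosh u\zeta\}\,J_1(u\rho)\,\Sigma^{-1}\,du, \tag{5·4}
$$

$$
\frac{2\pi r\,\widehat{zr}}{P\cos\theta} = \frac{2\eta\rho}{1-\eta}\int_0^\infty \{u\cosh u\sinh u\zeta - u\zeta\sinh u\cosh u\zeta\}\,J_1'(u\rho)\,\Sigma^{-1}\,du, \tag{5·5}
$$

$$
\frac{2\pi r\,\widehat{r\theta}}{P\sin\theta} = \frac{1 - 2\eta}{2(1-\eta)} + \frac{2\eta\rho}{1-\eta}\int_0^\infty \{(2\eta - 1)\sinh u\cosh u\zeta + u\cosh u\cosh u\zeta - u\zeta\sinh u\sinh u\zeta\}\{J_1''(u\rho) + J_1(u\rho)\}\,\Sigma^{-1}\,du. \tag{5·6}
$$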


It will be seen that the parts of the potential functions which were added to secure 
convergence make no contribution to the stresses. 



6. Average values of the stresses 
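By integrating the stresses obtained in the previous section the following average 
values are found: 

$$
\begin{aligned}
\frac{2\pi r\,\widehat{rr}}{P\cos\theta} &= -\frac{3 - 2\eta}{2(1-\eta)} + \frac{\rho\eta^2}{1-\eta}\{G(\rho) + H(\rho)\}, \\
\frac{2\pi r\,\widehat{\theta\theta}}{P\cos\theta} &= \frac{1 - 2\eta}{2(1-\eta)} + \frac{\rho\eta^2}{1-\eta}\{3G(\rho) - H(\rho)\}, \\
\frac{2\pi r\,\widehat{zz}}{P\cos\theta} &= \frac{\eta}{1-\eta}\{4\rho G(\rho) - 1\}, \\
\frac{2\pi r\,\widehat{r\theta}}{P\sin\theta} &= \frac{1 - 2\eta}{2(1-\eta)} + \frac{\rho\eta^2}{1-\eta}\{G(\rho) + H(\rho)\},
\end{aligned}
\tag{6·1}
$$

where 

$$
G(\rho) = \int_0^\infty \frac{\sinh^2 u\,J_1(u\rho)}{u\Sigma}\,du, \qquad
H(\rho) = \int_0^\infty \frac{\sinh^2 u\,J_3(u\rho)}{u\Sigma}\,du. \tag{6·2}
$$

The values of $\widehat{zr}$ and $\widehat{\theta z}$ both vanish. With the help of the identity 

$$
\frac{\sinh^2 u}{\Sigma} = \frac{1}{2} - \frac{2u + 1 - e^{-2u}}{2\Sigma},
$$

the integrals (6·2) can be written in the form 

$$
G(\rho) = \frac{1}{2}\int_0^\infty \frac{J_1(u\rho)}{u}\,du - \frac{1}{2}\int_0^\infty \frac{(2u + 1 - e^{-2u})\,J_1(u\rho)}{u\Sigma}\,du, \tag{6·3}
$$

$$
H(\rho) = \frac{1}{2}\int_0^\infty \frac{J_3(u\rho)}{u}\,du - \frac{1}{2}\int_0^\infty \frac{(2u + 1 - e^{-2u})\,J_3(u\rho)}{u\Sigma}\,du. \tag{6·4}
$$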

The integrals not involving $\Sigma$ can be evaluated immediately, using the results 
(Watson 1944, p. 391) 

$$
\int_0^\infty \frac{J_1(u\rho)}{u}\,du = 1, \qquad \int_0^\infty \frac{J_3(u\rho)}{u}\,du = \frac{1}{3}, \tag{6·5}
$$


provided $\rho > 0$. The other integrals may be evaluated by expanding the Bessel 
functions and integrating term by term, making use of the known integrals 

$$
I_s = \int_0^\infty \frac{u^s}{\Sigma}\,du, \qquad J_s = \int_0^\infty \frac{u^s e^{-2u}}{\Sigma}\,du, \tag{6·6}
$$

which have been tabulated by Howland* (1930). When $s > 20$ it is sufficient to take 
the asymptotic value of $I_s$, which is $s!/2^s$, while the $J_s$ integrals can be neglected 
compared with $I_s$. In addition it is necessary to know the value of $I_0 - J_0$. This has 
been evaluated by numerical integration and is 0·39900. 
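
The quoted value of $I_0 - J_0$ and the asymptotic value of $I_s$ can be reproduced with a simple quadrature. The sketch below is illustrative and rests on an assumption about the definitions, namely $I_s = \int_0^\infty u^s\,\Sigma^{-1}\,du$ and $J_s = \int_0^\infty u^s e^{-2u}\,\Sigma^{-1}\,du$ with $\Sigma = 2u + \sinh 2u$; the midpoint rule, the truncation limits and the function names are choices made for this example and are not taken from Howland's tables.

```python
import math


def sigma(u):
    """Sigma = 2u + sinh 2u."""
    return 2.0 * u + math.sinh(2.0 * u)


def midpoint_integral(func, lower, upper, steps):
    """Composite midpoint rule; never evaluates func at the end points."""
    width = (upper - lower) / steps
    return width * sum(func(lower + (k + 0.5) * width) for k in range(steps))


def i0_minus_j0(upper=30.0, steps=30000):
    """I_0 - J_0 under the assumed definitions (integrand tends to 1/2 at u = 0)."""
    return midpoint_integral(
        lambda u: (1.0 - math.exp(-2.0 * u)) / sigma(u), 0.0, upper, steps
    )


def i_s(s, upper=80.0, steps=40000):
    """I_s under the assumed definition, truncated at a large upper limit."""
    return midpoint_integral(lambda u: u ** s / sigma(u), 0.0, upper, steps)
```

Under these assumptions `i0_minus_j0()` agrees with the figure 0·39900 to within the tolerance of the quadrature, and the ratio of `i_s(20)` to $20!/2^{20}$ is very close to unity.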

The resulting series are only convergent for $\rho < 2$. This range of values of $\rho$, 
however, is found to be sufficient. When $\rho$ is small the series are rapidly convergent, but 

* The values of / u and I u were corrected by Howland in a later paper (Howland & Stevenson 
1933). 






when $\rho$ is greater than 1 the convergence is slow and the series may best be summed 
by using an Euler transformation, since the terms of the series alternate in sign. 
The values found in this way are recorded in table 1. The series for the integrals 
$G(\rho)$, $H(\rho)$ may be partially summed, leaving a series which is convergent for values 
of $\rho$ up to 4. The numerical values in table 1 were checked by this method. 
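
The Euler transformation mentioned above can be sketched as follows. This is a generic illustration of the technique on a textbook alternating series, not the series actually summed for $G(\rho)$ and $H(\rho)$; the function name and the test series for $\log 2$ are assumptions of this example.

```python
import math


def euler_transform_sum(terms):
    """Approximate a_0 - a_1 + a_2 - ... from the leading magnitudes a_k.

    Uses Euler's transformation: the sum equals
    sum over n of (D^n a)_0 / 2^(n+1), where (D a)_k = a_k - a_(k+1).
    """
    differences = list(terms)
    total = 0.0
    weight = 0.5
    while differences:
        total += weight * differences[0]
        # Next row of the difference table: one entry shorter than the last.
        differences = [
            differences[i] - differences[i + 1]
            for i in range(len(differences) - 1)
        ]
        weight *= 0.5
    return total


def direct_alternating_sum(terms):
    """Plain partial sum of a_0 - a_1 + a_2 - ..., for comparison."""
    return sum((-1) ** k * a for k, a in enumerate(terms))


# Slowly convergent example: log 2 = 1 - 1/2 + 1/3 - ...
magnitudes = [1.0 / (k + 1) for k in range(20)]
accelerated = euler_transform_sum(magnitudes)
plain = direct_alternating_sum(magnitudes)
```

With the same twenty terms the transformed sum is far closer to $\log 2$ than the plain partial sum, which is the reason such a transformation helps when the terms alternate in sign and decrease slowly.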


Table 1 

$\rho$    0        0·2      0·4      0·6      0·8      1·0 
G      0·5000   0·4420   0·3863   0·3348   0·2887   0·2487 
H      0·1667   0·1665   0·1657   0·1635   0·1597   0·1544 

$\rho$    1·2      1·4      1·6      1·8      2·0 
G      0·2149   0·1867   0·1635   0·1446   0·1291 
H      0·1478   0·1402   0·1322   0·1240   0·1159 

Table 2. Values of average stresses for $\eta$ = 0·25 

$\rho$                                          0         0·2       0·4       0·6       0·8       1·0 
$2\pi r\,\widehat{rr}/(P\cos\theta)$              -1·6667   -1·6565   -1·6483   -1·6418   -1·6368   -1·6331 
$2\pi r\,\widehat{\theta\theta}/(P\cos\theta)$     0·3333    0·3527    0·3664    0·3754    0·3804    0·3826 
$2\pi r\,\widehat{zz}/(P\cos\theta)$              -0·3333   -0·2155   -0·1273   -0·0655   -0·0254   -0·0017 
$2\pi r\,\widehat{r\theta}/(P\sin\theta)$          0·3333    0·3435    0·3517    0·3582    0·3632    0·3669 

$\rho$                                          1·2       1·4       1·6       1·8       2·0       $\infty$ 
$2\pi r\,\widehat{rr}/(P\cos\theta)$              -1·6304   -1·6285   -1·6272   -1·6264   -1·6258   -1·6250 
$2\pi r\,\widehat{\theta\theta}/(P\cos\theta)$     0·3830    0·3823    0·3811    0·3798    0·3786    0·3750 
$2\pi r\,\widehat{zz}/(P\cos\theta)$               0·0105    0·0152    0·0155    0·0137    0·0109    0 
$2\pi r\,\widehat{r\theta}/(P\sin\theta)$          0·3696    0·3715    0·3728    0·3736    0·3742    0·3750 


The values of the stresses were calculated from equation (6·1) for two values of 
Poisson’s ratio, $\eta$ = 0·25 and $\eta$ = 0·5 (corresponding to an incompressible material), 
and the results are recorded in tables 2 and 3. The values for $\rho = 0$ (i.e. $h \to \infty$) are 
those for plane strain, and for $\rho \to \infty$ (i.e. $h \to 0$) the values are those given by the 
theory of generalized plane stress. The results are shown graphically in figures* 1 
and 2. Although the stresses are not found for values of $\rho$ between 2 and $\infty$ it appears 
probable that little variation will occur in this range of values of $\rho$. 

* For convenience, in figures 1 to 6, the radial stress $\widehat{rr}$ is halved in magnitude before 
plotting. 
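
The tabulated average stresses follow from the tabulated integrals by simple arithmetic, which the sketch below reproduces. It assumes that the average values (6·1) are read as $-(3-2\eta)/\{2(1-\eta)\} + \rho\eta^2(G+H)/(1-\eta)$ for $\widehat{rr}$, $(1-2\eta)/\{2(1-\eta)\} + \rho\eta^2(3G-H)/(1-\eta)$ for $\widehat{\theta\theta}$, $\eta(4\rho G - 1)/(1-\eta)$ for $\widehat{zz}$ and $(1-2\eta)/\{2(1-\eta)\} + \rho\eta^2(G+H)/(1-\eta)$ for $\widehat{r\theta}$; the function and key names are hypothetical.

```python
def average_stresses(eta, rho, g_value, h_value):
    """Dimensionless average stresses from the integrals G(rho) and H(rho).

    Keys rr, tt, zz are 2*pi*r*stress/(P cos(theta)); rt is
    2*pi*r*stress/(P sin(theta)). The formulas are an assumed reading of (6.1).
    """
    plane_strain_denominator = 2.0 * (1.0 - eta)
    factor = rho * eta * eta / (1.0 - eta)
    return {
        "rr": -(3.0 - 2.0 * eta) / plane_strain_denominator
        + factor * (g_value + h_value),
        "tt": (1.0 - 2.0 * eta) / plane_strain_denominator
        + factor * (3.0 * g_value - h_value),
        "zz": eta / (1.0 - eta) * (4.0 * rho * g_value - 1.0),
        "rt": (1.0 - 2.0 * eta) / plane_strain_denominator
        + factor * (g_value + h_value),
    }


# Example: rho = 0.2 with the table 1 entries G = 0.4420, H = 0.1665.
example = average_stresses(0.25, 0.2, 0.4420, 0.1665)
```

With the four-figure values of $G$ and $H$ the results agree with the corresponding table entries to within rounding in the last place.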




Table 3. Values of average stresses for $\eta$ = 0·5 

$\rho$                                          0         0·2       0·4       0·6       0·8       1·0 
$2\pi r\,\widehat{rr}/(P\cos\theta)$              -2·0      -1·9392   -1·8896   -1·8506   -1·8206   -1·7984 
$2\pi r\,\widehat{\theta\theta}/(P\cos\theta)$     0         0·1160    0·1986    0·2523    0·2826    0·2959 
$2\pi r\,\widehat{zz}/(P\cos\theta)$              -1·0000   -0·6464   -0·3819   -0·1965   -0·0762   -0·0052 
$2\pi r\,\widehat{r\theta}/(P\sin\theta)$          0         0·06085   0·1104    0·1495    0·1794    0·2016 

$\rho$                                          1·2       1·4       1·6       1·8       2·0       $\infty$ 
$2\pi r\,\widehat{rr}/(P\cos\theta)$              -1·7824   -1·7712   -1·7634   -1·7583   -1·7550   -1·7500 
$2\pi r\,\widehat{\theta\theta}/(P\cos\theta)$     0·2981    0·2939    0·2866    0·2788    0·2714    0·2500 
$2\pi r\,\widehat{zz}/(P\cos\theta)$               0·0315    0·0455    0·0464    0·0411    0·0328    0 
$2\pi r\,\widehat{r\theta}/(P\sin\theta)$          0·2176    0·2288    0·2366    0·2417    0·2450    0·2500 



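Figure 1. Average stresses when $\eta$ = 0·25, plotted against $\rho$. 
Curve a, $2\pi r\,\widehat{rr}/(2P\cos\theta)$; curve b, $2\pi r\,\widehat{\theta\theta}/(P\cos\theta)$; 
curve c, $2\pi r\,\widehat{r\theta}/(P\sin\theta)$; curve d, $2\pi r\,\widehat{zz}/(P\cos\theta)$. 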



When Poisson’s ratio is 0·25 it is seen that the generalized plane-stress value for 
$\widehat{rr}/\cos\theta$ (i.e. corresponding to $\rho \to \infty$) is a very good approximation to the actual 
average value of this component of stress over a wide range of values of $\rho$, the 
greatest difference being only about 2½ % when the plate is very wide ($\rho \to 0$). There 
is a greater percentage variation in the values of $\widehat{\theta\theta}/\cos\theta$ and $\widehat{r\theta}/\sin\theta$, particularly 
for small values of $\rho$. The average stress $\widehat{zz}$, which is neglected in one form of the 
generalized plane-stress theory, is very small over a wide range of values of $\rho$, but 
when $\rho$ is smaller than about 0·4, $\widehat{zz}$ becomes comparable with $\widehat{\theta\theta}$ and cannot be 
ignored. The small values of $\rho$, however, correspond either to wide plates, for which 
the plane-strain solution is more applicable, or to points fairly near the applied 
force where the mathematical solution is of doubtful physical value. 



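Figure 2. Average stresses when $\eta$ = 0·5. 
Curve a, $2\pi r\,\widehat{rr}/(2P\cos\theta)$; curve b, $2\pi r\,\widehat{\theta\theta}/(P\cos\theta)$; 
curve c, $2\pi r\,\widehat{r\theta}/(P\sin\theta)$; curve d, $2\pi r\,\widehat{zz}/(P\cos\theta)$. 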


When Poisson’s ratio has the limiting value 0·5 for an incompressible material 
there is much greater variation from the generalized plane stress average values of 
the stresses, particularly when $\rho$ is less than 1, the greatest variations occurring in 
$\widehat{\theta\theta}/\cos\theta$ and $\widehat{r\theta}/\sin\theta$. The average stress $\widehat{zz}$ is still small for a wide range of values 
of $\rho$, but when $\rho$ is less than about 0·8 its value is comparable in magnitude with the 
other stresses. 

Broadly speaking it may be concluded that the generalized plane-stress theory 
gives quite a good approximation for the values of the average stress $\widehat{rr}$ for moderate 
values of Poisson’s ratio $\eta$, the approximation decreasing a little in accuracy as $\eta$ 
approaches the limiting value 0·5. The generalized plane-stress theory is a little less 
reliable in its estimate of the remaining average stresses, particularly for the larger 
values of $\eta$. 

Attention in the next sections is directed to the actual values of the stresses at 
particular points of the plate in order to get some idea of the variation from the 
average values. 


7. Values of the stresses on the surface of the plate 


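The values of the stresses on the surface of the plate may be obtained by writing 
$\zeta = \pm 1$ in the formulae of § 5. After some reduction it is found that* 

$$
\begin{aligned}
\frac{2\pi r\,\widehat{rr}}{P\cos\theta} &= \frac{2\eta^2 + 3\eta - 3}{2(1-\eta)} + \frac{\eta}{1-\eta}\{-(3+\eta)L(\rho) + (1-\eta)M(\rho)\}, \\
\frac{2\pi r\,\widehat{\theta\theta}}{P\cos\theta} &= \frac{2\eta^2 - \eta + 1}{2(1-\eta)} - \frac{\eta}{1-\eta}\{(1+3\eta)L(\rho) + (1-\eta)M(\rho)\}, \\
\frac{2\pi r\,\widehat{r\theta}}{P\sin\theta} &= \frac{1 - 2\eta}{2} + \eta\{L(\rho) + M(\rho)\},
\end{aligned}
\tag{7·1}
$$

where 

$$
L(\rho) = \rho\int_0^\infty \frac{u\,J_1(u\rho)}{\Sigma}\,du, \qquad M(\rho) = \rho\int_0^\infty \frac{u\,J_3(u\rho)}{\Sigma}\,du.
$$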


Table 4 

$\rho$    0         0·2         0·4        0·6        0·8        1·0 
L      0         0·007548    0·02870    0·05955    0·09506    0·1305 
M      0         0·0000433   0·000653   0·003002   0·008348   0·01745 

$\rho$    1·2       1·4         1·6        1·8        2·0        $\infty$ 
L      0·1626    0·1892      0·2098     0·2240     0·2351     0·2500 
M      0·03032   0·04629     0·06429    0·08320    0·1020     0·2500 


Table 5. Values of the stresses on the surface of the plate for $\eta$ = 0·25 

$\rho$                                          0         0·2       0·4       0·6       0·8       1·0 
$2\pi r\,\widehat{rr}/(P\cos\theta)$              -1·4167   -1·4248   -1·4476   -1·4804   -1·5176   -1·5537 
$2\pi r\,\widehat{\theta\theta}/(P\cos\theta)$     0·5833    0·5789    0·5664    0·5478    0·5258    0·5028 
$2\pi r\,\widehat{r\theta}/(P\sin\theta)$          0·2500    0·2519    0·2573    0·2656    0·2759    0·2870 

$\rho$                                          1·2       1·4       1·6       1·8       2·0       $\infty$ 
$2\pi r\,\widehat{rr}/(P\cos\theta)$              -1·5852   -1·6100   -1·6279   -1·6395   -1·6459   -1·6250 
$2\pi r\,\widehat{\theta\theta}/(P\cos\theta)$     0·4809    0·4614    0·4449    0·4313    0·4207    0·3750 
$2\pi r\,\widehat{r\theta}/(P\sin\theta)$          0·2982    0·3089    0·3185    0·3270    0·3343    0·3750 


* Except when p = 0. 



The stresses $\widehat{zz}$, $\widehat{\theta z}$ and $\widehat{rz}$ vanish on the surface of the plate. For values of $\rho$ from 0 
to 2 the integrals $L(\rho)$, $M(\rho)$ may be evaluated by a similar method to that used for 
the integrals $G(\rho)$, $H(\rho)$ in § 6. The numerical values are given in table 4. Using these 
values the surface stresses (7·1) were calculated for values of $\eta$ equal to 0·25 and 0·5, 

Table 6. Values of the stresses on the surface of the plate for $\eta$ = 0·5 

$\rho$                                          0         0·2       0·4       0·6       0·8       1·0 
$2\pi r\,\widehat{rr}/(P\cos\theta)$              -1·0000   -1·0264   -1·1001   -1·2069   -1·3285   -1·4480 
$2\pi r\,\widehat{\theta\theta}/(P\cos\theta)$     1·0000    0·9811    0·9279    0·8496    0·7582    0·6650 
$2\pi r\,\widehat{r\theta}/(P\sin\theta)$          0         0·0038    0·0147    0·0313    0·0517    0·0740 

$\rho$                                          1·2       1·4       1·6       1·8       2·0       $\infty$ 
$2\pi r\,\widehat{rr}/(P\cos\theta)$              -1·5539   -1·6391   -1·7022   -1·7456   -1·7719   -1·7500 
$2\pi r\,\widehat{\theta\theta}/(P\cos\theta)$     0·5783    0·5039    0·4434    0·3962    0·3613    0·2500 
$2\pi r\,\widehat{r\theta}/(P\sin\theta)$          0·0965    0·1177    0·1370    0·1541    0·1686    0·2500 



Figure 3. Values of stresses at the faces of the plate when $\eta$ = 0·25. 
Curve a, $2\pi r\,\widehat{rr}/(2P\cos\theta)$; curve b, $2\pi r\,\widehat{\theta\theta}/(P\cos\theta)$; 
curve c, $2\pi r\,\widehat{r\theta}/(P\sin\theta)$. 


and the results are recorded in tables 5 and 6. These results are shown graphically 
in figures 3 and 4. The stresses are also compared with their average values in 
figures 7 to 12. It should be noticed that the values given in tables 5 and 6 for $\rho \to 0$ 
are different from the plane strain values which correspond to $\rho = 0$ owing to the 
removal of the surface stress $\widehat{zz}$ in the present solution. 
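
The surface values can be recomputed from the tabulated integrals in the same way as the averages. The sketch below assumes that the surface stresses (7·1) are read as $(2\eta^2+3\eta-3)/\{2(1-\eta)\} + \eta\{-(3+\eta)L + (1-\eta)M\}/(1-\eta)$ for $\widehat{rr}$ and $(1-2\eta)/2 + \eta(L+M)$ for $\widehat{r\theta}$. The $\widehat{\theta\theta}$ expression used here, $(2\eta^2-\eta+1)/\{2(1-\eta)\} - \eta\{(1+3\eta)L + (1-\eta)M\}/(1-\eta)$, is an assumption obtained by repeating the same reduction, and is offered only because it reproduces the tabulated figures. All names are hypothetical.

```python
def surface_stresses(eta, l_value, m_value):
    """Dimensionless stresses on the faces from the integrals L(rho), M(rho).

    Keys rr, tt are 2*pi*r*stress/(P cos(theta)); rt is
    2*pi*r*stress/(P sin(theta)). Formulas are an assumed reading of (7.1).
    """
    ratio = eta / (1.0 - eta)
    return {
        "rr": (2.0 * eta ** 2 + 3.0 * eta - 3.0) / (2.0 * (1.0 - eta))
        + ratio * (-(3.0 + eta) * l_value + (1.0 - eta) * m_value),
        "tt": (2.0 * eta ** 2 - eta + 1.0) / (2.0 * (1.0 - eta))
        - ratio * ((1.0 + 3.0 * eta) * l_value + (1.0 - eta) * m_value),
        "rt": (1.0 - 2.0 * eta) / 2.0 + eta * (l_value + m_value),
    }


# Example: rho = 1.0 with the table 4 entries L = 0.1305, M = 0.01745.
at_unit_rho = surface_stresses(0.25, 0.1305, 0.01745)
# Limiting case rho -> infinity, where L = M = 0.25.
far_field = surface_stresses(0.5, 0.25, 0.25)
```

In the limit $L = M = 0·25$ the three expressions reduce to the generalized plane-stress values, which is a useful consistency check on the assumed forms.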




Near $\rho = 2$, and possibly for $\rho > 2$, the stress $\widehat{rr}$ at the surface of the plate is very 
slightly in excess of the average value. For smaller values of $\rho$, $\widehat{rr}$ is very much 
smaller than the average values especially for the larger values of $\eta$. The shear 
stress $\widehat{r\theta}$ at the surface always appears to be smaller than the average value, the 
greatest differences again occurring for the larger values of $\eta$. The values of $\widehat{\theta\theta}$ at 
the surface are considerably in excess of the average values for the range of $\rho$ 
considered here, especially for large values of $\eta$. 



Figure 4. Values of stresses at the faces of the plate when $\eta$ = 0·5. 
Curve a, $2\pi r\,\widehat{rr}/(2P\cos\theta)$; curve b, $2\pi r\,\widehat{\theta\theta}/(P\cos\theta)$; 
curve c, $2\pi r\,\widehat{r\theta}/(P\sin\theta)$. 


8. Values of the stresses at points inside the plate 

For internal points the stresses (5·1) to (5·6) may conveniently be expressed in 
the forms 

2nr . fr _ 3 - 2y r\p 

P cos# ~ 2(1 -7) + 4(1-77) 

+(»v-i){Jia+»+A(i-o>+a-OAa+a+(i+4:)j r .(i-o], (8-i> 

2tTT .SB = 1 — 2*7 TIP 

P cos £ 53 2(1 — if ) + 4(1 - 77 ) 

X EC6V+1)K(1 +0+4(1 -0>~(1 -€) +0-Cl+0-4(1 -0 

-(2^-l){/ 4 (l + 0 + 4(l-C)}-(l“QA(l+0-(l + 0/5(l"0L (8-2) 

( 8 - 8 ) 




Three-dimensional stress systems in isotropic plates. II 241 

y^-^W-OV+O-U+tW-O). (8-4) 

Ojw Art 

y^-i^^-Q^U+OMi+OAU-O-O-Mi+O+u+Mi-O}, 

- ( 8 - 6 ) 

27rr. rO _ 1 — 2 i/ t)p 

Psin0 2(1 — 1 ;) + 4(1 — rf) 

X [(29? -1) {/*(i + 0+i 2 (i - 0} ■+ (l - £)J,(i + 0+(iHW-0 
+( 2 3 ? -i){J 4 (i+ 0 +J 4 (i-C)}+(i-Q/ J (i+Q+(i+C)i 6 (i-D]. ( 8 - 0 ) 

where* «r) . J.M - 

Jo ^ Jo 2- 

m , <w (8 .„ 

4w - f» g*-ysa e > ^ 

J 0 ^ 

It is not possible to evaluate these integrals by the methods used in the previous 
sections owing to the absence of tabulated values of $\int_0^\infty u^s e^{-tu}\,\Sigma^{-1}\,du$ for values of $t$ other 
than 0 and 2. 

Sneddon (1946) has used an approximate expression for $\Sigma$ in evaluating similar 
types of integrals to those found here (the first three integrals in (8·7) appear in 
Sneddon’s paper), and this approximation is probably sufficiently accurate for the 
present work. Sneddon replaces $f(u) = u/(u + \sinh u)$ by 

fi{u) * (fiu + l)e- ,u + 2ue- u , 

where $\beta$ and $\nu$ are chosen so that the area under the curve $f_1 - f$ is as small as possible. 
This condition is said by Sneddon to be satisfied to a large degree when $\beta$ = -1·55, 
$\nu$ = 1·40. 

Using this approximation the integrals (8-7) can be written 

h(x) = F t (2 - x) - F a {2 + a;) + \ft{F y (2v - x) - F x (2v + x)} + ${P 2 ( 2v-x)-F t (2v+x)}, 

I s (x) m P s (2 - x) - F s (2 + x) + lft{F 3 (2v -x)-F 3 (2v+ a;)} + \{F t (2v -x)- F t (2v+ x)}, 

h(x) » F t (2-x)+F s (2+x) + $/?{P 8 ( 2 i>-x) + Pj(2y+x)}+l{P,(2v-x)4-P 8 (2i>+x)}, » 

/«(*) * F t {2-x)~ F t {2 +x) + \ft{F t (2v - x) - F s (2v + x)} + i{P 7 ( 2 v - x) - F,{2v +x)}, 

/ 6 (x) = P 8 (2 - x) + P 8 ( 2 + x) + \ft{F % (2v - x) + F 3 (2v +x)} 4- KP € (2 j/-x) + F t (2 v +x)}, J 

( 8 * 8 ) 

* These integrals must not be confused with the $I_s$ integrals in (6·6) or with Bessel 
functions of imaginary argument. 



A. E. Green and T. J. Willmore 


242 

where 


Ji(*> 

w 

w 

F t (x) 

*»(*) 

w 

F,(x) 

i\(x) 


~j*pw~*»J 0 {up)du « 

=S Jo° e_XUJl 


Ji(up) du 


(x* +/>*)* {(a?*+/>*)* +ar}’ 


=J u~ l e~ xu J 1 (up)du = ^ 


= 1 , 


(ar a -4-p a )* + a:* 
9 


0 6 XUj * (u P )du - ( X l + p*)l{(x*+p*)*+xf’ 


y-lg-XU J Jup ) du = - 7; -- ——T-5? » 

0 3W/ 3{(x a +p a )‘ + *} s 

f 05 r / V T P 8 {3(2! 2 4- P 2 )* 4* 

■J. 


•<»*») 


for suitable values of $\rho$ and $x$. Using these formulae the stresses may be obtained 
at any point inside the plate. Values of the stresses on the middle plane $\zeta = 0$ are 
likely to be of particular interest and calculations here have been confined to these 
stresses. The stresses for $\zeta = 0$ only depend on the integrals $I_2(1), \ldots, I_5(1)$, and the 
values of these integrals found from (8·8) and (8·9) are shown in table 7. Using 
these values the stresses were then found from (8·1) to (8·6) for Poisson’s ratio 
$\eta$ = 0·25 and $\eta$ = 0·5 and for values of $\rho$ between 0 and 2. These values are recorded 
in tables 8 and 9 and are shown graphically in figures 5 and 6. The stresses are also 
compared with the average stresses and with the values at the faces of the plate in 
figures 7 to 12. Although the stresses in tables 8 and 9 are given to four decimal 
places it must be remembered that these values are obtained by using an 
approximation, and they may only be accurate to three or possibly two decimal 
places. 
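
Closed forms of the kind listed in (8·9) are instances of the classical Lipschitz-Hankel integrals, $\int_0^\infty e^{-xu}J_n(u\rho)\,du = \rho^n/\{R(R+x)^n\}$ and $\int_0^\infty u^{-1}e^{-xu}J_n(u\rho)\,du = \rho^n/\{n(R+x)^n\}$ with $R = (x^2+\rho^2)^{1/2}$. The sketch below checks these two general results by direct quadrature. It is an illustration only: it does not attempt to reproduce the numbering of the functions in (8·9), and the function names, the use of Bessel's integral for $J_n$ and the quadrature settings are assumptions of this example.

```python
import math


def bessel_j(order, t, steps=120):
    """J_n(t) from Bessel's integral (1/pi) * int_0^pi cos(n*tau - t*sin(tau)) dtau,
    evaluated with the midpoint rule (adequate for moderate t)."""
    width = math.pi / steps
    total = 0.0
    for k in range(steps):
        tau = (k + 0.5) * width
        total += math.cos(order * tau - t * math.sin(tau))
    return total * width / math.pi


def lipschitz_numeric(order, x, rho, inverse_u=False, upper=40.0, steps=2000):
    """Midpoint-rule value of int_0^upper e^(-x u) J_n(u rho) [1/u] du."""
    width = upper / steps
    total = 0.0
    for k in range(steps):
        u = (k + 0.5) * width
        value = math.exp(-x * u) * bessel_j(order, u * rho)
        total += value / u if inverse_u else value
    return total * width


def lipschitz_closed(order, x, rho, inverse_u=False):
    """Closed forms of the two Lipschitz-Hankel integrals (order >= 1)."""
    big_r = math.hypot(x, rho)
    if inverse_u:
        return rho ** order / (order * (big_r + x) ** order)
    return rho ** order / (big_r * (big_r + x) ** order)
```

The truncation at `upper = 40` relies on the exponential factor with $x \geq 1$; smaller values of $x$ would need a larger upper limit and more quadrature points.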

In general, the differences between the stresses on the middle plane and the average 
stresses are somewhat smaller than the differences between the face stresses and the 
average stresses. The stresses on the faces and on the middle plane are mainly 
situated numerically on opposite sides of the average values, except for some 
of the values of $\widehat{r\theta}$ at the smaller values of $\rho$ and some values of $\widehat{rr}$ at the larger 
values of $\rho$. The values of $\widehat{zz}$ on the middle plane are mostly larger than the 
average values of $\widehat{zz}$. 

Actual values of the stresses have only been found at the middle plane and at the 
faces of the plate, but these values are sufficient to show that the stress distribution 
through the plate differs significantly in some cases from the average values, 
especially for the larger values of $\eta$. The generalized plane-stress theory, however, appears 







Figure 7. Values of $2\pi r\,\widehat{rr}/(P\cos\theta)$ for $\eta$ = 0·25. 
Curve a, average values. Curve b, face values. Curve c, values on middle plane. 

Figure 8. Values of $2\pi r\,\widehat{rr}/(P\cos\theta)$ for $\eta$ = 0·5. 
Curve a, average values. Curve b, face values. Curve c, values on middle plane. 

Figure 9. Values of $2\pi r\,\widehat{\theta\theta}/(P\cos\theta)$ for $\eta$ = 0·25. 
Curve a, average values. Curve b, face values. Curve c, values on middle plane. 





Figure 10. Values of $2\pi r\,\widehat{\theta\theta}/(P\cos\theta)$ for $\eta$ = 0·5. 
Curve a, average values. Curve b, face values. Curve c, values on middle plane. 

Figure 11. Values of $2\pi r\,\widehat{r\theta}/(P\sin\theta)$ for $\eta$ = 0·25. 
Curve a, average values. Curve b, face values. Curve c, values on middle plane. 


to give a reasonable approximation for the average stresses for cases in which the 
theory is usually assumed to be valid, the accuracy decreasing with increase in 
Poisson’s ratio. 

Table 7 

$\rho$       0·2          0·4         0·8        1·2        1·6        2·0 
$I_2(1)$   0·071314     0·12855     0·18250    0·18033    0·15811    0·13352 
$I_3(1)$   0·17171      0·28875     0·33275    0·26619    0·19830    0·15030 
$I_4(1)$   0·00087707   0·0060575   0·029796   0·055341   0·071461   0·078206 
$I_5(1)$   0·0035804    0·023181    0·093374   0·13938    0·14818    0·13832 




Figure 12. Values of $2\pi r\,\widehat{r\theta}/(P\sin\theta)$ for $\eta$ = 0·5. 
Curve a, average values. Curve b, face values. Curve c, values on middle plane. 


Table 8. Values of the stresses in the middle plane of the plate for $\eta$ = 0·25 

$\rho$                                          0         0·2       0·4       0·8       1·2       1·6       2·0       $\infty$ 
$2\pi r\,\widehat{rr}/(P\cos\theta)$              -1·6667   -1·6764   -1·6931   -1·7041   -1·6778   -1·6477   -1·6281   -1·6250 
$2\pi r\,\widehat{\theta\theta}/(P\cos\theta)$     0·3333    0·3334    0·3342    0·3393    0·3479    0·3559    0·3614    0·3750 
$2\pi r\,\widehat{zz}/(P\cos\theta)$              -0·3333   -0·3000   -0·2221   -0·0686   -0·0239   -0·0408   -0·0451    0 
$2\pi r\,\widehat{r\theta}/(P\sin\theta)$          0·3333    0·3380    0·3490    0·3760    0·3909    0·3261    0·3943    0·3750 



Table 9. Values of the stresses in the middle plane of the plate for $\eta$ = 0·5 

$\rho$                                          0         0·2       0·4       0·8       1·2       1·6       2·0       $\infty$ 
$2\pi r\,\widehat{rr}/(P\cos\theta)$              -2·0000   -2·0220   -2·0058   -2·0700   -1·9027   -1·8514   -1·7785   -1·750 
$2\pi r\,\widehat{\theta\theta}/(P\cos\theta)$     0         0·0110    0·0405    0·1216    0·1895    0·2288    0·2455    0·2500 
$2\pi r\,\widehat{zz}/(P\cos\theta)$              -1·0000   -0·9028   -0·6662   -0·1750    0·0710    0·1405    0·1353    0 
$2\pi r\,\widehat{r\theta}/(P\sin\theta)$          0         0·0175    0·0024    0·1704    0·2433    0·2772    0·2880    0·2500 






The paper is concluded by adding the solution of the isolated force problem for 
a plate of semi-infinite thickness. 


9. Solution for a plate of semi-infinite thickness 


Consider now an isolated force of intensity $P$ per unit length, directed along the 
axis of $x$, and situated along the $z$-axis from $z = 0$ to $z = \infty$, acting in a plate which 
is bounded by the single plane $z = 0$. The boundary conditions (4·3) are now to be 
satisfied when $z = 0$ and the additional stresses must vanish at $z = \infty$ as well as 
$r = \infty$. Suitable forms for the $\omega$ and $\phi$ functions are therefore 

$$
\omega = \cos\theta\int_0^\infty f(u)\,J_1(ur)\,e^{-uz}\,du, \qquad \phi = \cos\theta\int_0^\infty g(u)\,J_1(ur)\,e^{-uz}\,du. \tag{9·1}
$$

It follows from (4·3) that 

$$
f(u) + g(u) = 0, \qquad \int_0^\infty \{f(u) - g(u)\}\,u^2 J_1(ur)\,du = \frac{P\eta}{4\pi\mu(1-\eta)r}. \tag{9·2}
$$

Solving for $f(u)$ and $g(u)$ and adding suitable terms to secure convergence of the 
integrals at the lower limit it is found that 

$$
\omega = -\phi = \frac{P\eta\cos\theta}{8\pi\mu(1-\eta)}\int_0^\infty \left\{\frac{e^{-uz}J_1(ur)}{u^2} - \frac{r e^{-u}}{2u}\right\}du. \tag{9·3}
$$

From (2·12), (3·2) and (9·3) the complete stress system which satisfies the 
boundary condition (4·2) at the plane $z = 0$ is found to be 


2tjt . fr 
P cos 8 


+ f JP: ..- {(2, + 3) F 3 (z) - 3 zF h (z) -(1-2,) **,(*) + 


2wr, @ 1- 2, 

Pcosd ~ 2(1-7)) 


+ {(6, + 1) F,(z) - zF 6 (z) + (1 - 2,) F e (z) - zF g (z)}, 


2nr.zz __ rj , 
P cos 0 1 — if) * 


-rF s (z)-rzF t (z)}, 


2nr. Oz _ r)zF a (z) 


2nr .zr 
Pcosd 
2nr. rd 
P sin (9 


Vz{F a (z)_-F,(z)} 

*1-V. 

{(** - 1 ) W + + (%-') W + afy*)}, 


(9-4) 

(9-5) 

(9-6) 

(9-7) 

(9-8) 

(9-9) 


where $F_1(z), \ldots$ are the functions defined by the equation (8·9) except that $\rho$ in (8·9) 
must be replaced by $r$. The values of the stresses at the face $z = 0$ reduce to the 
simple forms 

$$
\frac{2\pi r\,\widehat{rr}}{P\cos\theta} = \frac{2\eta^2 + 3\eta - 3}{2(1-\eta)}, \qquad
\frac{2\pi r\,\widehat{\theta\theta}}{P\cos\theta} = \frac{2\eta^2 - \eta + 1}{2(1-\eta)}, \qquad
\frac{2\pi r\,\widehat{r\theta}}{P\sin\theta} = \frac{1 - 2\eta}{2}, \tag{9·10}
$$

with the remaining stresses zero. These values agree with those found in § 7 at the 
surfaces of the plate when $\rho \to 0$. 
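
The agreement with § 7 can be confirmed by evaluating the face values directly. The sketch below assumes the simple forms are read as $(2\eta^2+3\eta-3)/\{2(1-\eta)\}$, $(2\eta^2-\eta+1)/\{2(1-\eta)\}$ and $(1-2\eta)/2$ for the $\widehat{rr}$, $\widehat{\theta\theta}$ and $\widehat{r\theta}$ components respectively; the function and key names are hypothetical.

```python
def half_space_face_values(eta):
    """Dimensionless face stresses at z = 0 for the semi-infinite plate.

    Keys rr, tt are 2*pi*r*stress/(P cos(theta)); rt is
    2*pi*r*stress/(P sin(theta)). Formulas are an assumed reading of (9.10).
    """
    denominator = 2.0 * (1.0 - eta)
    return {
        "rr": (2.0 * eta ** 2 + 3.0 * eta - 3.0) / denominator,
        "tt": (2.0 * eta ** 2 - eta + 1.0) / denominator,
        "rt": (1.0 - 2.0 * eta) / 2.0,
    }


quarter = half_space_face_values(0.25)
incompressible = half_space_face_values(0.5)
```

For $\eta$ = 0·25 and $\eta$ = 0·5 these expressions give the $\rho = 0$ columns of the tables of surface stresses, under the reading assumed above.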





References 

Dougall, J. 1904 Trans. Roy. Soc. Edinb. 41, 329. 

Green, A. E. 1948 Phil. Trans. A, 240, 561. 

Howland, R. C. J. 1930 Phil. Trans. A, 229, 49. 

Howland, R. C. J. & Stevenson, A. C. 1933 Phil. Trans. A, 232, 155. 

Love, A. E. H. 1927 A treatise on the mathematical theory of elasticity, 4th ed. Camb. Univ. 
Press. 

Sneddon, I. N. 1946 Proc. Camb. Phil. Soc. 42, 260. 

Watson, G. N. 1944 A treatise on the theory of Bessel functions, 2nd ed. Camb. Univ. Press. 


On the interpretation of the Foucault test 


By E. H. Linfoot, University of Bristol 
(Communicated by C. R. Burch, F.R.S.; received 27 October 1947) 


The diffraction theory of the Foucault test is applied to discuss the intrinsic errors of the test 
when the Foucault shadows are interpreted by means of the ‘grazing light’ fiction in the usual 
manner. In § 2, formulae are obtained for the effects on the Foucault shadows of an arbitrary 
small change in the figure of the mirror under test. In § 3 the case of a weak, narrow local zone 
on an otherwise nearly true mirror is discussed and expressions are obtained in terms of the 
zone-profile for the changes in the Foucault shadows caused by the presence of the zone. The 
relationship between the zone-profile inferred by means of the ‘grazing light’ interpretation of 
the shadows and that actually present on the mirror is analyzed in § 4, and it is shown that the 
analysis suggests some modifications in the practical technique of figuring mirrors under the 
Foucault test. Lastly, the general formulae established for weak, narrow zonal errors are 
compared with the results of computation in four selected special cases and it is shown that 
they may be expected to remain useful approximations in cases of interest to the practical 
mirror-maker. 


1. Introduction 

In his account of the knife-edge testing of astronomical mirrors, published in 1859,
Foucault gave a simple explanation of the properties of the test in terms of ray
theory. His explanation, too well known to need repeating here, leaves certain rather
conspicuous appearances unexplained; nevertheless, it has served ever since as the
basis of the interpretation of the test by optical workers. In the practical use of the
test, the unexplained appearances, such as the brilliant line of light seen round the
rim of a nearly true mirror, or the dark fringes which form on the bright slopes of
a zonal error several wave-lengths deep, are ordinarily set aside as 'due to diffraction'
and ignored as far as possible, while the interpretation of the knife-edge shadows is
made in a semi-intuitive manner by means of the 'grazing light' fiction.

This device consists in replacing the mirror surface in imagination by a flat disk
on which parallel light falls, at nearly grazing incidence, from the side opposite to
the knife-edge. The Foucault shadows are interpreted as relief-shadows and the
bas-relief figure thus 'seen' on the disk is taken to represent the error-figure of the
mirror. Provided that the error-slopes are sufficiently small, the error-figure inferred
in this way agrees in shape with that which would be obtained by a photometric
analysis of the Foucault shadows on the basis of ray theory. This does not mean,
however, that it necessarily agrees in shape with the error-figure actually present
on the mirror, and the relationship between the inferred and the actual errors remains
a matter for investigation.

In the present paper, the diffraction theory of the test is applied to make
such an investigation in the case of a weak, narrow zone on the surface of an
otherwise nearly true mirror, Foucault tested with a light-source of negligible
dimensions.

This case is a favourable one for discussion, since it is one in which the conclusions
are not much affected by the simplifying assumption that the pinhole-size is negligible,
and is not unlike cases actually met with in the zonal figuring of large astronomical
mirrors.


2. Effect on the Foucault shadows of a small change in figure 

Formulae will first be derived for the effect of an arbitrary small change in the
figure of an unfinished mirror on the intensities seen under the Foucault test. The
waves leaving the mirror can be described, as in a previous paper (Linfoot 1946), by
means of the wave-function

\[
E_1(x, y) =
\begin{cases}
e^{2\pi i \phi(x, y)} & (x^2 + y^2 \le 1), \\
0 & (x^2 + y^2 > 1),
\end{cases}
\tag{2·1}
\]

where $\phi(x, y)$ measures, in reflexion-fringes, the deviation of the mirror surface from
the reference-sphere $M_0$. The deviation may amount to many wave-lengths, but the
figure of the mirror is supposed to be smooth enough to send all the reflected rays
through the central parts of the aperture of the viewing system. The intensity seen
under the Foucault test with the knife-edge in the plane through the centre of
curvature of $M_0$, but displaced laterally through a distance $c$ from the central setting,
is (Linfoot 1946, p. 83)

\[
I_1(x, y; c') = |2\pi D_1(x, y; c')|^2,
\tag{2·2}
\]

where, for $-1 \le y \le 1$,

\[
2\pi D_1(x, y; c') = \pi E_1(x, y) + i \int_{-Y}^{Y} \frac{E_1(t, y)}{t - x}\, e^{ic'(t - x)}\, dt;
\tag{2·3}
\]

here

\[
c' = \frac{2\pi c}{\lambda s}, \qquad Y = \sqrt{1 - y^2},
\tag{2·4}
\]

$s$ is the radius of curvature of $M_0$, and the integral in (2·3) is a Cauchy principal
value when $|x| < Y$. When $|y| > 1$, the value of $D_1(x, y; c')$ is zero.





A change in the figure of the mirror can be expressed by adding to $\phi(x, y)$ a
deviation-contribution $\psi(x, y)$, to give the new wave-function

\[
E(x, y) = E_1(x, y)\, e^{2\pi i \psi(x, y)} = E_1(x, y)\,\bigl(1 + 2\pi i\, \eta(x, y)\bigr),
\tag{2·5}
\]

where

\[
\eta(x, y) = \frac{1}{2\pi i}\bigl(e^{2\pi i \psi(x, y)} - 1\bigr).
\tag{2·6}
\]

We suppose that $\psi(x, y)$ possesses continuous partial differential coefficients everywhere,
that $|\psi(x, y)| \ll 1$ and that $\psi^2(x, y)$ can be neglected. Then $\eta(x, y)$ possesses
continuous partial differential coefficients; and, to the same order of approximation
as before, $\eta(x, y) = \psi(x, y)$, $\eta(x, y)$ is real, and $\eta^2(x, y)$ can be neglected.
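The size of the error committed by the first-order replacement in (2·5) can be checked numerically. The following is a minimal sketch, not part of the original paper; the function names are illustrative assumptions. It compares the exact multiplier exp(2πiψ) with 1 + 2πiψ and shows that the discrepancy is of the order of (2πψ)²/2, which is the quantity the text agrees to neglect.

```python
import cmath
import math


def exact_factor(psi):
    """Exact multiplier exp(2*pi*i*psi) for a figure change of psi fringes."""
    return cmath.exp(2j * math.pi * psi)


def linear_factor(psi):
    """First-order multiplier 1 + 2*pi*i*psi, i.e. (2.5) with eta taken equal to psi."""
    return 1 + 2j * math.pi * psi


def linearization_error(psi):
    """Modulus of the difference between the exact and first-order multipliers."""
    return abs(exact_factor(psi) - linear_factor(psi))


if __name__ == "__main__":
    # The error falls roughly fourfold each time psi is halved (second-order term).
    for psi in (0.04, 0.02, 0.01):
        theta = 2 * math.pi * psi
        print(f"psi = {psi:5.2f}  error = {linearization_error(psi):.3e}  "
              f"theta^2/2 = {theta * theta / 2:.3e}")
```

For ψ of a few hundredths of a fringe the neglected term is already below one per cent of the retained first-order term.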

We suppose further that the integral

\[
T(x, y) = \int_{-Y}^{Y} \left| \frac{\eta(t, y) - \eta(x, y)}{t - x} \right| dt
\tag{2·7}
\]

is, for $x^2 + y^2 < 1$, so small that its square can be neglected. This means, broadly
speaking, that the changes in profile are not too violent.

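The intensity seen on the mirror under the Foucault test, with the knife-edge at
the same setting as before, is now

\[
I(x, y; c') = |2\pi D(x, y; c')|^2,
\tag{2·8}
\]

where

\[
\begin{aligned}
2\pi D(x, y; c') &= \pi E(x, y) + i \int_{-Y}^{Y} \frac{E(t, y)}{t - x}\, e^{ic'(t - x)}\, dt \\
&= 2\pi D_1(x, y; c') + \pi \cdot 2\pi i\, \eta(x, y)\, E_1(x, y)
 - 2\pi \int_{-Y}^{Y} \frac{\eta(t, y)\, E_1(t, y)}{t - x}\, e^{ic'(t - x)}\, dt,
\end{aligned}
\tag{2·9}
\]

by (2·5). Therefore

\[
\begin{aligned}
|2\pi D(x, y; c')|^2 = |2\pi D_1(x, y; c')|^2
&+ \left| 2\pi D_1(x, y; c')\, 2\pi i\, \eta(x, y)
 - 2\pi \int_{-Y}^{Y} \frac{\eta(t, y) - \eta(x, y)}{t - x}\, E_1(t, y)\, e^{ic'(t - x)}\, dt \right|^2 \\
&+ 2\Re\left\{ \overline{2\pi D_1(x, y; c')} \left[ 2\pi^2 i\, \eta(x, y)\, E_1(x, y)
 - 2\pi \int_{-Y}^{Y} \frac{\eta(t, y)\, E_1(t, y)}{t - x}\, e^{ic'(t - x)}\, dt \right] \right\}.
\end{aligned}
\]

Since

\[
\left| \int_{-Y}^{Y} \frac{\eta(t, y) - \eta(x, y)}{t - x}\, E_1(t, y)\, e^{ic'(t - x)}\, dt \right| \le T(x, y),
\]

the second term on the right can be neglected provided that $x$ is not near the edge of
the mirror,* and we obtain the approximate formula

\[
I(x, y; c') - I_1(x, y; c')
= 2\Re\left\{ \overline{2\pi D_1(x, y; c')} \left[ 2i\pi^2 \eta(x, y)\, E_1(x, y)
 - 2\pi \int_{-Y}^{Y} \frac{\eta(t, y)\, E_1(t, y)}{t - x}\, e^{ic'(t - x)}\, dt \right] \right\}
\tag{2·10}
\]

for the changes in the intensities of the knife-edge shadows which result when an
arbitrary small change of $\eta(x, y)$ fringes is made in the figure of the mirror.

* $D_1(x, y; c')$ becomes large near $x = \pm Y$.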




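The effect of introducing small errors on a previously true mirror is obtained as
a special case on setting $\phi(x, y) = 0$. In this case,

\[
E_1(x, y) = 1, \qquad
2\pi D_1(x, y; c') = \pi + i \int_{-Y}^{Y} \frac{e^{ic'(t - x)}}{t - x}\, dt
\tag{2·11}
\]

for $x^2 + y^2 < 1$, and (2·10) gives, for these values of $x$ and $y$,

\[
\begin{aligned}
I(x, y; c') - I_0(x, y; c')
&= 2\Re\left\{ \left( \pi - i \int_{-Y}^{Y} \frac{e^{-ic'(t - x)}}{t - x}\, dt \right)
 \left( 2i\pi^2 \eta(x, y) - 2\pi \int_{-Y}^{Y} \frac{\eta(t, y)}{t - x}\, e^{ic'(t - x)}\, dt \right) \right\} \\
&= -2 \left( \pi - \int_{-Y}^{Y} \frac{\sin c'(t - x)}{t - x}\, dt \right)
 2\pi \int_{-Y}^{Y} \frac{\eta(t, y)}{t - x} \cos c'(t - x)\, dt \\
&\quad + 2 \int_{-Y}^{Y} \frac{\cos c'(t - x)}{t - x}\, dt
 \left( 2\pi^2 \eta(x, y) - 2\pi \int_{-Y}^{Y} \frac{\eta(t, y)}{t - x} \sin c'(t - x)\, dt \right);
\end{aligned}
\]

that is,

\[
\frac{1}{4\pi^2}\bigl[ I(x, y; c') - I_0(x, y; c') \bigr]
= C(x, y) \left\{ \pi \eta(x, y) - \int_{-Y}^{Y} \frac{\eta(t, y)}{t - x} \sin c'(t - x)\, dt \right\}
 - \bigl[ 1 - S(x, y) \bigr] \int_{-Y}^{Y} \frac{\eta(t, y)}{t - x} \cos c'(t - x)\, dt,
\tag{2·12}
\]

where $Y = \sqrt{1 - y^2}$, $c' = 2\pi c/\lambda s$,

\[
\begin{aligned}
C(x, y) &= \frac{1}{\pi} \int_{-c'(Y + x)}^{c'(Y - x)} \frac{\cos u}{u}\, du
 = \frac{1}{\pi} \bigl[ \operatorname{Ci}\{|c'(Y - x)|\} - \operatorname{Ci}\{|c'(Y + x)|\} \bigr], \\
S(x, y) &= \frac{1}{\pi} \int_{-c'(Y + x)}^{c'(Y - x)} \frac{\sin u}{u}\, du
 = \frac{1}{\pi} \bigl[ \operatorname{Si}\{c'(Y + x)\} + \operatorname{Si}\{c'(Y - x)\} \bigr],
\end{aligned}
\tag{2·13}
\]

while

\[
I_0(x, y; c') = \pi^2 \bigl\{ [C(x, y)]^2 + [1 - S(x, y)]^2 \bigr\}
\tag{2·14}
\]

represents the intensities seen on the surface of a true mirror at the same knife-edge
setting. The condition (2·7) is always satisfied if the slopes of the 'error-surface'
$z = \eta(x, y)$ are everywhere small enough for their squares to be neglected; that is,
it is satisfied by small, slow errors of arbitrary form.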

3. Local zonal errors 

To discuss the appearance of a narrow local zone under the test, suppose that
$\psi(x, y)$ is radially symmetrical and that it vanishes outside the zone of width $2\delta$ and
mean radius $r$.

During the removal of zonal error from surfaces of nearly perfect revolution-symmetry,
attention is usually concentrated on the 'main diameter' of the surface
(that is, the diameter running at right angles to the direction of the knife-edge)
and the form of the zonal error is inferred from the intensities seen along this diameter.
These intensities are obtained on setting $y = 0$ in (2·10). In the case of a local
zone, where $\eta(x, 0)$ is zero everywhere outside the ranges $-r - \delta \le x \le -r + \delta$ and
$r - \delta \le x \le r + \delta$, they satisfy the relation

\[
I(x, 0; c') - I_1(x, 0; c')
= 2\Re\left\{ \overline{2\pi D_1(x, 0; c')} \left[ 2i\pi^2 \eta(x, 0)\, E_1(x, 0)
 - 2\pi \left( \int_{-r - \delta}^{-r + \delta} + \int_{r - \delta}^{r + \delta} \right)
 \frac{\eta(t, 0)\, E_1(t, 0)}{t - x}\, e^{ic'(t - x)}\, dt \right] \right\},
\tag{3·1}
\]

provided that $x$ is not too close to the ends of the diameter.

If $E_1(x, 0)$ and $D_1(x, 0; c')$ are nearly constant as $x$ varies on and near the zone,
(3·1) can be replaced there by the approximate equation

\[
I(x, 0; c') - I_1(x, 0; c')
= 4\pi \Re\left\{ \overline{D_1(p, 0; c')} \left[ 2i\pi^2 \eta(x, 0)\, E_1(p, 0)
 - 2\pi \left( E_1(-r, 0) \int_{-r - \delta}^{-r + \delta} + E_1(r, 0) \int_{r - \delta}^{r + \delta} \right)
 \frac{\eta(t, 0)}{t - x}\, e^{ic'(t - x)}\, dt \right] \right\},
\tag{3·2}
\]

where $p = \pm r$ according as $x > 0$ or $x < 0$.

In particular, (3·2) can be used when $\delta \ll 1$ and $E_1(x, 0)$ varies only slowly over the
whole range $-1 < x < 1$; that is to say, in the case of a narrow local zone on a nearly
true mirror. In this case, the distance (approximately 10 $c'$-units) through which
the knife-edge must be moved laterally to change the appearance of the main part
of the mirror from 'fully lit' to 'fully dark' is small compared with the horizontal
resolving power of each of the two segments $-r - \delta \le t \le -r + \delta$ and $r - \delta \le t \le r + \delta$
cut off by the zone on the main diameter, and the value of $e^{ic'(t - x)}$ is accordingly
nearly constant over these two $t$-ranges for the relevant values of $c'$ and of $x$.

To make the hypotheses more definite, we agree to say that $x$ is 'on or near'
the zone when it satisfies one of the two conditions

\[
|x - r| \le 6\delta,
\tag{3·3}
\]

\[
|x + r| \le 6\delta.
\tag{3·4}
\]

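For such values of $x$,

\[
\left( \int_{-r - \delta}^{-r + \delta} + \int_{r - \delta}^{r + \delta} \right)
 \frac{\eta(t, 0)\, E_1(t, 0)}{t - x}\, e^{ic'(t - x)}\, dt
= E_1(r, 0)\, e^{ic'(r - x)} \int_{r - \delta}^{r + \delta} \frac{\eta(t, 0)}{t - x}\, dt
 + E_1(-r, 0)\, e^{-ic'(r + x)} \int_{-r - \delta}^{-r + \delta} \frac{\eta(t, 0)}{t - x}\, dt
\tag{3·5}
\]

approximately. When $x$ satisfies (3·3), the second term on the right of (3·5) is small
in comparison with the first; when $x$ satisfies (3·4), the first is small in comparison
with the second. In either case, (3·5) can be replaced by the equation

\[
\int_{-1}^{1} \frac{\eta(t, 0)\, E_1(t, 0)}{t - x}\, e^{ic'(t - x)}\, dt
= E_1(p, 0) \int_{-1}^{1} \frac{\zeta(\tau)}{\tau - \xi}\, d\tau,
\]

where

\[
p = r \operatorname{sgn} x,† \qquad \xi = \frac{x - p}{\delta}, \qquad \tau = \frac{t - p}{\delta},
\]

and the function $\zeta(\tau) = \eta(t, 0)$
expresses the zone-profile in terms of the new variable $\tau$, which runs from $-1$ to $+1$
on a zonal segment of width $2\delta$.

† $\operatorname{sgn} x$ is defined as $1$, $0$, $-1$ according as $x >, =, < 0$.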

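(3·2) then gives, for points $x$ on or near the zone,

\[
\begin{aligned}
\frac{1}{4\pi^2} \bigl\{ I(x, 0; c') - I_1(x, 0; c') \bigr\}
&= \Re\left\{ \overline{D_1(p, 0; c')}\, E_1(p, 0) \left[ 2\pi i\, \zeta(\xi)
 - 2 \int_{-1}^{1} \frac{\zeta(\tau)}{\tau - \xi}\, d\tau \right] \right\} \\
&= -\pi \zeta(\xi)\, \Im\bigl\{ 2\, \overline{D_1(p, 0; c')}\, E_1(p, 0) \bigr\}
 - \int_{-1}^{1} \frac{\zeta(\tau)}{\tau - \xi}\, d\tau\;
 \Re\bigl\{ 2\, \overline{D_1(p, 0; c')}\, E_1(p, 0) \bigr\}.
\end{aligned}
\tag{3·6}
\]

In the special case of a narrow local zone on an otherwise true mirror, (3·6) becomes,
in virtue of (2·11),

\[
\frac{1}{4\pi^2} \bigl\{ I(x, 0; c') - I_1(x, 0; c') \bigr\}
= -\bigl[ \operatorname{Ci}\{|c'(1 + p)|\} - \operatorname{Ci}\{|c'(1 - p)|\} \bigr]\, \zeta(\xi)
 - \left( 1 - \frac{1}{\pi} \bigl[ \operatorname{Si}\{c'(1 + p)\} + \operatorname{Si}\{c'(1 - p)\} \bigr] \right)
 \int_{-1}^{1} \frac{\zeta(\tau)}{\tau - \xi}\, d\tau,
\tag{3·7}
\]

a result which can also be derived from (2·12). When the knife-edge is centrally set
($c' = 0$), (3·7) takes the simpler form

\[
\frac{1}{4\pi^2} \bigl\{ I(x, 0; 0) - I_1(x, 0; 0) \bigr\}
= -\operatorname{sgn} x\, \log\frac{1 + r}{1 - r}\; \zeta(\xi)
 - \int_{-1}^{1} \frac{\zeta(\tau)}{\tau - \xi}\, d\tau.
\tag{3·8}
\]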

In the more general case where small, slow irregular errors may also be present
on the mirror, we write

\[
E_1(x, y) = e^{2\pi i \phi(x, y)} = 1 + 2\pi i\, \eta_1(x, y),
\]

and suppose $\phi(x, y)$ so small that its square can be neglected. Then $\eta_1(x, y) = \phi(x, y)$
to the same order of approximation, and an application of (2·9), with

\[
2\pi D_0(x, 0; c') = \pi + i \int_{-1}^{1} \frac{e^{ic'(t - x)}}{t - x}\, dt,
\tag{3·9}
\]

in place of $2\pi D_1$ and with $D_1$ in place of $D$, gives

\[
2\pi D_1(x, 0; c') = 2\pi D_0(x, 0; c')\, E_1(x, 0)
 - 2\pi \int_{-1}^{1} \frac{\eta_1(t, 0) - \eta_1(x, 0)}{t - x}\, e^{ic'(t - x)}\, dt,
\tag{3·10}
\]

\[
\overline{2\pi D_1(x, 0; c')}\, E_1(x, 0) = \overline{2\pi D_0(x, 0; c')}\, |E_1(x, 0)|^2
 - 2\pi E_1(x, 0) \int_{-1}^{1} \frac{\eta_1(t, 0) - \eta_1(x, 0)}{t - x}\, e^{-ic'(t - x)}\, dt.
\tag{3·11}
\]

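Since the irregular errors are assumed to be slow, the integral

\[
T(x, 0) = \int_{-1}^{1} \left| \frac{\eta_1(t, 0) - \eta_1(x, 0)}{t - x} \right| dt
\]

is of the same order of smallness as $\eta_1(x, 0)$ itself, and the product $\eta_1(x, 0)\, T(x, 0)$
can be neglected. (3·11) therefore gives

\[
\overline{2\pi D_1(x, 0; c')}\, E_1(x, 0) = \overline{2\pi D_0(x, 0; c')}
 - 2\pi \int_{-1}^{1} \frac{\eta_1(t, 0) - \eta_1(x, 0)}{t - x}\, e^{-ic'(t - x)}\, dt
\tag{3·12}
\]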

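and on substituting in (3·6) we obtain

\[
\begin{aligned}
\frac{1}{4\pi^2} \bigl\{ I(x, 0; c') - I_1(x, 0; c') \bigr\}
= &-\zeta(\xi)\, \Im\left[ \overline{2\pi D_0(p, 0; c')}
 - 2\pi \int_{-1}^{1} \frac{\eta_1(t, 0) - \eta_1(p, 0)}{t - p}\, e^{-ic'(t - p)}\, dt \right] \\
&- \int_{-1}^{1} \frac{\zeta(\tau)}{\tau - \xi}\, d\tau\;
 \Re\left[ 2\, \overline{D_0(p, 0; c')}
 - 2 \int_{-1}^{1} \frac{\eta_1(t, 0) - \eta_1(p, 0)}{t - p}\, e^{-ic'(t - p)}\, dt \right].
\end{aligned}
\tag{3·13}
\]

The last terms in the two square brackets are respectively less than $2\pi T(p, 0)$, $2T(p, 0)$
in absolute value; they can therefore be neglected to the present approximation.
It follows that (3·7), and its particular case (3·8), remain valid approximations when,
in addition to the narrow zonal error described by $\zeta(\tau)$, small irregular slow errors
are present on the mirror surface.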


4. Interpretation of the test 

Along the main diameter $y = 0$, the inferred error-slope of the mirror on and near
a weak, narrow zone is proportional to $I(x, 0; c') - I_1(x, 0; c')$. It varies somewhat
as the lateral setting of the knife-edge is varied, although the variations in the shape
of the inferred error-profile are not large enough to attract notice in the ordinary use
of the test.

When $c' = 0$, i.e. when the knife-edge is central, the inferred error-profile $P(x)$
or $\zeta^*(\xi)$ on the mirror is given, apart from an appropriate scale factor* and an
arbitrary additive constant, by the equation

\[
P(x) = -\int \bigl\{ I(x, 0; 0) - I_1(x, 0; 0) \bigr\}\, dx,
\tag{4·1}
\]

and (3·8) therefore yields, on and near the zone, the approximate formula

\[
\begin{aligned}
\zeta^*(\xi) = P(x)
&= \operatorname{sgn} x\, \log\frac{1 + r}{1 - r} \int_{-1}^{\xi} \zeta(u)\, du
 + \int_{-1}^{\xi} \left( \int_{-1}^{1} \frac{\zeta(\tau)}{\tau - u}\, d\tau \right) du \\
&= \operatorname{sgn} x\, \log\frac{1 + r}{1 - r} \int_{-1}^{\xi} \zeta(u)\, du
 + \int_{-1}^{1} \zeta(\tau) \log\frac{1}{|\tau - \xi|}\, d\tau.
\end{aligned}
\tag{4·2}
\]

* An unknown scale factor is necessarily involved, since the intensities seen under the test
with a very small pinhole are determined by the depths of the errors measured in wave-lengths
and not by the angular sizes of the error-slopes. Two mirrors of different sizes but the
same shape, for example two paraboloids of different sizes but the same focal ratio, therefore
show different shadow-intensities under the test.





(4·2) expresses the distortion which the zone-profile undergoes when 'seen' under
the Foucault test with the knife-edge centrally set. That is to say, it describes the
'intrinsic error' of the test when applied to a weak, narrow zone of mean radius $r$
on an otherwise nearly true mirror.

The corresponding formula when the knife-edge is non-central is obtained by
substituting from (3·7) into (4·1); it is

\[
\zeta^*(\xi) = \bigl[ \operatorname{Ci}\{|c'(1 + p)|\} - \operatorname{Ci}\{|c'(1 - p)|\} \bigr] \int_{-1}^{\xi} \zeta(u)\, du
 + \left( 1 - \frac{1}{\pi} \bigl[ \operatorname{Si}\{c'(1 + p)\} + \operatorname{Si}\{c'(1 - p)\} \bigr] \right)
 \int_{-1}^{1} \zeta(\tau) \log\frac{1}{|\tau - \xi|}\, d\tau.
\tag{4·3}
\]

Equations (4·2), (4·3) allow some general conclusions to be drawn about the
relation between the inferred profile of a weak, narrow zone and its actual profile.
(4·3) can be written

\[
\zeta^*(\xi) = A \int_{-1}^{1} \zeta(\tau) \log\frac{1}{|\tau - \xi|}\, d\tau + B \int_{-1}^{\xi} \zeta(u)\, du,
\tag{4·4}
\]

where the coefficients

\[
\begin{aligned}
A = A(p, c') &= 1 - \frac{1}{\pi} \bigl[ \operatorname{Si}\{c'(1 + p)\} + \operatorname{Si}\{c'(1 - p)\} \bigr], \\
B = B(p, c') &= \operatorname{Ci}\{|c'(1 + p)|\} - \operatorname{Ci}\{|c'(1 - p)|\},
\end{aligned}
\tag{4·5}
\]

depend on $p = r \operatorname{sgn} x$ and on $c'$, but not on $\delta$. It follows that the apparent distortion
imposed by the test on the zone-profile is independent of the zone-width; two narrow
zones of the same mean radius but of different widths will possess the same inferred
profile $\zeta^*(\xi)$ if they possess the same actual profile.
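The coefficients in (4·5) are easily evaluated numerically. The sketch below is an illustration only, not the computation used in the paper; the function names are assumptions, and the sine and cosine integrals are summed from their standard power series, which is adequate for the moderate arguments (a few units) that arise here. The central setting c' = 0 is handled as the limiting case in which B reduces to log{(1 + p)/(1 − p)}, as in (4·2).

```python
import math

EULER_GAMMA = 0.5772156649015329


def sine_integral(x, terms=40):
    """Si(x) from its power series; suitable for moderate |x| (say below 10)."""
    total, term = 0.0, x  # term holds (-1)^k x^(2k+1) / (2k+1)!
    for k in range(terms):
        total += term / (2 * k + 1)
        term *= -x * x / ((2 * k + 2) * (2 * k + 3))
    return total


def cosine_integral(x, terms=40):
    """Ci(x) for x > 0: Euler's constant + log x + power series."""
    total, term = 0.0, 1.0  # term holds (-1)^k x^(2k) / (2k)!
    for k in range(1, terms):
        term *= -x * x / ((2 * k - 1) * (2 * k))
        total += term / (2 * k)
    return EULER_GAMMA + math.log(x) + total


def zone_coefficients(p, c_prime):
    """Return (A, B) of (4.5) for -1 < p < 1 and knife-edge setting c_prime."""
    si_sum = sine_integral(c_prime * (1 + p)) + sine_integral(c_prime * (1 - p))
    a = 1.0 - si_sum / math.pi
    if c_prime == 0.0:
        # Limit of Ci{|c'(1+p)|} - Ci{|c'(1-p)|} as c' tends to zero.
        b = math.log((1 + p) / (1 - p))
    else:
        b = (cosine_integral(abs(c_prime * (1 + p)))
             - cosine_integral(abs(c_prime * (1 - p))))
    return a, b


if __name__ == "__main__":
    for c_prime in (-2.0, -1.0, 0.0, 1.0):
        a, b = zone_coefficients(0.5, c_prime)
        print(f"c' = {c_prime:+.1f}  A = {a:.4f}  B = {b:.4f}  B/A = {b / a:.4f}")
```

Since neither A nor B involves δ, the same pair of numbers serves for every narrow zone of a given mean radius, which is the independence of zone-width noted above.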

By (4·4), $\zeta^*(\xi)$ is a linear combination of the two functions

\[
\zeta^{(1)}(\xi) = \int_{-1}^{1} \zeta(\tau) \log\frac{1}{|\tau - \xi|}\, d\tau,
\tag{4·6}
\]

and

\[
\zeta^{(2)}(\xi) = \int_{-1}^{\xi} \zeta(u)\, du.
\tag{4·7}
\]

The first of these is a Faltung or 'smear' of $\zeta(\tau)$ by means of the function $\log 1/|\tau|$.
Rapid changes in the slope of the zone-profile are smoothed and rounded off by the
smearing process, which also 'spreads' the zone, so that $\zeta^{(1)}(\tau)$ does not vanish everywhere
outside the interval $-1 < \tau < 1$, as $\zeta(\tau)$ does, but tails off gradually. Figure 1
illustrates this in the particular case

\[
\zeta(\tau) =
\begin{cases}
(1 - \tau^2)^2 & (-1 < \tau < 1), \\
0 & (\tau^2 \ge 1).
\end{cases}
\tag{4·8}
\]

When the zone has a symmetrical profile, i.e. when $\zeta(\tau)$ is an even function of $\tau$,
$\zeta^{(1)}(\tau)$ is likewise an even function and the 'smeared' zone is symmetrical. However,
the inferred profile (4·4) is not symmetrical, because of the effect of the term in $\zeta^{(2)}(\xi)$.
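The behaviour of the two functions (4·6) and (4·7) for the example profile (4·8) can be explored with a short numerical sketch. This is an illustration under stated assumptions rather than the computation behind the figures: the function names are invented for the purpose, and a plain midpoint rule is used, which tolerates the integrable logarithmic singularity provided that no sample point coincides with ξ.

```python
import math


def zone_profile(tau):
    """Example profile (4.8): (1 - tau^2)^2 inside the zone, zero outside."""
    return (1.0 - tau * tau) ** 2 if abs(tau) < 1.0 else 0.0


def smeared_profile(xi, n=4000):
    """Midpoint-rule estimate of (4.6): integral of zeta(tau) log(1/|tau - xi|)."""
    h = 2.0 / n
    total = 0.0
    for i in range(n):
        tau = -1.0 + (i + 0.5) * h
        if tau == xi:
            continue  # skip a sample falling exactly on the logarithmic singularity
        total += zone_profile(tau) * math.log(1.0 / abs(tau - xi))
    return total * h


def integrated_profile(xi, n=4000):
    """Midpoint-rule estimate of (4.7): integral of zeta(u) from -1 to xi."""
    upper = max(-1.0, min(1.0, xi))  # zeta vanishes outside the zone
    h = (upper + 1.0) / n
    return sum(zone_profile(-1.0 + (i + 0.5) * h) for i in range(n)) * h


if __name__ == "__main__":
    for xi in (-2.0, -1.0, -0.3, 0.0, 0.3, 1.0, 2.0):
        print(f"xi = {xi:+.1f}  smear = {smeared_profile(xi):+.4f}  "
              f"integral = {integrated_profile(xi):.4f}")
```

The printed values show the smear taking equal values at ±ξ and remaining non-zero beyond the zone, while the integral is constant on each side of the zone but with different values, 0 on the left and 16/15 on the right; it is this second function that makes the inferred profile unsymmetrical.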



The function $\zeta^{(2)}(\xi)$ is constant outside the interval $(-1, 1)$ but its value is not in
general the same on both sides of the interval. In the case of a high or low zone,
where $\zeta(\tau)$ is of constant sign in $(-1, 1)$, the smeared function $\zeta^{(1)}(\xi)$ is monotonic
in each of the $\xi$-ranges outside this interval; therefore the inferred profile

\[
\zeta^*(\xi) = A\, \zeta^{(1)}(\xi) + B\, \zeta^{(2)}(\xi)
\]

is likewise monotonic in $(-\infty, -1)$ and in $(1, \infty)$, whatever may be the values of
$A$ and $B$. That is to say, at no setting of the knife-edge do maxima or minima of the
inferred zone-profile ever lie outside the actual zone. But their positions within the
actual zone vary when the ratio $B/A$ varies; that is, they depend on the lateral
setting of the knife-edge and on the situation of the zone on the mirror disk.

[Figure 2. Vertical axis: B/A.]

The curves in figure 2 show the values of $B/A$ as $c'$ varies, for zones of mean radius
0·25, 0·5, 0·65, 0·80 respectively; the values of $c'$ which are of most practical importance
are those between $-2$ and $+1$ (Linfoot 1946, p. 86; figures 6a, b). In this range
of values of $c'$, $B/A$ may be approximated to with sufficient accuracy for practical
purposes by the expression $\alpha(p)(c' + 2)$, where

\[
\alpha(p) = 0·25,\ 0·5,\ 0·7,\ 1·1
\]

for zones of mean radius 0·25, 0·5, 0·65, 0·80 respectively.

In figure 3 are shown the inferred zone-profiles, calculated from (4·4), corresponding
to the actual profile (4·8) in the cases $c' = -1$, $c' = 0$. The value $c' = -1$
corresponds to an approximately 'half-lit' mirror disk and the value $c' = 0$ to a
centrally set knife-edge; in the latter case, the disk contains only one-third as much
light as with the knife-edge withdrawn (Linfoot 1946, p. 90). It will be seen that the
inferred zone spreads well beyond the boundaries of the actual zone and that its
crest is displaced outward by an amount which increases as $c'$ increases from $-1$
to 0. The displacement, though not very noticeable on the inferred profile, amounts
to an appreciable fraction of the actual zonal width when $c' = 0$.



Figure 3. Inferred error-profiles (4·4) in the case (4·8). The dotted curves show the shape
of the actual error-profile.


The zone 'seen' under the test in practice is a composite impression built up from
the inferred profiles for the values of $c'$ which make the zone show up most prominently
on the mirror disk, and these values may be taken to lie between $-2$ and
$+1$ when the mirror is nearly true. A ring-polisher made to suit the inferred zone
should therefore be found to work rather too much on the outer slopes of the zone
and, because it is too broad, should produce a somewhat different effect from that
intended. To reduce the zone with greater efficiency, the ring-polisher should be
designed to work almost entirely on the central part of the apparent zone and should
be set slightly inside the inferred crest-radius.

The well-known difficulty, in correcting mirrors by local polishing with the help 
of the Foucault test, of preventing the polisher from going outside the limits of the 
local error under treatment is usually regarded as a difficulty in the manual control 
of the polisher. According to the present analysis, it may be attributed, in part at 
least, to the errors inherent in the usual method of interpreting the Foucault 
shadows. 

It is natural to ask how far these conclusions, established only for sufficiently
narrow zones of very small height or depth, remain valid as useful approximations
when applied to zones of the widths and depths commonly met with in practice. To
estimate this, computations of the intensities in four selected special cases were made
from the basic formula (2·3), which for practical purposes can be regarded as exact,
and the corresponding inferred profiles were computed from (4·1). There is of course
a certain arbitrariness in the use of (4·1) to define the inferred error-profile when the
intensity-changes are no longer small.

Figure 4. Foucault intensities, at two different lateral settings of the knife-edge, for a local
zone fringe high. The dotted curves show the intensities for a true mirror. The error-profile
is shown below the graph.

Figure 6. Foucault intensities, at two different lateral settings of the knife-edge, for a local
zone i fringe high. The dotted curves show the intensities for a true mirror. The error-profile
is shown below the graph.

The four cases selected were high zones of mean radius 0·65, width 0·20, profile
shape $\zeta(\tau)$ given by (4·8), and heights ^ fringe and £ fringe respectively, tested with
the knife-edge at the lateral settings $c' = -1$ and $c' = 0$.

Figures 4 and 5 show the results for a zone of height ^ fringe. It will be seen that 
in this case the inferred profile derived from the approximate formula (4*4) agrees 
sufficiently well in shape with that derived by computation from the accurate 
formula (2*3). 



Figure 7. Inferred error-profiles corresponding to figure 6. The dotted curves show the
shape of the actual error-profile.


The results for a zone of height £ fringe are shown in figures 6 and 7. Here the
inequality between the apparent strengths of the zone on the left- and on the right-hand
zonal segments is considerable, and the agreement between (4·4) and the
accurate formula is not very close. This is hardly surprising, since the condition that
the depth of the zone should be small compared with $\lambda/2\pi$ is far from being satisfied.
Even in this case, however, it appears from figure 7 that the general conclusions
drawn above retain sufficient validity to be useful to the practical mirror-maker.


References 

Foucault, L. 1859 Ann. Obs. Imp. Paris, 5, 197-237.
Gascoigne, S. C. B. 1945 Mon. Not. R. Astr. Soc. 104, 320-334.
Linfoot, E. H. 1946 Proc. Roy. Soc. A, 186, 72-99.





The hydrodynamics of non-Newtonian fluids. I 

By R. S. Rivlin

British Rubber Producers' Research Association, Welwyn Garden City, Herts.,
England, and National Bureau of Standards, Washington, D.C., U.S.A.

(Communicated by E. K. Rideal, F.R.S. Received 12 November 1947)


The classical theory of the hydrodynamics of viscous fluids depends on the assumption of a
particular law governing the relations between the components of stress in a fluid and those
of the strain-velocity. This assumption limits its applicability to Newtonian fluids. Here, the
most general possible relations between the stress and strain-velocity components, which
can be obeyed by an incompressible, visco-inelastic fluid, are derived. These relations also
apply to an incompressible, visco-elastic fluid in a steady state of laminar flow. It is shown
how equations of motion and boundary conditions can be obtained if these relations are
known. Two problems involving laminar flow are then discussed in some detail. These are:
(i) the torsional motion of a cylindrical mass of fluid, produced by means of forces applied
to its plane ends, and (ii) the laminar flow of a mass of fluid contained between two coaxial
cylinders rotating with different angular velocities.

It is found in case (i) that, in general, normal tractions must be applied to the plane 
surfaces of the fluid, in addition to the azimuthal tractions expected from the classical 
theory, in order to produce the specified motion. Analogous results are obtained in case (ii). 
These results apply even when centrifugal forces are neglected and so imply a qualitative 
difference between the behaviour of fluids in general and those for which the special case of 
classical hydrodynamics is valid. 


1. Introduction 

The classical theory of the hydrodynamics of viscous fluids seeks to answer questions 
of the following types: (i) If a mass of fluid is subjected to a specified system of forces, 
what is the resulting flow configuration in the fluid ? (ii) What are the forces necessary 
to produce a specified state of flow in a fluid ? 

In order to answer these questions, some mathematical description must be given
of the fundamental physical properties of the fluid as regards flow. In the classical
theory, the assumption is made in the form of a definite relationship between the
stress components at a point of the fluid and the strain-velocity components. This
relationship is linear, invariant under a transformation of the axes of reference and
does not involve time derivatives of the stress or strain-velocity components. From
a mathematical point of view, it forms a sufficiently comprehensive description of
the basic flow properties of an ideal fluid to form an adequate starting-point for the
development of a mathematical theory. The applicability of the results of this theory
is, of course, limited by the assumed stress-strain-velocity relationships. The widespread
agreement of the predictions of the theory with the experimental results,
for such fluids as water, air, mercury, etc., provides justification for it and inspires
confidence in its predictions.

It has long been realized, however, that the fundamental stress-strain-velocity
relations of classical hydrodynamics are not applicable to a vast number of highly-viscous
fluids. Many of these have great technological importance and, with the
growth of scientific investigation of macro-molecular materials and colloids, an
increasing scientific interest. To avoid confusion over the use of these terms, it is
stated explicitly that, in this paper, fluids will be described as Newtonian or non-Newtonian
according as the classical hydrodynamics does or does not apply to
them. Non-Newtonian fluids may be either visco-elastic or visco-inelastic.

In the present paper, a hydrodynamic theory is formulated without the restrictions
implied by the assumed stress-strain-velocity relations of the classical theory.
The only physical assumptions made regarding the fluid are (i) that it is incompressible,
(ii) that it is visco-inelastic, and (iii) that it is isotropic when all
components of the strain-velocity, as classically defined, are zero.

The mere fact that the material considered is a visco-inelastic fluid implies that 
the stress-strain-velocity relations do not involve time derivatives of the stress or 
strain-velocity components. The classical theory, for incompressible fluids, forms 
a special and simplest case of the theory developed here. 

In part A, the kinematics of flow is developed in a form suitable for the purposes 
of the present theory. 

In part B, the most general possible form of the stress-strain-velocity relations, 
subject to the limitations implied by the assumptions made above, are obtained and 
the equations of motion and boundary conditions are discussed. 

In part C, two simple problems involving steady-state laminar flow are discussed. 
These are: (i) the torsional motion of a cylindrical mass of fluid, produced by forces 
applied to the plane ends, and (ii) the torsional motion of a mass of fluid between 
infinite coaxial cylinders rotating with different angular velocities about their 
common axis. 

In both cases, the relation between the force system and the flow-velocity and the 
resulting stress distribution in the fluid are derived. 

Throughout the paper a rectangular, Cartesian reference system has been used. 
Much of the discussion could, of course, be readily expressed in the notation of the 
tensor calculus, but in the interests of ease of application and availability to a wider 
circle of readers, this has not been done. 


A. THE KINEMATICS OF FLOW 
2. Description of a state of steady flow 

Consider a viscous, incompressible fluid in a state of steady flow. The state of
flow may be described completely with reference to a fixed rectangular, Cartesian
co-ordinate system $(x, y, z)$. The velocity of a point of the fluid which is at $(x, y, z)$
has components $u$, $v$ and $w$ parallel to the axes $x$, $y$ and $z$ respectively. Since the
state of flow is steady, $u$, $v$ and $w$ are functions of $x$, $y$ and $z$ only and independent of
time.

The velocities of two neighbouring points $P$ and $P'$ situated at $(x, y, z)$ and
$(x + \delta x, y + \delta y, z + \delta z)$ respectively have components $(u, v, w)$ and
$(u + \delta u, v + \delta v, w + \delta w)$ respectively. Then $\delta u$, $\delta v$ and $\delta w$ are the components
of the velocity of $P'$ relative to $P$. Since $u$, $v$ and $w$ are functions of $x$, $y$ and $z$ only, we have

\[
\delta u = u_x\, \delta x + u_y\, \delta y + u_z\, \delta z,
\tag{2·1}
\]

together with two similar equations for $\delta v$ and $\delta w$.

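The velocity of $P'$ relative to $P$ can be considered to be made up of two parts:

(i) that due to the pure rotation of the element of volume containing $PP'$; this
has components $\delta\varpi_1$, $\delta\varpi_2$ and $\delta\varpi_3$, given by

\[
\delta\varpi_1 = \tfrac{1}{2}(\eta\, \delta z - \zeta\, \delta y), \quad
\delta\varpi_2 = \tfrac{1}{2}(\zeta\, \delta x - \xi\, \delta z) \quad \text{and} \quad
\delta\varpi_3 = \tfrac{1}{2}(\xi\, \delta y - \eta\, \delta x),
\tag{2·2}
\]

in which

\[
\xi = w_y - v_z, \quad \eta = u_z - w_x \quad \text{and} \quad \zeta = v_x - u_y;
\tag{2·3}
\]

(ii) that due to the irrotational deformation of the element of volume containing
$PP'$; its components $\delta u'$, $\delta v'$ and $\delta w'$ are given by

\[
\begin{aligned}
\delta u' &= a\, \delta x + \tfrac{1}{2} h\, \delta y + \tfrac{1}{2} g\, \delta z, \\
\delta v' &= \tfrac{1}{2} h\, \delta x + b\, \delta y + \tfrac{1}{2} f\, \delta z, \\
\delta w' &= \tfrac{1}{2} g\, \delta x + \tfrac{1}{2} f\, \delta y + c\, \delta z,
\end{aligned}
\tag{2·4}
\]

where

\[
a = u_x, \quad b = v_y, \quad c = w_z, \quad f = v_z + w_y, \quad g = w_x + u_z \quad \text{and} \quad h = u_y + v_x.
\tag{2·5}
\]

It can readily be seen that

\[
\delta u = \delta u' + \delta\varpi_1, \quad \delta v = \delta v' + \delta\varpi_2 \quad \text{and} \quad \delta w = \delta w' + \delta\varpi_3,
\tag{2·6}
\]

where $\delta u$, $\delta v$ and $\delta w$ are given by (2·1).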
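The decomposition (2·2) to (2·6) can be verified numerically for any set of velocity derivatives. The sketch below is an illustration only; the function name and the representation of the velocity derivatives as a nested tuple are assumptions, not notation from the paper. It forms the rotational and irrotational parts from (2·2) to (2·5) and returns them together with the total relative velocity of (2·1).

```python
def split_relative_velocity(grad, dr):
    """Split the velocity of P' relative to P into rotation and deformation parts.

    grad[i][j] is the derivative of velocity component i (u, v, w) with respect
    to coordinate j (x, y, z); dr = (dx, dy, dz) is the element PP'.
    Returns (total, rotation, deformation), each a 3-tuple.
    """
    (ux, uy, uz), (vx, vy, vz), (wx, wy, wz) = grad
    dx, dy, dz = dr

    # Rotation components, equations (2.2) and (2.3).
    xi, eta, zeta = wy - vz, uz - wx, vx - uy
    rotation = (0.5 * (eta * dz - zeta * dy),
                0.5 * (zeta * dx - xi * dz),
                0.5 * (xi * dy - eta * dx))

    # Irrotational deformation, equations (2.4) and (2.5).
    a, b, c = ux, vy, wz
    f, g, h = vz + wy, wx + uz, uy + vx
    deformation = (a * dx + 0.5 * h * dy + 0.5 * g * dz,
                   0.5 * h * dx + b * dy + 0.5 * f * dz,
                   0.5 * g * dx + 0.5 * f * dy + c * dz)

    # Total relative velocity, equation (2.1) and its two companions.
    total = tuple(sum(grad[i][j] * dr[j] for j in range(3)) for i in range(3))
    return total, rotation, deformation


if __name__ == "__main__":
    # Arbitrary trace-free derivatives (incompressible flow) and an arbitrary element.
    grad = ((0.2, 1.0, 0.0), (0.3, -0.5, 0.4), (0.0, 0.7, 0.3))
    dr = (0.01, -0.02, 0.03)
    total, rotation, deformation = split_relative_velocity(grad, dr)
    print("total      :", total)
    print("sum of parts:", tuple(r + d for r, d in zip(rotation, deformation)))
```

The rotational part is always perpendicular to the element PP', which is why only the coefficients a, b, c, f, g and h appear in the expression for δV δs cos θ.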

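The magnitude $\delta V$ of the relative velocity $(\delta u, \delta v, \delta w)$ is given by

\[
\begin{aligned}
(\delta V)^2 &= (\delta u)^2 + (\delta v)^2 + (\delta w)^2 \\
&= A(\delta x)^2 + B(\delta y)^2 + C(\delta z)^2 + F\, \delta y\, \delta z + G\, \delta z\, \delta x + H\, \delta x\, \delta y,
\end{aligned}
\tag{2·7}
\]

where

\[
\begin{aligned}
A &= u_x^2 + v_x^2 + w_x^2, & F &= 2(u_y u_z + v_y v_z + w_y w_z), \\
B &= u_y^2 + v_y^2 + w_y^2, & G &= 2(u_z u_x + v_z v_x + w_z w_x), \\
C &= u_z^2 + v_z^2 + w_z^2, & H &= 2(u_x u_y + v_x v_y + w_x w_y).
\end{aligned}
\tag{2·8}
\]

Again, the magnitude $\delta V'$ of the irrotational relative velocity $(\delta u', \delta v', \delta w')$ is
given by

\[
\begin{aligned}
(\delta V')^2 &= (\delta u')^2 + (\delta v')^2 + (\delta w')^2 \\
&= A'(\delta x)^2 + B'(\delta y)^2 + C'(\delta z)^2 + F'\, \delta y\, \delta z + G'\, \delta z\, \delta x + H'\, \delta x\, \delta y,
\end{aligned}
\tag{2·9}
\]

where

\[
\begin{aligned}
A' &= a^2 + \tfrac{1}{4}(g^2 + h^2), & F' &= \tfrac{1}{2} g h + (b + c) f, \\
B' &= b^2 + \tfrac{1}{4}(h^2 + f^2), & G' &= \tfrac{1}{2} h f + (c + a) g, \\
C' &= c^2 + \tfrac{1}{4}(f^2 + g^2), & H' &= \tfrac{1}{2} f g + (a + b) h,
\end{aligned}
\tag{2·10}
\]

and $a$, $b$, $c$, $f$, $g$ and $h$ are given by (2·5).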




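Since the length $\delta s$ of the element $PP'$ is given by

\[
(\delta s)^2 = (\delta x)^2 + (\delta y)^2 + (\delta z)^2,
\tag{2·11}
\]

and its direction-cosines $(l, m, n)$ by

\[
l = \delta x/\delta s, \quad m = \delta y/\delta s \quad \text{and} \quad n = \delta z/\delta s,
\tag{2·12}
\]

we obtain, from (2·7),

\[
(dV/ds)^2 = (\delta V/\delta s)^2 = A l^2 + B m^2 + C n^2 + F m n + G n l + H l m,
\tag{2·13}
\]

and, from (2·9),

\[
(dV'/ds)^2 = (\delta V'/\delta s)^2 = A' l^2 + B' m^2 + C' n^2 + F' m n + G' n l + H' l m.
\tag{2·14}
\]

If $\theta$ is the angle between the directions of the relative velocity $(\delta u, \delta v, \delta w)$ and the
linear element $PP'$, we have

\[
\begin{aligned}
\delta V\, \delta s \cos\theta &= \delta x\, \delta u + \delta y\, \delta v + \delta z\, \delta w \\
&= a(\delta x)^2 + b(\delta y)^2 + c(\delta z)^2 + f\, \delta y\, \delta z + g\, \delta z\, \delta x + h\, \delta x\, \delta y.
\end{aligned}
\tag{2·15}
\]

Thus, employing the relations (2·11), (2·12) and (2·15), we have

\[
(dV/ds) \cos\theta = (\delta V/\delta s) \cos\theta = a l^2 + b m^2 + c n^2 + f m n + g n l + h l m.
\tag{2·16}
\]

If $\theta'$ is the angle between the irrotational relative velocity $(\delta u', \delta v', \delta w')$ and the
element $PP'$, we can similarly obtain

\[
\delta V'\, \delta s \cos\theta' = a(\delta x)^2 + b(\delta y)^2 + c(\delta z)^2 + f\, \delta y\, \delta z + g\, \delta z\, \delta x + h\, \delta x\, \delta y,
\tag{2·17}
\]

or

\[
(dV'/ds) \cos\theta' = (\delta V'/\delta s) \cos\theta' = a l^2 + b m^2 + c n^2 + f m n + g n l + h l m.
\tag{2·18}
\]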

3. The flow quadrics

If we consider a small element of the fluid in the neighbourhood of the point
$(x, y, z)$ and take $(\delta x, \delta y, \delta z)$ as the current co-ordinates of a point $P'$ in this element,
in the co-ordinate system $(\delta x, \delta y, \delta z)$ whose origin $P$ is at $(x, y, z)$ and whose axes are
parallel to those of the original reference axes $(x, y, z)$, then equation (2·7), in which
$\delta V$ is given some constant value, represents the surface on which the velocity of $P'$
relative to $P$ has this value. We can choose the axes of reference $(x, y, z)$, and hence
$(\delta x, \delta y, \delta z)$, to be parallel to the axes of this quadric. The axes of reference will, of
course, then vary in direction from point to point of the fluid. With this choice of
axes the equation of the quadric takes the form

\[
V_1^2 (\delta x)^2 + V_2^2 (\delta y)^2 + V_3^2 (\delta z)^2 = (\delta V)^2,
\tag{3·1}
\]

where $V_1^2$, $V_2^2$ and $V_3^2$ are the values of $V^2$ given by

\[
\begin{vmatrix}
A - V^2 & \tfrac{1}{2} H & \tfrac{1}{2} G \\
\tfrac{1}{2} H & B - V^2 & \tfrac{1}{2} F \\
\tfrac{1}{2} G & \tfrac{1}{2} F & C - V^2
\end{vmatrix} = 0.
\tag{3·2}
\]




If we put $\delta y = \delta z = 0$ in (3·1), we obtain

\[
(\delta V)^2 = V_1^2 (\delta x)^2.
\]

Thus, $V_1^2$ is inherently positive. Analogously, $V_2^2$ and $V_3^2$ are positive. Therefore, the
quadric represented by equation (3·1) or (2·7) is an ellipsoid.

The lengths of the semi-axes of the flow ellipsoid (3·1) or (2·7) are proportional to
$1/V_1$, $1/V_2$ and $1/V_3$ and it should be noted that the motion of the element of fluid in
the neighbourhood of $P$ (apart from any translational motion of the element as
a whole) may be considered to consist of three components of relative motion,
parallel to the axes of the ellipsoid (3·1). These components are $V_1$, $V_2$ and $V_3$ per unit
length in the appropriate direction.

In a similar manner, equation (2·9), in which $\delta V'$ has some constant value, represents
a surface on which the irrotational portion of the velocity of $P'$ relative to $P$
has the value $\delta V'$. Again, we can choose the axes of reference to be parallel to the
axes of the ellipsoid (2·9). Then, equation (2·9) takes the form

\[
V_1'^2 (\delta x)^2 + V_2'^2 (\delta y)^2 + V_3'^2 (\delta z)^2 = (\delta V')^2,
\tag{3·3}
\]

where $V_1'^2$, $V_2'^2$ and $V_3'^2$ are the values of $V'^2$ given by

\[
\begin{vmatrix}
A' - V'^2 & \tfrac{1}{2} H' & \tfrac{1}{2} G' \\
\tfrac{1}{2} H' & B' - V'^2 & \tfrac{1}{2} F' \\
\tfrac{1}{2} G' & \tfrac{1}{2} F' & C' - V'^2
\end{vmatrix} = 0.
\tag{3·4}
\]

As in the case of equation (2·7), it can be seen that the quadric represented by (3·3)
is an ellipsoid.

The lengths of the semi-axes of the flow ellipsoid (2·9) or (3·3) are proportional to
$1/V_1'$, $1/V_2'$ and $1/V_3'$. Again, it is noted that the irrotational portion of the motion in
the fluid element in the neighbourhood of $P$ may be considered to consist of three
components of relative motion, parallel to the axes of the ellipsoid (2·9). These components
are $V_1'$, $V_2'$ and $V_3'$ per unit length in their directions.

Again, let us consider the flow quadric (2.17). Transforming this to its axes as axes of reference, it takes the form

$$U_1(\delta x)^2 + U_2(\delta y)^2 + U_3(\delta z)^2 = \delta V'\,\delta s\cos\theta', \tag{3.5}$$

where $U_1$, $U_2$ and $U_3$ are the values of $U$ given by

$$\begin{vmatrix} a - U & \tfrac12 h & \tfrac12 g \\ \tfrac12 h & b - U & \tfrac12 f \\ \tfrac12 g & \tfrac12 f & c - U \end{vmatrix} = 0. \tag{3.6}$$

It can be seen that the direction of the velocity $\delta V'$ at any point of the surface of the quadric (2.17) or (3.5) is that of the normal to the quadric at that point.




4. Scalar invariants of the state of flow


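From equation (3.2), we see that $V_1^2$, $V_2^2$ and $V_3^2$ are the values of $V^2$ for which the equation

$$V^6 - I_1V^4 + I_2V^2 - I_3 = 0, \tag{4.1}$$

in which

$$\begin{aligned} I_1 &= A + B + C, \\ I_2 &= AB + BC + CA - \tfrac14 F^2 - \tfrac14 G^2 - \tfrac14 H^2 \\ \text{and}\quad I_3 &= ABC + \tfrac14 FGH - \tfrac14 AF^2 - \tfrac14 BG^2 - \tfrac14 CH^2, \end{aligned} \tag{4.2}$$

is satisfied. Thus, we have

$$\begin{aligned} I_1 &= V_1^2 + V_2^2 + V_3^2, \\ I_2 &= V_1^2V_2^2 + V_2^2V_3^2 + V_3^2V_1^2 \\ \text{and}\quad I_3 &= V_1^2V_2^2V_3^2. \end{aligned} \tag{4.3}$$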

Since the solutions of (4.1) for $V^2$ are independent of the choice of the axes of reference, any symmetrical function of $V_1^2$, $V_2^2$ and $V_3^2$ must be independent of the choice of the axes of reference and must therefore be a scalar invariant of the flow with respect to a transformation of axes. In particular, the three independent symmetrical functions $I_1$, $I_2$ and $I_3$ of $V_1^2$, $V_2^2$ and $V_3^2$, the latter being given by equations (3.2) or (4.1), must be scalar invariants of the flow. This could be directly proved by carrying out the transformation of the expressions on the right-hand side of equations (4.2) from one rectangular, Cartesian co-ordinate system to another, bearing in mind that $A$, $B$, $C$, $F$, $G$ and $H$ are defined, in any rectangular, Cartesian co-ordinate system, by (2.8).

Again, by writing equation (3.4) in the form

$$V'^6 - J_1V'^4 + J_2V'^2 - J_3 = 0, \tag{4.4}$$

where

$$\begin{aligned} J_1 &= A' + B' + C', \\ J_2 &= A'B' + B'C' + C'A' - \tfrac14 F'^2 - \tfrac14 G'^2 - \tfrac14 H'^2 \\ \text{and}\quad J_3 &= A'B'C' + \tfrac14 F'G'H' - \tfrac14 A'F'^2 - \tfrac14 B'G'^2 - \tfrac14 C'H'^2, \end{aligned} \tag{4.5}$$

we readily see that if $V_1'^2$, $V_2'^2$ and $V_3'^2$ are the values of $V'^2$ for which equation (4.4) is satisfied,

$$\begin{aligned} J_1 &= V_1'^2 + V_2'^2 + V_3'^2, \\ J_2 &= V_1'^2V_2'^2 + V_2'^2V_3'^2 + V_3'^2V_1'^2 \\ \text{and}\quad J_3 &= V_1'^2V_2'^2V_3'^2. \end{aligned} \tag{4.6}$$

By an argument similar to that used in establishing the invariance of $I_1$, $I_2$ and $I_3$, it can be seen that $J_1$, $J_2$ and $J_3$, given by (4.5) or (4.6), are independent scalar invariants of the flow.

Again, by writing equation (3.6) in the form

$$U^3 - K_1U^2 + K_2U - K_3 = 0, \tag{4.7}$$

where

$$\begin{aligned} K_1 &= U_1 + U_2 + U_3 = a + b + c, \\ K_2 &= U_2U_3 + U_3U_1 + U_1U_2 = ab + bc + ca - \tfrac14 f^2 - \tfrac14 g^2 - \tfrac14 h^2 \\ \text{and}\quad K_3 &= U_1U_2U_3 = abc + \tfrac14 fgh - \tfrac14 af^2 - \tfrac14 bg^2 - \tfrac14 ch^2, \end{aligned} \tag{4.8}$$

we can establish that $K_1$, $K_2$ and $K_3$ are independent scalar invariants of the flow. $U_1$, $U_2$, $U_3$ are the values of $U$ for which equation (4.7) is satisfied.




5. Relations between the scalar invariants of flow 


In this section we shall state certain relationships between the scalar invariants of flow $K_1$, $K_2$ and $K_3$, defined by (4.8), and the scalar invariants of flow $J_1$, $J_2$ and $J_3$, defined by (4.5).

The relations are

$$J_1 = -2K_2 + K_1^2, \quad J_2 = K_2^2 - 2K_1K_3 \quad\text{and}\quad J_3 = K_3^2. \tag{5.1}$$

These may be readily proved by employing equations (4.8) to express $K_1$, $K_2$ and $K_3$ in terms of $a$, $b$, $c$, $f$, $g$ and $h$ and equations (4.5), together with equations (2.10), to express $J_1$, $J_2$ and $J_3$ in terms of $a$, $b$, $c$, $f$, $g$ and $h$. By simple algebraic manipulation, the relations (5.1) are derived.

If the fluid is incompressible, it can readily be shown (see, for example, Lamb (1924, §12)) that

$$K_1 = a + b + c = 0. \tag{5.2}$$

In this case, the relationships (5.1) become

$$J_1 = -2K_2, \quad J_2 = K_2^2 \quad\text{and}\quad J_3 = K_3^2. \tag{5.3}$$

The following further relationship, which is readily proved, by simple algebraic manipulation, from (2.10) and (4.8), is worthy of note:

$$aA' + bB' + cC' + \tfrac12 fF' + \tfrac12 gG' + \tfrac12 hH' = K_1^3 - 3K_1K_2 + 3K_3. \tag{5.4}$$

If the fluid is incompressible, so that the relation (5.2) is obeyed, equation (5.4) becomes

$$aA' + bB' + cC' + \tfrac12 fF' + \tfrac12 gG' + \tfrac12 hH' = 3K_3. \tag{5.5}$$
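The relations (5.1) and (5.4) can also be spot-checked numerically. The following is a minimal sketch, not part of the original argument; it assumes that (2.10) has the form $A' = a^2 + \tfrac14(g^2 + h^2)$, $F' = \tfrac12 gh + (b + c)f$ together with the cyclic counterparts, and the function names and sample values are purely illustrative.

```python
def k_invariants(a, b, c, f, g, h):
    """Invariants formed on the pattern of (4.8) from six components."""
    k1 = a + b + c
    k2 = a * b + b * c + c * a - (f * f + g * g + h * h) / 4
    k3 = a * b * c + f * g * h / 4 - (a * f * f + b * g * g + c * h * h) / 4
    return k1, k2, k3


def primed(a, b, c, f, g, h):
    """A', B', C', F', G', H' from a, ..., h (assumed cyclic form of (2.10))."""
    A = a * a + (g * g + h * h) / 4
    B = b * b + (h * h + f * f) / 4
    C = c * c + (f * f + g * g) / 4
    F = g * h / 2 + (b + c) * f
    G = h * f / 2 + (c + a) * g
    H = f * g / 2 + (a + b) * h
    return A, B, C, F, G, H


def residuals(a, b, c, f, g, h):
    """Left-hand minus right-hand sides of the three relations (5.1) and of (5.4)."""
    k1, k2, k3 = k_invariants(a, b, c, f, g, h)
    A, B, C, F, G, H = primed(a, b, c, f, g, h)
    # (4.5) has the same pattern as (4.8), applied to the primed quantities.
    j1, j2, j3 = k_invariants(A, B, C, F, G, H)
    lhs_54 = a * A + b * B + c * C + (f * F + g * G + h * H) / 2
    return [
        j1 - (k1 ** 2 - 2 * k2),
        j2 - (k2 ** 2 - 2 * k1 * k3),
        j3 - k3 ** 2,
        lhs_54 - (k1 ** 3 - 3 * k1 * k2 + 3 * k3),
    ]


# Arbitrary (compressible) sample values; every residual should be at rounding level.
print(residuals(0.3, -1.1, 0.7, 0.4, -0.9, 1.6))
```

Choosing $c = -(a + b)$ in the sample reproduces the incompressible forms (5.3) and (5.5).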


6. Relations between the axes of the flow quadrics

In this section we shall first prove that

$$V_1'^2 = U_1^2, \quad V_2'^2 = U_2^2 \quad\text{and}\quad V_3'^2 = U_3^2, \tag{6.1}$$

where $V_1'$, $V_2'$ and $V_3'$ and $U_1$, $U_2$ and $U_3$ are defined as in §3. Then, it will be shown that the axes of the flow quadrics (2.9) and (2.17), at any point, are parallel.

Squaring equation (4.7), we have

$$U^6 - 2K_1U^5 + (K_1^2 + 2K_2)U^4 - (2K_3 + 2K_1K_2)U^3 + (2K_1K_3 + K_2^2)U^2 - 2K_2K_3U + K_3^2 = 0. \tag{6.2}$$

From (4.7),

$$U^3 = K_1U^2 - K_2U + K_3.$$

Introducing this expression for $U^3$ in the term of (6.2) in $U^5$, we have

$$U^6 - (K_1^2 - 2K_2)U^4 - 2K_3U^3 + K_2^2U^2 - 2K_2K_3U + K_3^2 = 0. \tag{6.3}$$

Again, from (4.7),

$$K_2U = K_1U^2 + K_3 - U^3.$$

Employing this to substitute for $K_2U$ in the term $-2K_2K_3U$ of (6.3), we obtain

$$U^6 - (K_1^2 - 2K_2)U^4 + (K_2^2 - 2K_1K_3)U^2 - K_3^2 = 0. \tag{6.4}$$




Introducing into (6.4) the relations (5.1), we obtain

$$U^6 - J_1U^4 + J_2U^2 - J_3 = 0. \tag{6.5}$$

Comparing equations (6.5) and (4.4), it can readily be seen that the values of $U^2$ for which equation (6.5) is satisfied are equal to the values of $V'^2$ for which equation (4.4) is satisfied. Now, since equation (6.5) is obtained by squaring (4.7), the values $U_1$, $U_2$ and $U_3$ of $U$ which satisfy (4.7) also satisfy (6.5). Thus, we obtain the relations (6.1).

Let $(l_1, m_1, n_1)$ be the direction-cosines of an axis of the flow quadric (2.17) with respect to the co-ordinate system $(x, y, z)$. Then, if $U_1$ is the value of $U$ for this axis, $l_1$, $m_1$ and $n_1$ are given by

$$\begin{aligned} (a - U_1)l_1 + \tfrac12 hm_1 + \tfrac12 gn_1 &= 0, \\ \tfrac12 hl_1 + (b - U_1)m_1 + \tfrac12 fn_1 &= 0 \\ \text{and}\quad \tfrac12 gl_1 + \tfrac12 fm_1 + (c - U_1)n_1 &= 0. \end{aligned} \tag{6.6}$$

Multiplying these three equations by $a$, $\tfrac12 h$ and $\tfrac12 g$ respectively and adding, bearing in mind the relations (2.10), we obtain

$$A'l_1 + \tfrac12 H'm_1 + \tfrac12 G'n_1 - U_1(al_1 + \tfrac12 hm_1 + \tfrac12 gn_1) = 0. \tag{6.7}$$

Substituting from the first of equations (6.6) in (6.7), we obtain

$$(A' - U_1^2)l_1 + \tfrac12 H'm_1 + \tfrac12 G'n_1 = 0.$$

Introducing into this equation the first of the results (6.1), we obtain

$$(A' - V_1'^2)l_1 + \tfrac12 H'm_1 + \tfrac12 G'n_1 = 0. \tag{6.8}$$

In an analogous manner, we can obtain the further equations

$$\begin{aligned} \tfrac12 H'l_1 + (B' - V_1'^2)m_1 + \tfrac12 F'n_1 &= 0 \\ \text{and}\quad \tfrac12 G'l_1 + \tfrac12 F'm_1 + (C' - V_1'^2)n_1 &= 0. \end{aligned} \tag{6.9}$$

Now, equations (6.8) and (6.9) are the equations determining the direction-cosines $(l_1, m_1, n_1)$ of an axis of the ellipsoid (2.9) with respect to $x$, $y$, $z$. Thus, the flow quadrics (2.9) and (2.17) must have their axes parallel.
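The result of this section can be illustrated numerically. The sketch below is an illustration only: it builds the strain-velocity components from three assumed principal values $U_k$ and an assumed set of mutually perpendicular axes, forms the primed quantities as the entries of the squared matrix (which is what (2.10) amounts to), and checks that each axis satisfies (6.8) and (6.9) with $V_k'^2 = U_k^2$. The helper names and the angles used to generate the axes are hypothetical.

```python
import math


def matmul(p, q):
    """Product of two 3x3 matrices stored as nested lists."""
    return [[sum(p[i][k] * q[k][j] for k in range(3)) for j in range(3)]
            for i in range(3)]


def axes(alpha, beta):
    """Rows are three mutually perpendicular unit vectors (l_k, m_k, n_k)."""
    ca, sa = math.cos(alpha), math.sin(alpha)
    cb, sb = math.cos(beta), math.sin(beta)
    rz = [[ca, -sa, 0.0], [sa, ca, 0.0], [0.0, 0.0, 1.0]]
    rx = [[1.0, 0.0, 0.0], [0.0, cb, -sb], [0.0, sb, cb]]
    return matmul(rz, rx)


def strain_from_axes(rows, principal):
    """Matrix [[a, h/2, g/2], [h/2, b, f/2], [g/2, f/2, c]] with the given axes."""
    return [[sum(principal[k] * rows[k][i] * rows[k][j] for k in range(3))
             for j in range(3)] for i in range(3)]


def max_residual(alpha, beta, principal):
    """Largest violation of (6.8), (6.9) when V'_k^2 is replaced by U_k^2."""
    rows = axes(alpha, beta)
    e = strain_from_axes(rows, principal)
    e2 = matmul(e, e)  # entries are A', H'/2, G'/2, ... in the sense of (2.10)
    worst = 0.0
    for k in range(3):
        for i in range(3):
            lhs = sum(e2[i][j] * rows[k][j] for j in range(3))
            worst = max(worst, abs(lhs - principal[k] ** 2 * rows[k][i]))
    return worst


# Principal values chosen to sum to zero (incompressible case); result is ~1e-16.
print(max_residual(0.4, 1.1, (0.5, -1.2, 0.7)))
```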

7. Transformation of the strain-velocity components 

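If the rectangular, Cartesian co-ordinate systems $(x, y, z)$ and $(x_0, y_0, z_0)$ are oriented with respect to each other according to the scheme

$$\begin{array}{c|ccc} & x & y & z \\ \hline x_0 & l_1 & m_1 & n_1 \\ y_0 & l_2 & m_2 & n_2 \\ z_0 & l_3 & m_3 & n_3 \end{array}$$

then

$$\begin{aligned} x_0 &= l_1x + m_1y + n_1z, \\ y_0 &= l_2x + m_2y + n_2z \\ \text{and}\quad z_0 &= l_3x + m_3y + n_3z. \end{aligned} \tag{7.1}$$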





If $(u_0, v_0, w_0)$ are the velocity components in the co-ordinate system $(x_0, y_0, z_0)$ corresponding to the velocity components $(u, v, w)$ in the co-ordinate system $(x, y, z)$, we have

$$\begin{aligned} u &= l_1u_0 + l_2v_0 + l_3w_0, \\ v &= m_1u_0 + m_2v_0 + m_3w_0 \\ \text{and}\quad w &= n_1u_0 + n_2v_0 + n_3w_0. \end{aligned} \tag{7.2}$$

Now, employing the relations (7.1) and (7.2), the six equations relating $(a, b, c, \tfrac12 f, \tfrac12 g, \tfrac12 h)$ and $(a_0, b_0, c_0, \tfrac12 f_0, \tfrac12 g_0, \tfrac12 h_0)$ can be obtained as

$$\begin{aligned} a = u_x &= l_1^2a_0 + l_2^2b_0 + l_3^2c_0 + l_2l_3f_0 + l_3l_1g_0 + l_1l_2h_0, \ \text{etc.} \\ \text{and}\quad \tfrac12 f = \tfrac12(v_z + w_y) &= m_1n_1a_0 + m_2n_2b_0 + m_3n_3c_0 + (m_2n_3 + m_3n_2)(\tfrac12 f_0) \\ &\quad + (m_3n_1 + m_1n_3)(\tfrac12 g_0) + (m_1n_2 + m_2n_1)(\tfrac12 h_0), \ \text{etc.,} \end{aligned} \tag{7.3}$$

where $a_0$, $b_0$, $c_0$, $\tfrac12 f_0$, $\tfrac12 g_0$, $\tfrac12 h_0$ are the components of the strain-velocity in the co-ordinate system $(x_0, y_0, z_0)$; i.e. $a_0 = \partial u_0/\partial x_0$, etc.

In (7.3) the expressions for $b$ and $\tfrac12 g$ are obtained from those for $a$ and $\tfrac12 f$ by replacing $l$ by $m$, $m$ by $n$ and $n$ by $l$. Those for $c$ and $\tfrac12 h$ are similarly obtained from the expressions for $b$ and $\tfrac12 g$.

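From the relations (7.3) we can readily find $A'$, $B'$, $C'$, $\tfrac12 F'$, $\tfrac12 G'$ and $\tfrac12 H'$ in terms of $A_0'$, $B_0'$, $C_0'$, $\tfrac12 F_0'$, $\tfrac12 G_0'$ and $\tfrac12 H_0'$, where

$$A_0' = a_0^2 + \tfrac14(g_0^2 + h_0^2), \ \text{etc.} \quad\text{and}\quad F_0' = \tfrac12 g_0h_0 + (b_0 + c_0)f_0, \ \text{etc.} \tag{7.4}$$

We obtain

$$\begin{aligned} A' &= l_1^2A_0' + l_2^2B_0' + l_3^2C_0' + 2l_2l_3(\tfrac12 F_0') + 2l_3l_1(\tfrac12 G_0') + 2l_1l_2(\tfrac12 H_0'), \ \text{etc.} \\ \text{and}\quad \tfrac12 F' &= m_1n_1A_0' + m_2n_2B_0' + m_3n_3C_0' + (m_2n_3 + m_3n_2)(\tfrac12 F_0') \\ &\quad + (m_3n_1 + m_1n_3)(\tfrac12 G_0') + (m_1n_2 + m_2n_1)(\tfrac12 H_0'), \ \text{etc.} \end{aligned} \tag{7.5}$$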


B. THE DYNAMICS OF NON-NEWTONIAN FLUIDS 
8. The dissipation-function

If we consider an element, of volume $d\tau$, in the neighbourhood of the point $P(x, y, z)$ of a mass of fluid, then the rate at which energy is dissipated in this volume in the form of heat is $\Phi\,d\tau$ (say), where $\Phi$ is dependent on the state of flow within the element of volume. We shall call $\Phi$ the dissipation-function.* Now, $\Phi$ cannot be dependent on the choice of the axes of reference and must therefore be a scalar invariant of the state of flow.

We have already seen in §2 that the state of flow may be completely described by specifying a purely rotational velocity for the volume elements in the neighbourhood of each point of the fluid together with a velocity of relative separation of the various points of the element. The purely rotational motion of the element involves no relative separation of the points of the fluid and therefore involves no energy dissipation in the element itself.

* The dissipation-function as defined here is not identical with the Rayleigh dissipation-function. The latter is equal to half the rate at which energy is dissipated per unit volume.

The dissipation-function $\Phi$ is therefore dependent purely on the irrotational component of the flow and is therefore a function of the scalar invariants $K_1$, $K_2$ and $K_3$ of the irrotational component of the flow. Bearing in mind that for an incompressible fluid $K_1 = 0$, the dissipation-function is then a function of $K_2$ and $K_3$ only. Thus,

$$\Phi = \Phi(K_2, K_3). \tag{8.1}$$

Whatever the form of $\Phi$ as a function of $K_2$ and $K_3$, it can be approximated, with any desired degree of accuracy, by a polynomial expression. Thus

$$\Phi = \sum_i \sum_j \mu_{ij} K_2^i K_3^j, \tag{8.2}$$

where $\mu_{ij}$ are constants characterizing the particular fluid considered. Since, when the fluid is stationary, $K_2$, $K_3$ and $\Phi$ must all be zero, $\mu_{00} = 0$.


9. The relation between stress and state of flow 


It has been shown (see, for example, Lamb (1924, §329)) that the rate $\Phi$ at which energy is dissipated, per unit volume of a fluid, is related to the stress components $t_{xx}, t_{yy}, \ldots, t_{xy}$ by the formula

$$\Phi = t_{xx}a + t_{yy}b + t_{zz}c + t_{yz}f + t_{zx}g + t_{xy}h. \tag{9.1}$$

The argument leading to this relation is a purely mechanical one and does not involve any assumptions regarding the physical properties of the fluid and so can be taken over here without change.

In developing our hydrodynamic theory for incompressible, visco-inelastic, non-Newtonian fluids, it is now necessary to find a set of relationships between the stress components at a point and the values of the components $a$, $b$, $c$, $\tfrac12 f$, $\tfrac12 g$, $\tfrac12 h$ of the strain-velocity, corresponding to any specified dissipation-function. It can readily be seen that although this cannot be done uniquely, the specification of a particular dissipation-function limits the form which can be taken by the stress-strain-velocity relationships, because of the relation (9.1).

Let us consider the stress-strain-velocity relation

$$t_{xx} = 2\mu a + p, \ \text{etc.} \quad\text{and}\quad t_{yz} = \mu f, \ \text{etc.} \tag{9.2}$$

This is the classical stress-strain-velocity relation for an incompressible, Newtonian fluid. It is readily seen, in view of equations (9.1), (4.8) and (5.2), that for such a fluid

$$\Phi = -4\mu K_2. \tag{9.3}$$

Again, let us consider the stress-strain-velocity relation

$$t_{xx} = 2\mu'A' + p, \ \text{etc.} \quad\text{and}\quad t_{yz} = \mu'F', \ \text{etc.} \tag{9.4}$$




Then, in view of equations (9.1), (5.5) and (5.2), the corresponding form of the dissipation-function $\Phi$ is given by

$$\Phi = 6\mu'K_3. \tag{9.5}$$

For stress-strain-velocity relations having the form (9.2) or (9.4), it is seen that the corresponding dissipation-functions, given by (9.3) and (9.5), are functions of the scalar invariants of flow, $K_2$ and $K_3$.

Generalizing this result, let us consider the stress-strain-velocity relations to have the form

$$t_{xx} = 2\Theta a + 2\Psi A' + p, \ \text{etc.} \quad\text{and}\quad t_{yz} = \Theta f + \Psi F', \ \text{etc.,} \tag{9.6}$$

where $\Theta$ and $\Psi$ are any functions of the scalar invariants of flow $K_2$ and $K_3$. Then, by similar arguments to those employed in deriving equations (9.3) and (9.5) from (9.2) and (9.4) respectively, we obtain the dissipation-function corresponding to the stress-strain-velocity relations (9.6) as

$$\Phi = -4\Theta K_2 + 6\Psi K_3. \tag{9.7}$$

This dissipation-function is clearly a function of the scalar invariants of flow $K_2$ and $K_3$.
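As a check on (9.7), the dissipation-function can be evaluated directly from (9.1) with the stresses (9.6) and compared with $-4\Theta K_2 + 6\Psi K_3$. The sketch below is illustrative only: it assumes the cyclic form of (2.10) for the primed quantities, treats $\Theta$ and $\Psi$ as the numerical values these functions take at the state of flow considered, and uses arbitrary sample components with $c = -(a + b)$ so that (5.2) holds.

```python
def k2_k3(a, b, c, f, g, h):
    """K2 and K3 from (4.8)."""
    k2 = a * b + b * c + c * a - (f * f + g * g + h * h) / 4
    k3 = a * b * c + f * g * h / 4 - (a * f * f + b * g * g + c * h * h) / 4
    return k2, k3


def primed(a, b, c, f, g, h):
    """A', B', C', F', G', H' (assumed cyclic form of (2.10))."""
    A = a * a + (g * g + h * h) / 4
    B = b * b + (h * h + f * f) / 4
    C = c * c + (f * f + g * g) / 4
    F = g * h / 2 + (b + c) * f
    G = h * f / 2 + (c + a) * g
    H = f * g / 2 + (a + b) * h
    return A, B, C, F, G, H


def dissipation(a, b, f, g, h, theta, psi, p):
    """Return (Phi from (9.1) with the stresses (9.6), Phi from (9.7))."""
    c = -(a + b)  # incompressibility, (5.2)
    A, B, C, F, G, H = primed(a, b, c, f, g, h)
    txx = 2 * theta * a + 2 * psi * A + p
    tyy = 2 * theta * b + 2 * psi * B + p
    tzz = 2 * theta * c + 2 * psi * C + p
    tyz = theta * f + psi * F
    tzx = theta * g + psi * G
    txy = theta * h + psi * H
    phi_direct = txx * a + tyy * b + tzz * c + tyz * f + tzx * g + txy * h
    k2, k3 = k2_k3(a, b, c, f, g, h)
    return phi_direct, -4 * theta * k2 + 6 * psi * k3


# Arbitrary sample state of flow; the two values should agree to rounding error.
print(dissipation(0.3, -1.1, 0.4, -0.9, 1.6, theta=2.0, psi=0.5, p=3.0))
```

Setting `psi=0.0` reproduces the Newtonian result (9.3), and `theta=0.0` reproduces (9.5).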

Of course, for a set of stress-strain-velocity relations to be admissible as the description of the flow properties of a fluid, they must not only lead to a dissipation-function which is a function of the scalar invariants of flow, but must also be invariant under a transformation of the axes of reference. It can readily be shown that the relations (9.6) satisfy this condition and this will be done in the next section.


10. Invariance of the stress-strain-velocity relations 


The stress components $t_{xx}, t_{yy}, \ldots, t_{xy}$ transform (see, for example, Love (1927, §49)) according to the law

$$\begin{aligned} t_{xx} &= l_1^2t_{x_0x_0} + l_2^2t_{y_0y_0} + l_3^2t_{z_0z_0} + 2l_2l_3t_{y_0z_0} + 2l_3l_1t_{z_0x_0} + 2l_1l_2t_{x_0y_0}, \ \text{etc.} \\ \text{and}\quad t_{yz} &= m_1n_1t_{x_0x_0} + m_2n_2t_{y_0y_0} + m_3n_3t_{z_0z_0} + (m_2n_3 + m_3n_2)t_{y_0z_0} \\ &\quad + (m_3n_1 + m_1n_3)t_{z_0x_0} + (m_1n_2 + m_2n_1)t_{x_0y_0}, \ \text{etc.,} \end{aligned} \tag{10.1}$$

where $t_{x_0x_0}, t_{y_0y_0}, \ldots, t_{x_0y_0}$ are the components of stress in the co-ordinate system $(x_0, y_0, z_0)$ defined as in §7.

It is readily seen, by comparing equations (10.1) with equations (7.3) and (7.5), that the stress components $t_{xx}, t_{yy}, \ldots, t_{xy}$ and the quantities $(a, b, c, \tfrac12 f, \tfrac12 g, \tfrac12 h)$ and $(A', B', C', \tfrac12 F', \tfrac12 G', \tfrac12 H')$ all transform according to the same laws, when the axes of reference are transformed. The stress-strain-velocity relations (9.6) are therefore invariant under a transformation from one set of rectangular, Cartesian co-ordinates to another. It is noted that $\Theta$ and $\Psi$, being functions of the flow invariants, are unaltered by a transformation of the axes. Also, $p$ is unaltered by a transformation of the axes.
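The invariance argument can be checked numerically by writing the common transformation law of (7.3), (7.5) and (10.1) in matrix form. The sketch below is an illustration under stated assumptions: the components are arranged as the symmetric matrix with diagonal $a, b, c$ and off-diagonal entries $\tfrac12 f, \tfrac12 g, \tfrac12 h$, so that (9.6) reads $t = 2\Theta e + 2\Psi e^2 + pI$; the particular functions chosen for $\Theta$ and $\Psi$, the rotation angles and the sample components are all hypothetical.

```python
import math


def matmul(p, q):
    """Product of two 3x3 matrices stored as nested lists."""
    return [[sum(p[i][k] * q[k][j] for k in range(3)) for j in range(3)]
            for i in range(3)]


def transpose(m):
    return [[m[j][i] for j in range(3)] for i in range(3)]


def rotation(alpha, beta):
    """Rows (l_k, m_k, n_k): direction cosines of x0, y0, z0 as in section 7."""
    ca, sa = math.cos(alpha), math.sin(alpha)
    cb, sb = math.cos(beta), math.sin(beta)
    rz = [[ca, -sa, 0.0], [sa, ca, 0.0], [0.0, 0.0, 1.0]]
    rx = [[1.0, 0.0, 0.0], [0.0, cb, -sb], [0.0, sb, cb]]
    return matmul(rz, rx)


def to_xyz(m0, rows):
    """Transformation law shared by (7.3), (7.5) and (10.1): m = R^T m0 R."""
    return matmul(transpose(rows), matmul(m0, rows))


def stress(e, p, theta_fn, psi_fn):
    """Matrix form of (9.6), with Theta and Psi evaluated from K2 and K3."""
    e2 = matmul(e, e)
    k2 = (e[0][0] * e[1][1] + e[1][1] * e[2][2] + e[2][2] * e[0][0]
          - e[1][2] ** 2 - e[2][0] ** 2 - e[0][1] ** 2)
    k3 = (e[0][0] * (e[1][1] * e[2][2] - e[1][2] * e[2][1])
          - e[0][1] * (e[1][0] * e[2][2] - e[1][2] * e[2][0])
          + e[0][2] * (e[1][0] * e[2][1] - e[1][1] * e[2][0]))
    theta, psi = theta_fn(k2, k3), psi_fn(k2, k3)
    return [[2 * theta * e[i][j] + 2 * psi * e2[i][j] + (p if i == j else 0.0)
             for j in range(3)] for i in range(3)]


def max_difference(e0, rows, p, theta_fn, psi_fn):
    """Compare 'apply (9.6), then rotate' with 'rotate, then apply (9.6)'."""
    t_rotated = to_xyz(stress(e0, p, theta_fn, psi_fn), rows)
    t_direct = stress(to_xyz(e0, rows), p, theta_fn, psi_fn)
    return max(abs(t_rotated[i][j] - t_direct[i][j])
               for i in range(3) for j in range(3))


# Hypothetical material functions and an incompressible sample state of flow.
theta_fn = lambda k2, k3: 1.0 + 0.5 * k2 * k2
psi_fn = lambda k2, k3: 0.2 - 0.1 * k3
e0 = [[0.3, 0.8, -0.45], [0.8, -1.1, 0.2], [-0.45, 0.2, 0.8]]
print(max_difference(e0, rotation(0.4, 1.1), 3.0, theta_fn, psi_fn))
```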




It is noteworthy that the law of transformation of the quantities $(t_{xx}, \ldots, t_{xy})$ or $(a, \ldots, \tfrac12 h)$ or $(A', \ldots, \tfrac12 H')$ is that for a contravariant tensor.

It has now been shown that the stress-strain-velocity relations (9.6) are admissible as the description of the flow properties of an incompressible fluid from the standpoints that they are invariant under a transformation from one rectangular, Cartesian reference system to another and that they lead to a dissipation-function which is a function of the flow invariants. In the next section it will be shown that equations (9.6) represent the most general form which can be taken by the stress-strain-velocity relations for an incompressible, visco-inelastic fluid.


11. The general stress-strain-velocity relations 


In this section, we shall prove that any stress-strain-velocity relations which are suitable for describing the properties of an incompressible, visco-inelastic fluid must have the form of equations (9.6) or be capable of being written in this form.

If the axes $(X, Y, Z)$ of the flow quadrics, represented by equations (2.9) or (2.17), are taken as the axes of reference, then the stress-strain-velocity relations (9.6), in the neighbourhood of the point to which these flow quadrics apply, take the form

$$\begin{aligned} t_{XX} &= 2\Theta U_1 + 2\Psi U_1^2 + p, \\ t_{YY} &= 2\Theta U_2 + 2\Psi U_2^2 + p, \\ t_{ZZ} &= 2\Theta U_3 + 2\Psi U_3^2 + p \\ \text{and}\quad t_{YZ} &= t_{ZX} = t_{XY} = 0. \end{aligned} \tag{11.1}$$

Conversely, if the stress-strain-velocity relations take this form in the axial system $(X, Y, Z)$, they will take the form (9.6) in the axial system $(x, y, z)$.

Since the fluid is isotropic in its state of rest, the stress-strain-velocity relations in the axial system $(X, Y, Z)$ must be symmetrical with respect to a cyclic rotation of the co-ordinates $(X, Y, Z)$ and since the fluid is visco-inelastic the principal stress components must be functions of $U_1$, $U_2$ and $U_3$ only. Thus, the most general form for the stress-strain-velocity relationships in the co-ordinate system $(X, Y, Z)$ is

$$\begin{aligned} t_{XX} &= F(U_1, U_2, U_3) + p, \\ t_{YY} &= F(U_2, U_3, U_1) + p, \\ t_{ZZ} &= F(U_3, U_1, U_2) + p \\ \text{and}\quad t_{YZ} &= t_{ZX} = t_{XY} = 0, \end{aligned} \tag{11.2}$$

where $F$ is an arbitrary function.

Two of the relations (4.8) can be used to eliminate $U_2$ and $U_3$ from the first of equations (11.2), $U_3$ and $U_1$ from the second of equations (11.2) and $U_1$ and $U_2$ from the third of equations (11.2). Equations (11.2) then take the form

$$t_{XX} = F'(K_2, K_3, U_1) + p, \ \text{etc.} \tag{11.3}$$





Now $F'(K_2, K_3, U_1)$ can be expanded, with any desired degree of accuracy, over any desired range of values of $U_1$, as a polynomial in $U_1$, the coefficients in which are functions of the flow invariants $K_2$ and $K_3$ only. From (4.8), it can readily be seen that

$$U_1^3 = K_3 - U_1K_2.$$

Employing this expression to substitute for $U_1^3$ in the polynomial expansion of $F'(K_2, K_3, U_1)$, we obtain

$$F'(K_2, K_3, U_1) = 2\Theta U_1 + 2\Psi U_1^2, \tag{11.4}$$

where $\Theta$ and $\Psi$ are functions of $K_2$ and $K_3$ only. Strictly, the right-hand side of (11.4) contains, in general, a further term which is a function of $K_2$ and $K_3$ only, but this may be incorporated with $p$ in (11.3).
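The reduction leading to (11.4) can be carried out mechanically. The following sketch is an illustration, not the author's procedure: it takes an arbitrary polynomial in $U$ (the coefficient list is hypothetical), repeatedly applies $U^3 = K_3 - K_2U$, and returns the term free of $U$ together with the coefficients of $U$ and $U^2$, which correspond to $2\Theta$ and $2\Psi$.

```python
def reduce_polynomial(coeffs, k2, k3):
    """Reduce sum(coeffs[n] * U**n) to s + t*U + q*U**2 using U**3 = k3 - k2*U.

    In the notation of (11.4), t = 2*Theta and q = 2*Psi, while s is the
    further term that may be incorporated with p.
    """
    s = t = q = 0.0
    power = (1.0, 0.0, 0.0)  # U**0 written as 1 + 0*U + 0*U**2
    for coeff in coeffs:
        s += coeff * power[0]
        t += coeff * power[1]
        q += coeff * power[2]
        al, be, ga = power
        # Multiply the current power by U and eliminate U**3.
        power = (ga * k3, al - ga * k2, be)
    return s, t, q


def max_mismatch(roots, coeffs):
    """Compare direct evaluation with the reduced form at each principal value."""
    u1, u2, u3 = roots  # must satisfy u1 + u2 + u3 = 0, i.e. K1 = 0
    k2 = u2 * u3 + u3 * u1 + u1 * u2
    k3 = u1 * u2 * u3
    s, t, q = reduce_polynomial(coeffs, k2, k3)
    worst = 0.0
    for u in roots:
        direct = sum(c * u ** n for n, c in enumerate(coeffs))
        worst = max(worst, abs(direct - (s + t * u + q * u * u)))
    return worst


# Hypothetical polynomial of degree 5 and principal values summing to zero.
print(max_mismatch((0.5, -1.2, 0.7), [0.0, 1.5, -0.3, 0.8, 0.05, -0.02]))
```

Because the same three numbers `s`, `t`, `q` serve for all three principal values, the reduced form is the same function of $U_1$, $U_2$ and $U_3$ in turn, as the passage from (11.3) to (11.1) requires.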

In a similar manner, analogous expressions for $F'(K_2, K_3, U_2)$ and $F'(K_2, K_3, U_3)$ can be found. Introducing these into equations (11.3), we obtain equations (11.1), which are therefore the most general stress-strain-velocity relations that can be used to describe the flow properties of a fluid. These are, of course, referred to the axes of flow as reference co-ordinates. Referred to the general rectangular, Cartesian co-ordinate system $(x, y, z)$, they take the form (9.6).

Reiner (1945) obtained stress-strain-velocity relations similar to (9.6) for a general, compressible fluid, in order to explain the phenomenon of dilatancy. In his equations $p$ is replaced by an arbitrary function of the strain-velocity invariants $K_1$, $K_2$ and $K_3$, and $\Theta$ and $\Psi$ are also functions of these three strain-velocity invariants. It does not appear that the non-vanishing of $\Psi$ is in any way essential to dilatancy.


12. The Eulerian equations of motion 

The Eulerian equations of motion for a fluid may be written (see, for example, Lamb (1924, §328)) in terms of the stress components in the co-ordinate system $(x, y, z)$ as

$$\rho\frac{Du}{Dt} = \rho X + \frac{\partial t_{xx}}{\partial x} + \frac{\partial t_{xy}}{\partial y} + \frac{\partial t_{xz}}{\partial z}, \ \text{etc.,} \tag{12.1}$$

where $\rho$ is the density of the fluid and $X$, $Y$, $Z$ are the components parallel to the axes $x$, $y$ and $z$ respectively of the body force per unit mass of the fluid acting at the point $(x, y, z)$. The operator $D/Dt$ has the meaning expressed by

$$\frac{D}{Dt} = \frac{\partial}{\partial t} + u\frac{\partial}{\partial x} + v\frac{\partial}{\partial y} + w\frac{\partial}{\partial z}. \tag{12.2}$$

Substituting in equations (12.1) the general expressions (9.6) for the stress components, we obtain the equations of motion for an incompressible, non-Newtonian fluid in terms of the velocity components $u$, $v$ and $w$.


13. The boundary conditions 

Let us suppose that surface tractions, of which the components parallel to the axes $x$, $y$ and $z$ are $X_\nu$, $Y_\nu$ and $Z_\nu$ respectively per unit area of the boundary, are applied to the boundary surface of the fluid at the point $(x, y, z)$ on it. $X_\nu$, $Y_\nu$ and $Z_\nu$ are functions of the position $(x, y, z)$ on the boundary and of the time $t$. At each instant, the components parallel to $x$, $y$ and $z$ of the forces, due to the stress, at the boundary of the liquid must be in equilibrium with those due to the applied surface tractions. We thus obtain the three boundary conditions (Lamb 1924, §§325 and 327)

$$t_{xx}\cos(x, \nu) + t_{xy}\cos(y, \nu) + t_{xz}\cos(z, \nu) = X_\nu, \ \text{etc.,} \tag{13.1}$$

where $(x, \nu)$, $(y, \nu)$ and $(z, \nu)$ are the angles between the normal $\nu$ to the surface at the point $(x, y, z)$ and the $x$, $y$ and $z$ axes respectively.

Again we can substitute in (13.1) the expressions for the stress components given in equations (9.6) and so obtain the boundary conditions in terms of the components $u$, $v$ and $w$ of the velocity.

As in classical hydrodynamical theory, the boundary conditions may not be given in the form of applied surface tractions, but as specified values of the velocity components $u$, $v$ and $w$ over the boundary. Again, the boundary conditions may be of the mixed type, in which specified values of the surface tractions are given over part of the boundary surface and specified values of the velocity components $u$, $v$ and $w$ over the remainder of the boundary.


C. PROBLEMS INVOLVING LAMINAR FLOW


14. Introductory remarks 


It can readily be seen that for laminar flow of the $xz$-planes of the fluid parallel to the $x$-axis,

$$u = f(y), \quad v = w = 0.$$

Equations (2.5) and (2.10) yield

$$a = b = c = f = g = 0 \quad\text{and}\quad h = u_y \tag{14.1}$$

and

$$A' = B' = \tfrac14 u_y^2, \quad C' = 0, \quad F' = G' = H' = 0. \tag{14.2}$$

Substituting from (14.1) and (14.2) in (9.6), we obtain

$$t_{xx} = t_{yy} = \tfrac12\Psi u_y^2 + p, \quad t_{zz} = p, \quad t_{yz} = t_{zx} = 0 \quad\text{and}\quad t_{xy} = \Theta u_y. \tag{14.3}$$

Thus, apart from an arbitrary hydrostatic pressure, the stress system associated with laminar flow consists of a tangential component $t_{xy}$ and normal components $t_{xx}$ and $t_{yy}$. These normal components vanish for a Newtonian fluid and indeed for any fluid for which $\Psi = 0$ in equations (9.6).
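The stresses (14.3) follow from (9.6) by direct substitution, which the short sketch below reproduces numerically. It is an illustration only: the primed quantities are formed from the assumed cyclic form of (2.10), $\Theta$ and $\Psi$ are passed as the numerical values they take at the shear rate considered, and the function name and sample numbers are hypothetical.

```python
def laminar_stresses(shear_rate, theta, psi, p):
    """Stress components from (9.6) for u = f(y), v = w = 0, where h = u_y."""
    a = b = c = f = g = 0.0
    h = shear_rate
    # Primed quantities, assumed cyclic form of (2.10).
    A = a * a + (g * g + h * h) / 4
    B = b * b + (h * h + f * f) / 4
    C = c * c + (f * f + g * g) / 4
    F = g * h / 2 + (b + c) * f
    G = h * f / 2 + (c + a) * g
    H = f * g / 2 + (a + b) * h
    return {
        "txx": 2 * theta * a + 2 * psi * A + p,
        "tyy": 2 * theta * b + 2 * psi * B + p,
        "tzz": 2 * theta * c + 2 * psi * C + p,
        "tyz": theta * f + psi * F,
        "tzx": theta * g + psi * G,
        "txy": theta * h + psi * H,
    }


# With p = 0 the normal components txx and tyy are psi * u_y**2 / 2,
# and they vanish when psi = 0 (Newtonian case).
print(laminar_stresses(shear_rate=1.5, theta=2.0, psi=0.4, p=0.0))
```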

In §§ 15 and 16, we shall consider in greater detail certain problems of laminar 
flow which are of practical interest. 


15. Torsional motion of a cylindrical mass of fluid 

In this section the torsional motion of a cylindrical mass of incompressible fluid about its axis will be considered. Let us assume that the radius and length of this mass of fluid are $a$ and $l$ respectively. Let us assume that each point of the fluid moves in a circular path about the axis of the cylinder with a speed which is proportional to its distance $r$ from the axis and its distance $z$ from one end of the cylinder. Let us take as axes of reference a rectangular, Cartesian co-ordinate system $(x, y, z)$, which has its $z$-axis coincident with the axis of the cylinder and its $x$- and $y$-axes in the plane of one end of the cylinder. Then the components $u$, $v$ and $w$ of the velocity of a point of the fluid, which is at $(x, y, z)$, parallel to the axes $x$, $y$ and $z$ of the rectangular, Cartesian co-ordinate system, are given by

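$$u = -\psi yz, \quad v = \psi xz \quad\text{and}\quad w = 0, \tag{15.1}$$

where

$$r^2 = x^2 + y^2 \tag{15.2}$$

and $\psi$ is a constant. From equations (15.1) and (2.5), we see that for this motion of the fluid,

$$a = 0, \quad b = 0, \quad c = 0, \quad f = \psi x, \quad g = -\psi y \quad\text{and}\quad h = 0. \tag{15.3}$$

Again, from equations (15.3) and (2.10), we see that

$$A' = \tfrac14\psi^2y^2, \quad B' = \tfrac14\psi^2x^2, \quad C' = \tfrac14\psi^2r^2, \quad F' = 0, \quad G' = 0 \quad\text{and}\quad H' = -\tfrac12\psi^2xy. \tag{15.4}$$

Further, employing the relations (15.3) in (4.8), we obtain

$$K_2 = -\tfrac14\psi^2r^2 \quad\text{and}\quad K_3 = 0. \tag{15.5}$$

Also, it is seen that the relation (5.2), which must be valid for an incompressible fluid, is automatically satisfied.

If the stress-strain-velocity relationships take the form (9.6), we can find the stress components at the point $(x, y, z)$ by introducing the relations (15.3), (15.4) and (15.5) into (9.6). We obtain

$$\begin{aligned} t_{xx} &= 2\Psi(\tfrac14\psi^2y^2) + p, \\ t_{yy} &= 2\Psi(\tfrac14\psi^2x^2) + p, \\ t_{zz} &= 2\Psi(\tfrac14\psi^2r^2) + p, \\ t_{yz} &= \Theta(\psi x), \\ t_{zx} &= \Theta(-\psi y) \\ \text{and}\quad t_{xy} &= \Psi(-\tfrac12\psi^2xy), \end{aligned} \tag{15.6}$$

where $\Theta$ and $\Psi$ are now functions of $K_2$ only, $K_2$ being given by (15.5).

Introducing these expressions for the stress components into the equations of motion (12.1), together with the formulae (15.1) for $u$, $v$ and $w$, we obtain

$$\begin{aligned} -\rho\psi^2xz^2 &= \rho X + \frac{\partial p}{\partial x} - \tfrac12\psi^2x\Psi, \\ -\rho\psi^2yz^2 &= \rho Y + \frac{\partial p}{\partial y} - \tfrac12\psi^2y\Psi \\ \text{and}\quad 0 &= \rho Z + \frac{\partial p}{\partial z}. \end{aligned} \tag{15.7}$$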


Let us assume that $\rho z^2$ is small compared with $\Psi$ at all points of the fluid mass, so that the components of the centrifugal force $-\rho\psi^2xz^2$ and $-\rho\psi^2yz^2$ in equations (15.7) can be neglected. If, further, the applied body forces are zero, then equations (15.7) become

$$\frac{\partial p}{\partial x} = \tfrac12\psi^2x\Psi, \quad \frac{\partial p}{\partial y} = \tfrac12\psi^2y\Psi \quad\text{and}\quad \frac{\partial p}{\partial z} = 0. \tag{15.8}$$

It is readily seen that these equations are compatible and lead to the following equation for $p$:

$$\frac{dp}{dr} = \tfrac12\psi^2r\Psi. \tag{15.9}$$

Thus, the state of motion described by equations (15.1) can be supported by surface tractions alone.

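Let us take at each point of the fluid a rectangular, Cartesian co-ordinate system $(r, \theta, z)$, the axes of which are parallel to the radial, azimuthal and axial directions at that point. Then, the orientations of the axes $(r, \theta, z)$ with respect to the axes $(x, y, z)$ are given by the scheme

$$\begin{array}{c|ccc} & r & \theta & z \\ \hline x & x/r & -y/r & 0 \\ y & y/r & x/r & 0 \\ z & 0 & 0 & 1 \end{array}$$

Therefore, the components of stress in the axial system $(r, \theta, z)$ are given by

$$\begin{aligned} t_{rr} &= \frac{x^2}{r^2}t_{xx} + \frac{y^2}{r^2}t_{yy} + 2\frac{xy}{r^2}t_{xy}, \\ t_{\theta\theta} &= \frac{y^2}{r^2}t_{xx} + \frac{x^2}{r^2}t_{yy} - 2\frac{xy}{r^2}t_{xy}, \\ t_{zz} &= t_{zz}, \\ t_{\theta z} &= \frac{x}{r}t_{yz} - \frac{y}{r}t_{zx}, \\ t_{zr} &= \frac{y}{r}t_{yz} + \frac{x}{r}t_{zx} \\ \text{and}\quad t_{r\theta} &= \frac{xy}{r^2}(t_{yy} - t_{xx}) + \frac{x^2 - y^2}{r^2}t_{xy}. \end{aligned} \tag{15.10}$$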


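Introducing into these equations the expressions (15.6) for $t_{xx}, t_{yy}, \ldots, t_{xy}$, we obtain

$$\begin{aligned} t_{rr} &= p, \\ t_{\theta\theta} &= 2\Psi(\tfrac14\psi^2r^2) + p, \\ t_{zz} &= 2\Psi(\tfrac14\psi^2r^2) + p, \\ t_{\theta z} &= r\psi\Theta \\ \text{and}\quad t_{zr} &= t_{r\theta} = 0. \end{aligned} \tag{15.11}$$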




Let us assume that when $r = a$, $t_{rr} = 0$ and hence $p = 0$. This enables us to determine the constant of integration in the integral of equation (15.9), giving

$$p = \tfrac12\psi^2\int_a^r r\Psi\,dr. \tag{15.12}$$

Now, the surface tractions which must act on the plane ends and on the curved surface of the mass of fluid to maintain the state of motion described by equations (15.1) are given by equations (13.1) and, for the particular problem considered here, could be calculated by introducing the expressions (15.6) for $t_{xx}, t_{yy}, \ldots$ and writing

$$\cos(x, \nu) = \cos(y, \nu) = 0 \quad\text{and}\quad \cos(z, \nu) = \pm 1,$$

over the plane ends of the cylinder, and

$$\cos(x, \nu) = x/r, \quad \cos(y, \nu) = y/r \quad\text{and}\quad \cos(z, \nu) = 0,$$

over the curved surface of the cylinder. However, it is simpler to calculate the radial, azimuthal and longitudinal components, $R'$, $\Theta'$ and $Z'$ respectively, directly by noting that over the curved surface

$$R' = t_{rr}, \quad \Theta' = t_{r\theta} \quad\text{and}\quad Z' = t_{zr} \tag{15.13}$$

and over the plane ends

$$R' = t_{zr}, \quad \Theta' = t_{\theta z} \quad\text{and}\quad Z' = t_{zz}. \tag{15.14}$$

Substituting in (15.13) and (15.14) the expressions for $t_{rr}, t_{\theta\theta}, \ldots$ given in equations (15.11), where $p$ is given by (15.12), we see that, over the curved surface of the cylinder, the surface tractions vanish and, over the plane ends, they are given by

$$R' = 0, \quad \Theta' = r\psi\Theta \quad\text{and}\quad Z' = \tfrac12\Psi\psi^2r^2 + \tfrac12\psi^2\int_a^r r\Psi\,dr. \tag{15.15}$$

The couple which must be exerted on a plane end of the cylindrical mass of fluid is

$$\int_0^a \Theta'\,2\pi r^2\,dr = 2\pi\psi\int_0^a r^3\Theta\,dr. \tag{15.16}$$

Since $\Theta$ is a function of $K_2$ and $K_3$ which, for the motion considered, are given by (15.5), the expression (15.16) can readily be calculated if the form of $\Theta$ as a function of $K_2$ and $K_3$ is known.

It is apparent from (15.15) that, in general, to produce the motion described by (15.1) it is necessary to exert, in addition to the couple, a normal surface traction. The presence of the terms $2\Psi A', \ldots, \Psi F', \ldots$ on the right-hand sides of the stress-strain-velocity relations (9.6) does not alter the value of the couple (15.16) and the presence of the terms $2\Theta a, \ldots, \Theta f, \ldots$ does not affect the value of the normal surface traction $Z'$ given by (15.15). Therefore, experiments in which the dependence of the couple (15.16) on $\psi$ is measured can only yield information on the form of $\Theta$ for the material and, since $K_3 = 0$ for the motion considered, only on its dependence on $K_2$ when $K_3 = 0$. Again, experiments in which $Z'$ is measured can yield information only on the form of $\Psi$ as a function of $K_2$, when $K_3 = 0$.

In the simplest case of equations (9.6), for which $\Psi \neq 0$, $\Theta$ and $\Psi$ are both constants. Then the normal surface tractions $Z'$, which must be applied to the plane ends of the cylindrical mass of fluid, are given, from (15.15), by

$$Z' = \tfrac14\Psi\psi^2(3r^2 - a^2) \tag{15.17}$$

and the couple which must be exerted is given by (15.16) as

$$\tfrac12\pi\psi\Theta a^4. \tag{15.18}$$

It is seen from (15.17) that $Z'$ changes sign when $r = a/\sqrt{3}$, and is positive or negative when $r = 0$ accordingly as $\Psi$ is negative or positive.
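For material functions $\Theta$ and $\Psi$ that are not constant, the integrals in (15.15) and (15.16) would in general have to be evaluated numerically. The sketch below shows one way this might be done with a simple midpoint rule and, as a check, compares the results for constant $\Theta$ and $\Psi$ with (15.17) and (15.18). The function names, the quadrature rule and the numerical values are assumptions made for illustration; `twist` stands for the constant $\psi$ of (15.1).

```python
import math


def integrate(fn, lo, hi, n=2000):
    """Midpoint rule; with hi < lo the result carries the correct sign."""
    step = (hi - lo) / n
    return step * sum(fn(lo + (i + 0.5) * step) for i in range(n))


def normal_traction(r, radius, twist, psi_fn):
    """Z' on a plane end, from (15.15); psi_fn maps K2 to Psi, K2 from (15.5)."""
    def integrand(s):
        return s * psi_fn(-0.25 * twist ** 2 * s ** 2)

    p = 0.5 * twist ** 2 * integrate(integrand, radius, r)  # (15.12)
    return 0.5 * psi_fn(-0.25 * twist ** 2 * r ** 2) * twist ** 2 * r ** 2 + p


def couple(radius, twist, theta_fn):
    """Couple on a plane end, from (15.16); theta_fn maps K2 to Theta."""
    def integrand(r):
        return r ** 3 * theta_fn(-0.25 * twist ** 2 * r ** 2)

    return 2.0 * math.pi * twist * integrate(integrand, 0.0, radius)


# Constant Theta and Psi (hypothetical values): compare with (15.17), (15.18).
THETA, PSI, TWIST, RADIUS = 2.0, 0.5, 0.3, 1.5
z_numeric = normal_traction(0.4, RADIUS, TWIST, lambda k2: PSI)
z_closed = 0.25 * PSI * TWIST ** 2 * (3 * 0.4 ** 2 - RADIUS ** 2)
c_numeric = couple(RADIUS, TWIST, lambda k2: THETA)
c_closed = 0.5 * math.pi * TWIST * THETA * RADIUS ** 4
print(z_numeric, z_closed)
print(c_numeric, c_closed)
```

Supplying non-constant callables for `theta_fn` and `psi_fn` gives the couple and normal traction for a fluid whose $\Theta$ and $\Psi$ vary with $K_2$.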


16. Flow between rotating infinite cylinders 


We shall now consider the motion of an incompressible fluid, whose stress-strain-velocity relationships take the form (9.6), between two infinite, coaxial cylinders of radii $a_1$ and $a_2$ respectively ($a_2 > a_1$), when the angular velocity of the outer cylinder is $\Omega_2$ and that of the inner cylinder is $\Omega_1$. We shall assume that each point of the fluid moves in a plane perpendicular to the common axis of the cylinders and in a circular orbit about this axis.

If the angular velocity of the fluid at a point distant $r$ from the axis of the cylinders is $\Omega$, then

$$\Omega = \Omega_1, \ \text{when } r = a_1, \quad\text{and}\quad \Omega = \Omega_2, \ \text{when } r = a_2. \tag{16.1}$$

It is noted that for the state of motion described, $\Omega$ is a function of $r$ only. Let us take as axes of reference a rectangular, Cartesian co-ordinate system $(x, y, z)$, the $z$-axis of which coincides with the axis of the cylinders. Then, if $u$, $v$ and $w$ are the components of velocity of the point $(x, y, z)$ of the fluid parallel to $x$, $y$ and $z$ respectively, we have

$$u = -y\Omega, \quad v = x\Omega \quad\text{and}\quad w = 0. \tag{16.2}$$

Introducing these expressions for $u$, $v$ and $w$ into (2.5) and noting that $r^2 = x^2 + y^2$, we obtain

$$a = -\frac{xy}{r}\Omega_r, \quad b = \frac{xy}{r}\Omega_r, \quad c = 0, \quad f = 0, \quad g = 0 \quad\text{and}\quad h = \frac{1}{r}(x^2 - y^2)\Omega_r, \tag{16.3}$$

where $\Omega_r$ denotes $d\Omega/dr$. Substituting these expressions in (2.10), we obtain

$$A' = B' = \tfrac14 r^2\Omega_r^2, \quad C' = 0, \quad F' = 0, \quad G' = 0 \quad\text{and}\quad H' = 0. \tag{16.4}$$

It is readily seen that the relation (5.2), which must be obeyed for any motion of an incompressible fluid, is automatically satisfied by (16.3) and hence for the motion described by (16.2).





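Substituting in (4.8), from (16.3), we see that for this motion of the fluid

$$K_2 = -\tfrac14 r^2\Omega_r^2 \quad\text{and}\quad K_3 = 0. \tag{16.5}$$

Now, let us consider the case when the stress-strain-velocity relations for the material take the form (9.6). For the motion considered these become

$$\begin{aligned} t_{xx} &= -2\Theta\frac{xy}{r}\Omega_r + 2\Psi\,\tfrac14 r^2\Omega_r^2 + p, \\ t_{yy} &= 2\Theta\frac{xy}{r}\Omega_r + 2\Psi\,\tfrac14 r^2\Omega_r^2 + p, \\ t_{zz} &= p, \\ t_{yz} &= t_{zx} = 0 \\ \text{and}\quad t_{xy} &= \Theta\frac{1}{r}(x^2 - y^2)\Omega_r. \end{aligned} \tag{16.6}$$

The components of stress $t_{rr}, t_{\theta\theta}, \ldots$, defined as in §15, can be calculated by combining equations (16.6) with (15.10). We obtain

$$t_{rr} = t_{\theta\theta} = \tfrac12\Psi r^2\Omega_r^2 + p, \quad t_{zz} = p, \quad t_{\theta z} = t_{zr} = 0 \quad\text{and}\quad t_{r\theta} = \Theta r\Omega_r. \tag{16.7}$$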
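The passage from (16.3) to (16.7) can be followed numerically at a single point. The sketch below is illustrative only: it forms the strain-velocity components from (16.3) for an assumed value of $d\Omega/dr$, the primed quantities from the assumed cyclic form of (2.10), the Cartesian stresses from (9.6), and finally the polar components from the transformation used in §15. The function name and the sample numbers are hypothetical, and $\Theta$ and $\Psi$ are passed as the values they take at the point considered.

```python
import math


def couette_polar_stresses(x, y, omega_r, theta, psi, p):
    """Return (t_rr, t_theta_theta, t_r_theta) at the point (x, y)."""
    r = math.hypot(x, y)
    # Strain-velocity components (16.3); c = f = g = 0.
    a = -x * y / r * omega_r
    b = x * y / r * omega_r
    h = (x * x - y * y) / r * omega_r
    # Primed quantities, assumed cyclic form of (2.10), with c = f = g = 0.
    A = a * a + h * h / 4
    B = b * b + h * h / 4
    H = (a + b) * h
    # Cartesian stresses (9.6).
    txx = 2 * theta * a + 2 * psi * A + p
    tyy = 2 * theta * b + 2 * psi * B + p
    txy = theta * h + psi * H
    # Polar components, as in (15.10).
    trr = (x * x * txx + y * y * tyy + 2 * x * y * txy) / r ** 2
    tthth = (y * y * txx + x * x * tyy - 2 * x * y * txy) / r ** 2
    trth = (x * y * (tyy - txx) + (x * x - y * y) * txy) / r ** 2
    return trr, tthth, trth


# Sample point and values; compare with (16.7):
# t_rr = t_theta_theta = psi*r^2*omega_r^2/2 + p and t_r_theta = theta*r*omega_r.
x, y, omega_r, theta, psi, p = 0.6, -1.1, 0.7, 2.0, 0.5, -0.3
r = math.hypot(x, y)
print(couette_polar_stresses(x, y, omega_r, theta, psi, p))
print(0.5 * psi * r ** 2 * omega_r ** 2 + p, theta * r * omega_r)
```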


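Substituting the expressions (16.6) and (16.2) in the equations of motion (12.1), we obtain, when the body forces are zero (involving neglect of the gravitational forces on the fluid) so that $X = Y = Z = 0$,

$$-\rho x\Omega^2 = \frac{\partial t_{xx}}{\partial x} + \frac{\partial t_{xy}}{\partial y}, \quad -\rho y\Omega^2 = \frac{\partial t_{xy}}{\partial x} + \frac{\partial t_{yy}}{\partial y} \quad\text{and}\quad 0 = \frac{\partial p}{\partial z}. \tag{16.8}$$

Since

$$\frac{\partial p}{\partial r} = \frac{x}{r}\frac{\partial p}{\partial x} + \frac{y}{r}\frac{\partial p}{\partial y} \quad\text{and}\quad \frac{\partial p}{\partial\theta} = -y\frac{\partial p}{\partial x} + x\frac{\partial p}{\partial y}, \tag{16.9}$$

we obtain, from (16.8),

$$\frac{\partial p}{\partial r} = -\rho r\Omega^2 - \frac{d}{dr}\left(\tfrac12\Psi r^2\Omega_r^2\right) \quad\text{and}\quad \frac{\partial p}{\partial\theta} = -\frac{1}{r}\frac{d}{dr}\left(r^3\Theta\Omega_r\right). \tag{16.10}$$

If $p$ is independent of $\theta$, as it must be for the cylindrically symmetrical state of flow we have assumed, $\partial p/\partial\theta = 0$ and the second of equations (16.10) yields

$$\Theta r\Omega_r = Cr^{-2}, \tag{16.11}$$

where $C$ is a constant of integration.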





It can readily be seen, by direct substitution in (16·8), that if $\Theta\Omega_r$ is given by
(16·11), equations (16·8) are compatible. If the form of Θ as a function of $K_2$ and $K_3$
and hence, from (16·6), of $\Omega_r$ is known, equation (16·11) can be used to obtain
Ω as a function of r. The integration involved introduces a further constant of
integration and this, together with C in (16·11), can be obtained by employing the
two boundary conditions (16·1). It is clear that the radial distribution of the
angular velocity is unaffected by the presence in the stress-strain-velocity relations
(9·6) of the terms containing Ψ. However, from the first of equations (16·10), it
is seen that

$$p = -\int\rho r\Omega^2\,dr - \tfrac12\Psi r^2\Omega_r^2 + C', \tag{16·12}$$

where C' is a constant of integration.

Introducing expression (16·12) for p, the stress components (16·7) for the state
of motion considered become

$$\begin{aligned}
t_{rr} = t_{\theta\theta} &= -\int\rho r\Omega^2\,dr + C',\\
t_{zz} &= -\int\rho r\Omega^2\,dr - \tfrac12\Psi r^2\Omega_r^2 + C',\\
t_{\theta z} = t_{zr} &= 0 \quad\text{and}\quad t_{r\theta} = \Theta r\Omega_r.
\end{aligned} \tag{16·13}$$

It is seen that the presence of the terms containing Ψ in the stress-strain-velocity
relations (9·6) modifies the expressions for the stress components to the extent of
the term $-\tfrac12\Psi r^2\Omega_r^2$ in the expression for $t_{zz}$.

If the form of the stress-strain-velocity relations is known, then (16*13) can be 
used to determine the necessary surface tractions that must be applied to the mass 
of fluid to produce the state of motion described by (16*2). 

If the angular velocity Ω is sufficiently small then the terms $-\int\rho r\Omega^2\,dr$ in (16·13)
can be neglected. If the length of the cylindrical mass of fluid is finite, then the system
of external forces which must be applied to maintain the state of motion considered
consists of (i) azimuthal surface tractions acting over the inner and outer curved
surfaces of the fluid mass of amounts $C/a_1^2$ and $C/a_2^2$ respectively, (ii) a normal surface
traction acting over the plane ends of the fluid mass of amount $-\tfrac12\Psi C^2/\Theta^2 r^4$, and
(iii) an arbitrary uniform normal surface traction C' acting over the whole of the
surface of the fluid.

The surface tractions (i) imply that in order to produce in the fluid the state of
motion under discussion, equal and oppositely directed couples of magnitude $2\pi C$
must be applied to the confining cylinders per unit length. It is noted too that the
surface traction (ii) vanishes when Ψ = 0, for the state of motion considered.

For the particular case, when Θ is a constant, we find, from (16·11) and (16·1), that

$$C = 2\Theta(\Omega_1-\Omega_2)\frac{a_1^2a_2^2}{a_1^2-a_2^2}, \tag{16·14}$$

so that the surface traction (ii) becomes

$$-2\Psi(\Omega_1-\Omega_2)^2\frac{a_1^4a_2^4}{(a_1^2-a_2^2)^2\,r^4}. \tag{16·15}$$

If Ψ is constant, this is positive or negative depending on the sign of Ψ. If the
surface traction (16·15) is not applied, the surface of the fluid will have a tendency
to rise or fall at the surface of the inner cylinder accordingly as Ψ is positive or
negative. In either case, the effect will be greatest at the inner confining cylinder.


17. General remarks

A fluid is generally recognized as non-Newtonian on the basis of viscosimetric
experiments. For a Newtonian fluid in a steady state of laminar flow, a linear
relation obtains between the shearing stress and rate of shear. It is generally assumed
that if such a relation is obtained experimentally, the fluid under investigation is
Newtonian and that the whole of the classical hydrodynamic theory for viscous
fluids is applicable. It is seen, from the results of §§ 15 and 16, that this is by no means
correct. Fluids, which appear Newtonian on the basis of experiments involving
steady-state laminar flow, may, from the stand-point of the phenomenological
theory, be non-Newtonian and distinguishable from Newtonian fluids by other
steady-state experiments in which the flow is not laminar.

It should be remarked that any visco-elastic material, which is fluid-like in its 
ability to be continuously deformed, cannot be distinguished from a visco-inelastic 
fluid solely by means of steady-state experiments, in which the flow is laminar. 
For example, suppose the fundamental stress-strain-velocity relations for the
material involve time derivatives of the stress components. For a steady state of 
flow, these vanish and the stress-strain-velocity relations are indistinguishable from 
those describing a true fluid. 

It may well be that the theory formulated in the present paper has as an
important field of application the correlation of the steady-state laminar flow
properties of such visco-elastic materials.

Effects similar to those predicted from the theory of §§ 15 and 16 have been
described in a number of fluids by Weissenberg (1947). They are interpreted by him
as arising from the fact that the fluids concerned have highly-elastic as well
as viscous properties. It does not, however, appear that this explanation can
be correct, from a phenomenological point-of-view, since, in the experiments, the
fluids are in a steady-state of laminar flow. In the theory presented here, these
phenomena appear to arise from the non-vanishing of Ψ in the stress-strain-velocity
relations. From a phenomenological point-of-view, this does not imply that the
fluid is visco-elastic. However, it may be that the microscopic or molecular features
of a fluid, which lead to a non-zero value for Ψ, also result in the fluid having
visco-elastic properties.




This work forms part of a programme of fundamental research undertaken by 
the Board of the British Rubber Producers' Research Association. The work was
carried out at the National Bureau of Standards, Washington, D.C., U.S.A., while 
the author was a guest-worker there. 

References 

Lamb, H. 1924 Hydrodynamics, 5th ed. Camb. Univ. Press.

Love, A. E. H. 1927 The mathematical theory of elasticity, 4th ed. Camb. Univ. Press.

Reiner, M. 1945 Amer. J. Math. 67, 350-362.

Weissenberg, K. 1947 Nature, 159, 310.


A theory of the yielding and plastic flow of 
anisotropic metals 

By R. Hill

(Communicated by E. Orowan, F.R.S.—Received 13 November 1947) 


A theory is suggested which describes, on a macroscopic scale, the yielding and plastic
flow of an anisotropic metal. The type of anisotropy considered is that resulting from preferred
orientation. A yield criterion is postulated on general grounds which is similar in form to the
Huber-Mises criterion for isotropic metals, but which contains six parameters specifying the
state of anisotropy. By using von Mises’ concept (1928) of a plastic potential, associated
relations are then found between the stress and strain-increment tensors. The theory is applied
to experiments of Körber & Hoff (1928) on the necking under uniaxial tension of thin strips
cut from rolled sheet. It is shown, in full agreement with experimental data, that there are
generally two, equally possible, necking directions whose orientation depends on the angle
between the strip axis and the rolling direction. As a second example, pure torsion of a
thin-walled cylinder is analysed. With increasing twist anisotropy is developed. In accordance
with recent observations by Swift (1947), the theory predicts changes in length of the cylinder.
The theory is also applied to determine the earing positions in cups deep-drawn from rolled
sheet.


1. Introduction 

The macroscopic yielding and plastic straining of isotropic ductile metals has been
described by theories which may be regarded as essentially satisfactory. The well-known
Huber-Mises criterion for determining the elastic limit under combined
stresses of metals which deform homogeneously, for example, copper, aluminium,
nickel, has been confirmed by many experiments over the last twenty years. There
are one or two exceptions, notably annealed mild steel which softens after yielding.
The specimen is traversed by Lüders’ bands and the deformation is non-uniform.
In such circumstances the maximum shear-stress criterion of Coulomb or Tresca
is found to be a better approximation than that of Huber-Mises. This is hardly
surprising since each Lüders’ band is a region where the deformation is a simple
shear. For metals obeying the Huber-Mises law of yielding, the relation between
the plastic strain-increment and the applied combined stress is given by the
Lévy-Mises equations. Strain-hardening can also be taken into account in a relatively
simple way which is fairly satisfactory for biaxial stress systems and limited strain
ranges.

With increasing strain, however, a preferred orientation of crystal planes and 
directions gradually develops, and the individual crystals become elongated to 
form a characteristic fibrous texture in the direction of the most severe tensile strain. 
In this way an originally isotropic metal becomes anisotropic in respect of many 
physical properties. It is well known that the fibre texture produced in the technological
forming processes, rolling, drawing, and extrusion, is sometimes the cause
of undesirable properties in the final product. The most familiar example is perhaps
that of the ‘earing’ of cups deep-drawn from rolled sheet. After severe cold-rolling
the yield stress may be as much as 10 % higher in directions across the fibre than it
is along the fibre (Cook 1937). Ductility shows even more pronounced directional
properties. Such anisotropy can only be removed with difficulty by careful heat
treatment. More often the metallurgist must be content with a compromise in which
a final preferred orientation of allowable proportions is retained. Preferred orientation
is not the only cause of anisotropic plastic properties: laminar inclusions and
cavities occasionally produce similar effects. Residual or internal stresses are 
another cause. The present theory is, however, probably valid only when the 
anisotropy is due to preferred orientation. 

Anisotropy, then, is not to be considered a phenomenon of rather rare occurrence. 
It is difficult to avoid in metal working and is invariably developed by any severe 
strain. Whenever it is present the theories of plastic flow for isotropic metals are 
only valid to a first approximation. This approximation is good enough for many 
purposes, but there are also many phenomena for which these theories fail to
account. The object of the present paper is to formulate a theory capable of describing
the macroscopic behaviour of anisotropic metals. Such a theory must be
constructed on a priori reasoning since no suitable data are as yet available to check 
individual assumptions. It will be shown finally, however, that the theory is, as 
a whole, in close agreement with certain experimental observations. 


2. The plastic potential

In 1928, von Mises, in a brilliant paper on the plastic distortion of crystals,
introduced the fruitful concept of a plastic potential. A detailed account has been
given by Geiringer (1937). This concept will now be explained as it is basic in constructing
the new theory.

Suppose the criterion of yielding under combined stresses is

$$f(\sigma_{ij}) = \text{constant}, \tag{1}$$

where f is regarded as a function of the components $\sigma_{ij}$ of the stress tensor. The function
f will also involve certain parameters characterizing the current state of the material.
Von Mises suggested that the relation between the stress tensor and the natural
strain-increment tensor $d\epsilon_{ij}$ which should be used in conjunction with this yield
criterion is simply

$$d\epsilon_{ij} = \frac{\partial f}{\partial\sigma_{ij}}\,d\lambda. \tag{2}$$

Here $d\lambda$ is a positive scalar factor of proportionality. When carrying out the partial
differentiation it is important to remember that $\sigma_{ij}$ and $\sigma_{ji}$ ($i \neq j$) must be counted as
different for the purposes of tensor analysis. If the metal is isotropic f will not depend
on the choice of axes, and equation (2) is then a true tensor relation. When the metal
is anisotropic the form of f depends, as we shall see, on the choice of axes of reference.
The function f is called the plastic potential.

Let us examine what validity von Mises’ idea has in relation to the known plastic
properties of single crystals and quasi-isotropic polycrystals. The Huber-Mises
criterion of yielding for the latter is

$$2f(\sigma_{ij}) \equiv (\sigma_{xx}-\sigma_{yy})^2 + (\sigma_{yy}-\sigma_{zz})^2 + (\sigma_{zz}-\sigma_{xx})^2 + 6(\sigma_{yz}^2+\sigma_{zx}^2+\sigma_{xy}^2) = 2Y^2, \tag{3}$$

where Y is the yield stress of a bar in uniaxial tension. Regarding the nine stress
components as distinct independent variables, we find

$$\frac{\partial f}{\partial\sigma_{xx}} = 2\sigma_{xx}-\sigma_{yy}-\sigma_{zz},\qquad \frac{\partial f}{\partial\sigma_{yz}} = \frac{\partial f}{\partial\sigma_{zy}} = 3\sigma_{yz},\quad\text{etc.}$$

Now the Lévy-Mises stress-strain relations are

$$d\epsilon_{xx} = d\lambda\,(2\sigma_{xx}-\sigma_{yy}-\sigma_{zz}),\qquad d\epsilon_{yz} = 3\,d\lambda\,\sigma_{yz},\quad\text{etc.}, \tag{4}$$

where $d\epsilon_{xx}$, $d\epsilon_{yz}$, etc. are components of the natural strain-increment tensor. Hence
equations (4) can be written in the form of equation (2).
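The step from (3) to (4) can be checked numerically. The following is a minimal sketch, not part of the original paper: it differentiates the potential of equation (3) by central differences, treating all nine stress components as distinct, and compares two components with the closed forms quoted above. The function names and the sample stress values are illustrative assumptions.

```python
# Illustrative check that equation (2), applied to the Huber-Mises
# potential (3), reproduces the Levy-Mises relations (4).
# All names and numbers are hypothetical, chosen only for the example.


def f_mises(s):
    """Plastic potential f of equation (3); s is a 3x3 nested list.

    The nine components are treated as distinct variables, so each
    6*s_yz**2 term of (3) is written as 3*s_yz**2 + 3*s_zy**2.
    """
    direct = ((s[0][0] - s[1][1]) ** 2
              + (s[1][1] - s[2][2]) ** 2
              + (s[2][2] - s[0][0]) ** 2)
    shear = 3.0 * sum(s[i][j] ** 2
                      for i in range(3) for j in range(3) if i != j)
    return 0.5 * (direct + shear)


def gradient(f, s, h=1e-6):
    """Central-difference estimate of df/ds_ij for every component."""
    g = [[0.0] * 3 for _ in range(3)]
    for i in range(3):
        for j in range(3):
            up = [row[:] for row in s]
            dn = [row[:] for row in s]
            up[i][j] += h
            dn[i][j] -= h
            g[i][j] = (f(up) - f(dn)) / (2.0 * h)
    return g


# An arbitrary symmetric stress state (arbitrary units).
stress = [[3.0, 0.5, -1.0],
          [0.5, 1.0, 2.0],
          [-1.0, 2.0, -2.0]]
grad = gradient(f_mises, stress)

# Closed forms quoted in the text.
expected_xx = 2.0 * stress[0][0] - stress[1][1] - stress[2][2]
expected_yz = 3.0 * stress[1][2]
print(grad[0][0], expected_xx)
print(grad[1][2], expected_yz)
```

Multiplying `grad` by a positive factor plays the role of $d\lambda$ in equation (2).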

Consider next a metal single-crystal which is generally accepted as yielding when
the shear stress, acting over a certain crystal plane in a certain specific direction,
reaches a limiting value. If axes are chosen so that the shear stress is $\sigma_{yz}$, then the
criterion of yielding is $f(\sigma_{ij}) \equiv \sigma_{yz}$ = constant. From equation (2) we find that the
strain-increment tensor has only two non-vanishing components viz. $d\epsilon_{yz}$ and $d\epsilon_{zy}$,
which are of course equal. This represents a shear in the yz plane in the direction of
the shear stress, which is what is actually observed.

In annealed mild steel yielding begins when the maximum shear stress reaches
a limiting value. Proceeding as with the single crystal, equation (2) implies that the
ensuing strain is a shear in the direction of the maximum shear stress. This is just
what happens within a Lüders’ band. Prestrained mild steel, however, is like most
ductile metals in following both the Huber-Mises and Lévy-Mises laws (Taylor &
Quinney 1931).

The success of the plastic potential in these examples gives confidence in applying
it to anisotropic polycrystalline metals. Von Mises himself was concerned with
single crystals. However, the idea seems of wider validity, for in spite of its apparent
artificiality it can be given a physical interpretation. This, too, is due to von Mises.
Consider the work $\sigma_{ij}\,d\epsilon_{ij}$ done on a small element in deforming it through a given
plastic strain-increment $d\epsilon_{ij}$. Suppose we arbitrarily vary $\sigma_{ij}$, subject only to the
restriction imposed by the yield criterion $f(\sigma_{ij})$ = constant. Then for given $d\epsilon_{ij}$
the work done has a stationary value when $\sigma_{ij}$ assumes its actual value given by
equation (2). Stationary values of $\sigma_{ij}\,d\epsilon_{ij}$, subject to $f(\sigma_{ij})$ = constant, are given,
in the usual way, by

$$\frac{\partial}{\partial\sigma_{ij}}\left(\sigma_{ij}\,d\epsilon_{ij} - f\,d\lambda\right) = 0,$$

where $d\lambda$ is an undetermined constant multiplier. This leads immediately to equation
(2) and the statement is proved. In the special case when f is the Huber-Mises
expression the work done has a maximum value (Hill, Lee & Tupper 1947), but this
may not be true in general. It is true for the yield criterion proposed in § 3, except
possibly when one of F, G, H is negative.

3. A yield criterion for anisotropic metals

It is assumed that the anisotropy has three mutually orthogonal planes of symmetry
at every point. The three planes of symmetry meet in three orthogonal
directions which may be called the principal axes of anisotropy. These axes can,
and frequently will, vary in direction at any moment from point to point of the bulk
metal. For example, if anisotropy is developed through the severe uniform expansion
of a hollow cylinder by internal pressure, the principal axes of anisotropy can be
expected to lie in the radial, circumferential and longitudinal directions. An example
of uniformly directional anisotropy would be that in a bar cut from the central part
of a cold-rolled sheet; the principal axes would lie in the direction of rolling, transversely
in the plane of the sheet, and normal to this plane. Again, the principal axes
in a given element can vary relatively to the element itself as deformation continues,
for example, in simple torsion of a hollow cylinder.

Let us fix our attention on a given element in a certain state of anisotropy and
choose the principal axes of anisotropy as Cartesian axes of reference. By analogy
with the Huber-Mises yield criterion for isotropic metal it is natural to select some
homogeneous quadratic in the stresses to represent the plastic potential. This would
imply that the yield stress in tension for any direction is the same as that in compression,
i.e. the absence of a Bauschinger effect. Now cold work does generally give
rise to a Bauschinger effect, just as it produces a preferred orientation and fibre
texture. The former can be removed by mild annealing, but the latter remains.
Hence it is perfectly possible, both in principle and in practice, to have an anisotropic
metal with no Bauschinger effect. The Bauschinger effect will be ignored in the
present analysis; it will not be difficult to take account of it when experiment shows
such a refinement in the theory to be desirable. We can, then, safely choose a homogeneous
quadratic for the plastic potential. Furthermore, in view of the symmetry
assumption, terms in which any one shear stress occurs linearly must be rejected.
If it be also assumed that the superposition of hydrostatic pressure does not influence
yielding, the plastic potential or yield criterion takes the form

$$2f(\sigma_{ij}) \equiv F(\sigma_{yy}-\sigma_{zz})^2 + G(\sigma_{zz}-\sigma_{xx})^2 + H(\sigma_{xx}-\sigma_{yy})^2 + 2L\sigma_{yz}^2 + 2M\sigma_{zx}^2 + 2N\sigma_{xy}^2 = 1. \tag{5}$$

F, G, H, L, M, N are constants characteristic of the current state of anisotropy.
The expression for f only has this form when the principal axes of anisotropy are
chosen as axes of reference. If other axes of reference are taken, the form changes
in a way that can be found by transforming the stress components.

If X, Y, Z are the tensile yield stresses in the principal anisotropic directions, it
is easily seen that

$$\begin{aligned}
\frac{1}{X^2} &= G+H, & 2F &= \frac{1}{Y^2}+\frac{1}{Z^2}-\frac{1}{X^2},\\
\frac{1}{Y^2} &= H+F, & 2G &= \frac{1}{Z^2}+\frac{1}{X^2}-\frac{1}{Y^2},\\
\frac{1}{Z^2} &= F+G, & 2H &= \frac{1}{X^2}+\frac{1}{Y^2}-\frac{1}{Z^2}.
\end{aligned} \tag{6}$$

It is clear that only one of F, G, H can be negative, a situation which, however, is
unlikely to occur in practice, since it implies rather large differences between the
yield stresses. Also F > G if and only if X > Y, etc. These inequalities will be used
later when comparing theory and experiment. If R, S, T are the yield stresses in
shear with respect to the principal axes of anisotropy, then

$$2L = \frac{1}{R^2},\qquad 2M = \frac{1}{S^2},\qquad 2N = \frac{1}{T^2}. \tag{7}$$

L, M, N are thus essentially positive.

To describe fully the state of anisotropy in an element the six independent yield
stresses, X, Y, Z, R, S, T need to be given. They must be considered as dependent
on the strain-history of the material and so varying with strain. For present purposes
it is not necessary to specify the state of anisotropy more closely than this. No
attempt will be made to examine, for example, how the yield stresses change with
increasing distortion, or how they can be quantitatively related to the degree of
preferred orientation.

If there is rotational symmetry of the anisotropy in an element about the z axis,
then

$$F = G,\qquad N = G + 2H. \tag{8}$$

This may be verified by requiring the form of f to remain invariant for arbitrary
x, y axes of reference. For complete spherical symmetry, i.e. isotropy, F = G = H
and L = M = N = 3F. The expression (5) is then identical with (3) when 2F is
put equal to $1/Y^2$.
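Equations (5) to (7) translate directly into a small calculation. The sketch below is illustrative only: the function names and the numerical yield stresses are assumptions, not data from the paper. It builds the six parameters from six yield stresses, confirms that a uniaxial stress X along the x axis and a pure shear T in the xy plane both satisfy the criterion, and checks the isotropic limit L = 3F with a shear yield stress of $Y/\sqrt{3}$.

```python
import math


def hill_parameters(X, Y, Z, R, S, T):
    """Return (F, G, H, L, M, N) from the six yield stresses, equations (6) and (7)."""
    F = 0.5 * (1.0 / Y**2 + 1.0 / Z**2 - 1.0 / X**2)
    G = 0.5 * (1.0 / Z**2 + 1.0 / X**2 - 1.0 / Y**2)
    H = 0.5 * (1.0 / X**2 + 1.0 / Y**2 - 1.0 / Z**2)
    L = 0.5 / R**2
    M = 0.5 / S**2
    N = 0.5 / T**2
    return F, G, H, L, M, N


def hill_two_f(params, stress):
    """Left-hand side 2f of equation (5); yielding occurs when it equals 1.

    stress = (sxx, syy, szz, syz, szx, sxy), referred to the
    principal axes of anisotropy.
    """
    F, G, H, L, M, N = params
    sxx, syy, szz, syz, szx, sxy = stress
    return (F * (syy - szz) ** 2 + G * (szz - sxx) ** 2 + H * (sxx - syy) ** 2
            + 2.0 * L * syz ** 2 + 2.0 * M * szx ** 2 + 2.0 * N * sxy ** 2)


# Hypothetical yield stresses in arbitrary units.
aniso = hill_parameters(X=1.00, Y=1.05, Z=0.95, R=0.60, S=0.58, T=0.62)
tension_x = hill_two_f(aniso, (1.00, 0.0, 0.0, 0.0, 0.0, 0.0))
shear_xy = hill_two_f(aniso, (0.0, 0.0, 0.0, 0.0, 0.0, 0.62))

# Isotropic limit: equal tensile yield stresses, shear yield stress Y/sqrt(3).
Y0 = 2.0
k0 = Y0 / math.sqrt(3.0)
iso = hill_parameters(Y0, Y0, Y0, k0, k0, k0)
print(tension_x, shear_xy, iso)
```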

It might be asked why f is not taken as the expression for the distortional energy
due to shear alone in an anisotropic elastic medium, by analogy with Hencky’s
interpretation of the Huber-Mises yield criterion for isotropic materials. There are
two objections to this procedure. A hydrostatic pressure generally produces shear
strain in an anisotropic elastic material. Hence we cannot take the expression for
shear-strain energy as the plastic potential without implying that a superposed
hydrostatic pressure influences yielding. Again, the anisotropic shear-strain energy
is not independent of elastic constants as is the isotropic shear-strain energy, apart
from a multiplying factor. The elastic constants will therefore appear in the expression
(2) for the plastic strain-increment. This is of course not physically plausible.

4. Relations between stress and strain-increment for anisotropic metals

By applying the equation (2) to the yield criterion (5) the following expressions
are obtained, relating the components of the strain-increment tensor to the stress
components,

$$\begin{aligned}
d\epsilon_{xx} &= d\lambda\,[H(\sigma_{xx}-\sigma_{yy}) + G(\sigma_{xx}-\sigma_{zz})], & d\epsilon_{yz} &= d\lambda\,L\sigma_{yz},\\
d\epsilon_{yy} &= d\lambda\,[F(\sigma_{yy}-\sigma_{zz}) + H(\sigma_{yy}-\sigma_{xx})], & d\epsilon_{zx} &= d\lambda\,M\sigma_{zx},\\
d\epsilon_{zz} &= d\lambda\,[G(\sigma_{zz}-\sigma_{xx}) + F(\sigma_{zz}-\sigma_{yy})], & d\epsilon_{xy} &= d\lambda\,N\sigma_{xy}.
\end{aligned} \tag{9}$$

It will be noticed that $d\epsilon_{xx} + d\epsilon_{yy} + d\epsilon_{zz} = 0$ identically for all stress systems. This
of course is essential in view of the known fact that the volume change associated
with plastic strain is negligible. The zero volume change is directly connected with
the assumption that yielding is independent of hydrostatic pressure. Either is a
consequence of the other for any yield criterion whatever. This can most readily be
seen by considering the stationary work interpretation of the plastic potential
method. For a superposed hydrostatic pressure does not change the work done in
a given strain when the volume change is zero. Thus if we accept, as experimental
facts, the zero plastic volume change and the independence of the yield criterion
with respect to hydrostatic pressure, then the method of the plastic potential is
not only consistent with both but provides a link between them.

When using the above equations it must be remembered that they only have this 
form when the stresses and strain-increments are referred to the principal axes of 
anisotropy. Further, the principal axes of stress and strain-increment coincide 
when the principal stress directions are along the principal axes of anisotropy, but 
not in general otherwise. 
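The two properties just described, zero volume change and indifference to a superposed hydrostatic pressure, can be illustrated with a short calculation. This is a sketch under assumed parameter values (the numbers for F, G, H, L, M, N and the stress state are hypothetical), with the stresses referred to the principal axes of anisotropy as equations (9) require.

```python
def strain_increment(params, stress, dlam=1.0):
    """Strain-increment components from equations (9).

    params = (F, G, H, L, M, N); stress = (sxx, syy, szz, syz, szx, sxy),
    both referred to the principal axes of anisotropy. dlam is the
    positive scalar d(lambda).
    """
    F, G, H, L, M, N = params
    sxx, syy, szz, syz, szx, sxy = stress
    exx = dlam * (H * (sxx - syy) + G * (sxx - szz))
    eyy = dlam * (F * (syy - szz) + H * (syy - sxx))
    ezz = dlam * (G * (szz - sxx) + F * (szz - syy))
    eyz = dlam * L * syz
    ezx = dlam * M * szx
    exy = dlam * N * sxy
    return exx, eyy, ezz, eyz, ezx, exy


# Hypothetical anisotropic parameters and stress state (arbitrary units).
params = (0.5, 0.6, 0.4, 1.5, 1.6, 2.0)
stress = (1.0, -0.3, 0.2, 0.1, 0.0, 0.4)

de = strain_increment(params, stress)
volume_change = de[0] + de[1] + de[2]

# Superpose a hydrostatic pressure on the direct stresses only.
pressure = 5.0
shifted = tuple(s - pressure for s in stress[:3]) + stress[3:]
de_shifted = strain_increment(params, shifted)

# Pure tension along x: the direct strains are in the ratio (G+H) : -H : -G.
uniaxial = strain_increment(params, (1.0, 0.0, 0.0, 0.0, 0.0, 0.0))
print(volume_change, de, de_shifted, uniaxial[:3])
```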

Consider now some simple stress systems for a material with anisotropy uniformly
distributed in magnitude and direction. In pure tension X parallel to the principal
x axis of anisotropy the incremental strains at a certain stage are

$$d\epsilon_{xx} : d\epsilon_{yy} : d\epsilon_{zz} = (G+H) : -H : -G.$$

The specimen contracts in the transverse directions unless one of G and H is negative,
which seems hardly likely to occur. The contraction in the y direction is greater
than that in the z direction if H > G, i.e. if Z > Y. An experimental determination
of the ratio $d\epsilon_{yy}/d\epsilon_{zz}$ would give the value of H/G. Similarly from tension tests in
the y, z directions values of G/F, F/H can be found. Apart from the check on the
theory in view of the identity $\frac{H}{G}\times\frac{G}{F}\times\frac{F}{H} = 1$, this would allow an alternative way
of determining the ratios X/Y, Y/Z of the yield stresses. If the yielding is not
particularly sharp, direct measurement of X, Y, Z would be unreliable. Similar
experiments can, in principle, be devised to find the shear-stress ratios R/S, R/T.
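As a rough illustration of this alternative route, the sketch below converts two measured strain ratios, H/G and G/F, into the yield-stress ratios X/Y and Y/Z by way of equations (6). The function names are hypothetical, and F is set to 1 as an arbitrary scale since only ratios matter; the third ratio F/H serves as the consistency check mentioned in the text.

```python
import math


def yield_ratios_from_strain_ratios(h_over_g, g_over_f):
    """Return (X/Y, Y/Z) from the ratios H/G and G/F.

    Uses 1/X^2 = G + H, 1/Y^2 = H + F, 1/Z^2 = F + G from equations (6),
    with F = 1 chosen as an arbitrary scale.
    """
    F = 1.0
    G = g_over_f * F
    H = h_over_g * G
    x_over_y = math.sqrt((H + F) / (G + H))
    y_over_z = math.sqrt((F + G) / (H + F))
    return x_over_y, y_over_z


def identity_product(h_over_g, g_over_f, f_over_h):
    """Product (H/G)(G/F)(F/H); it should equal 1 for consistent data."""
    return h_over_g * g_over_f * f_over_h


# Hypothetical parameters used to manufacture 'measured' ratios.
F, G, H = 0.5, 0.6, 0.4
x_over_y, y_over_z = yield_ratios_from_strain_ratios(H / G, G / F)

# Direct values from equations (6), for comparison.
X = 1.0 / math.sqrt(G + H)
Y = 1.0 / math.sqrt(H + F)
Z = 1.0 / math.sqrt(F + G)
print(x_over_y, X / Y, y_over_z, Y / Z)
```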

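In plane strain, where motion is prevented in the z principal direction of anisotropy,
we find by putting $d\epsilon_{zz} = 0$ in equation (9) that

$$\sigma_{zz} = \frac{G\sigma_{xx} + F\sigma_{yy}}{G+F},$$

and thus from equation (5) that

$$\frac{FG+GH+HF}{F+G}\,(\sigma_{xx}-\sigma_{yy})^2 + 2N\sigma_{xy}^2 = 1. \tag{10}$$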


It is easy to show that the tensile yield stresses are equal in any pair of orthogonal
directions in the plane, although $\sigma_{zz}$ is different in the two cases. It can also be shown
that the dependence of the tensile yield stress on orientation is such that maxima
or minima occur in directions along the anisotropic axes, or in directions making
45° with the axes. If

$$\frac{2(FG+GH+HF)}{N(F+G)} < 1,$$

the yield stress has a minimum in the 45° directions and a maximum in the x and y
directions; for the opposite inequality the reverse is true. When the equality holds,
the yield stress does not vary with orientation. In view of the identities

$$\frac{2(FG+GH+HF)}{N(F+G)} = \frac{F+2H}{N} + \frac{F(G-F)}{N(F+G)} = \frac{G+2H}{N} + \frac{G(F-G)}{N(F+G)},$$

it is clear that when N > F + 2H, G + 2H, then

$$\frac{2(FG+GH+HF)}{N(F+G)} < 1;$$

and when N < F + 2H, G + 2H, then

$$\frac{2(FG+GH+HF)}{N(F+G)} > 1.$$

The physical significance of these inequalities will be further exemplified in § 5.

The relations between the components of the strain-increment and stress tensors
in a state of plane strain are

$$d\epsilon_{xx} + d\epsilon_{yy} = 0 \quad\text{and}\quad \frac{d\epsilon_{xx}-d\epsilon_{yy}}{2\,d\epsilon_{xy}} = \frac{2(FG+GH+HF)}{N(F+G)}\,\frac{\sigma_{xx}-\sigma_{yy}}{2\sigma_{xy}}. \tag{11}$$

Hence if $\psi$ is the angle between a principal stress direction and the x axis, and $\psi'$
is the angle between a principal strain-increment direction and the x axis, then

$$\frac{\tan 2\psi}{\tan 2\psi'} = \frac{2(FG+GH+HF)}{N(F+G)}. \tag{12}$$

Hence $\psi = \psi'$ if, and only if, $\psi = 0, \pm\tfrac14\pi, \pm\tfrac12\pi$. Thus the principal axes of stress
and strain-increment coincide in the directions in which the tensile yield stress has
maximum or minimum values. The set of equations (10) and (11) together with the
equilibrium equations, considered in terms of the five dependent variables (three
stress components and two incremental displacement components) is hyperbolic.
There are only two distinct characteristics: these are the orthogonal directions of
maximum shear strain-increment or of zero extension. These, as has been remarked,
are not generally the directions of maximum shear stress. The four differential
relations holding along the characteristics are found to be analogous to the well-known
Hencky relations for an isotropic metal. It is scarcely worth while to include
an account of these in the present paper while the theory has not yet been sufficiently
tested.
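The plane-strain reductions (10) to (12) can be confirmed numerically from equations (5) and (9). The following is only a sketch: the parameter values and the in-plane stress state are hypothetical, and $d\lambda$ is set to 1 since it cancels from every ratio.

```python
def plane_strain_state(F, G, H, N, sxx, syy, sxy):
    """Return sigma_zz, the value of 2f from (5), and the strain increments from (9).

    sigma_zz is fixed by the plane-strain condition d(eps_zz) = 0;
    the out-of-plane shear stresses are taken as zero and d(lambda) = 1.
    """
    szz = (G * sxx + F * syy) / (F + G)
    two_f = (F * (syy - szz) ** 2 + G * (szz - sxx) ** 2
             + H * (sxx - syy) ** 2 + 2.0 * N * sxy ** 2)
    dexx = H * (sxx - syy) + G * (sxx - szz)
    deyy = F * (syy - szz) + H * (syy - sxx)
    dezz = G * (szz - sxx) + F * (szz - syy)
    dexy = N * sxy
    return szz, two_f, (dexx, deyy, dezz, dexy)


# Hypothetical parameters and in-plane stresses (arbitrary units).
F, G, H, N = 0.5, 0.6, 0.4, 2.0
sxx, syy, sxy = 1.2, 0.3, 0.25

szz, two_f, (dexx, deyy, dezz, dexy) = plane_strain_state(F, G, H, N, sxx, syy, sxy)

# Right-hand factor common to equations (11) and (12).
k = 2.0 * (F * G + G * H + H * F) / (N * (F + G))

# Left-hand side of equation (10) evaluated for the same stresses.
reduced = (F * G + G * H + H * F) / (F + G) * (sxx - syy) ** 2 + 2.0 * N * sxy ** 2

tan_2psi = 2.0 * sxy / (sxx - syy)                 # principal stress direction
tan_2psi_prime = 2.0 * dexy / (dexx - deyy)        # principal strain-increment direction
print(two_f, reduced, tan_2psi / tan_2psi_prime, k)
```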

For a thin sheet perpendicular to the z direction and in a state of plane stress

$$(G+H)\sigma_{xx}^2 - 2H\sigma_{xx}\sigma_{yy} + (H+F)\sigma_{yy}^2 + 2N\sigma_{xy}^2 = 1. \tag{13}$$

The maxima and minima of the yield stress in uniaxial tension occur along the
anisotropic axes and in a direction $\alpha$ with respect to the x axis, where

$$\tan^2\alpha = \frac{N-G-2H}{N-F-2H}. \tag{14}$$

If N > F + 2H, G + 2H, the yield stress has maximum (unequal) values in the x, y
directions and minimum (equal) values in the $\alpha$ directions.* If N < F + 2H, G + 2H,
the converse is true. If N is intermediate to F + 2H and G + 2H there is no real $\alpha$
satisfying equation (14). The yield stress then has a maximum in the x direction and
a minimum in the y direction if F > G, and vice versa. The relations between the
components of the strain-increment and stress tensors in a state of plane stress are

$$\frac{d\epsilon_{xx}}{d\epsilon_{yy}} = \frac{(G+H)\sigma_{xx} - H\sigma_{yy}}{(F+H)\sigma_{yy} - H\sigma_{xx}},\qquad \frac{d\epsilon_{xx}-d\epsilon_{yy}}{2\,d\epsilon_{xy}} = \frac{(G+2H)\sigma_{xx} - (F+2H)\sigma_{yy}}{2N\sigma_{xy}}. \tag{15}$$

* Compare values for 0·1 % proof stress given by Cook (loc. cit., tables I and II).

The principal axes of stress and strain-increment coincide when the principal axes
of stress are along the anisotropic axes, and also for special states of stress such that

$$\frac{\sigma_{yy}}{\sigma_{xx}} = \frac{N-G-2H}{N-F-2H}.$$

If the state of stress is a uniaxial tension $\sigma$ directed at an angle $\alpha$ with the x axis then

$$\sigma_{xx} = \sigma\cos^2\alpha,\qquad \sigma_{yy} = \sigma\sin^2\alpha,\qquad \sigma_{xy} = \sigma\sin\alpha\cos\alpha.$$

The principal axes of stress and strain-increment then coincide when $\alpha$ has the value
given by equation (14). $\alpha$ is real only when (N − F − 2H) and (N − G − 2H) have the
same sign. The directions for which the principal axes of stress and strain-increment
coincide in uniaxial tension are thus those for which the yield stress has maximum
and minimum values.
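The variation of the uniaxial yield stress with orientation, and the special direction of equation (14), can be explored with a few lines of code. This sketch substitutes the uniaxial stress components into (13); the parameter values are hypothetical and were chosen so that N exceeds both F + 2H and G + 2H, the case in which the yield stress has an interior minimum.

```python
import math


def uniaxial_yield_stress(F, G, H, N, alpha):
    """Tensile yield stress at angle alpha (radians) to the x axis, from (13)."""
    c2 = math.cos(alpha) ** 2
    s2 = math.sin(alpha) ** 2
    inv_sq = (G + H) * c2 * c2 + (F + H) * s2 * s2 + 2.0 * (N - H) * s2 * c2
    return 1.0 / math.sqrt(inv_sq)


def extremum_angle(F, G, H, N):
    """Angle satisfying equation (14), or None when no real angle exists."""
    num = N - G - 2.0 * H
    den = N - F - 2.0 * H
    if den == 0.0 or num / den < 0.0:
        return None
    return math.atan(math.sqrt(num / den))


# Hypothetical parameters with N > F + 2H and N > G + 2H.
F, G, H, N = 0.5, 0.6, 0.4, 2.0
alpha_star = extremum_angle(F, G, H, N)

# Brute-force search for the minimum over 0..90 degrees in 0.01 degree steps.
steps = 9000
grid = [0.5 * math.pi * k / steps for k in range(steps + 1)]
alpha_min = min(grid, key=lambda a: uniaxial_yield_stress(F, G, H, N, a))

# With N between F + 2H and G + 2H there is no real solution of (14).
no_angle = extremum_angle(F, G, H, 1.35)
print(math.degrees(alpha_star), math.degrees(alpha_min), no_angle)
```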

The characteristics for a state of plane stress are in directions satisfying

$$[(G+H)\sigma_{xx} - H\sigma_{yy}]\,dx^2 + 2N\sigma_{xy}\,dx\,dy + [(F+H)\sigma_{yy} - H\sigma_{xx}]\,dy^2 = 0. \tag{16}$$

The characteristics are in the directions of zero extension in the plane of the sheet.
They are not generally perpendicular because of the $d\epsilon_{zz}$ strain, nor are they always
real. The plane stress problem is thus sometimes hyperbolic and sometimes elliptic,
just as in the case of an isotropic metal. The coincidence of the characteristics with
the directions of zero extension in both plane strain and plane stress is closely
connected with the plastic potential. In these special cases the characteristics can
be found for the Cauchy problem in the stresses and the incremental displacements
separately, since the full set of equations breaks down into distinct halves. The
two pairs of characteristic directions corresponding to the two halves can be shown
to be the same if, and only if, the stress-strain equations and the yield criterion are
connected on the plastic potential basis.

5. The necking of strip under tension 

A check on the present theory can be obtained by analyzing the experimental
data of Körber & Hoff (1928). These workers carried out tensile tests on strips cut
from thin rolled sheet. Now it is well known that a sufficiently thin strip of isotropic
metal, pulled in tension, develops a neck making an angle of about 55° with the
direction of pull. Körber & Hoff, however, observed that for a strip cut at a definite
angle to the rolling direction either of two distinct necking angles occurred apparently
at random. Suppose a strip is cut with its length making an angle $\alpha$ with the rolling
direction in the original sheet (figure 1a). In figure 1b the strip is shown with the
two equally likely directions of necking.


Figure 1. Positions of the two possible necks in a flat strip
cut from rolled sheet and pulled in tension.

Körber & Hoff plotted graphs giving the two necking angles as functions of $\alpha$.
Figure 2 indicates qualitatively the general trend of their results; in detail the graphs
depended on the material used and reduction in rolling.

The ordinate scale represents the acute angle between the line of the neck and
the strip axis; the abscissa measures $\alpha$, which of course ranges from 0 to $\tfrac12\pi$. It is
convenient to distinguish the two possible necking directions according as they
tend to lie with the fibre (i.e. the rolling direction) or across it. The full curve in the
figure corresponds to necking across the fibre; the dashed curve corresponds to
necking with the fibre. It will be seen that there are three values of $\alpha$ for which the
two possible necks make the same angle with the strip axis, thus giving the graph
a figure of eight appearance. In the main sheet the principal axes of anisotropy are
along and transverse to the rolling directions, and normal to the sheet. Hence the
equality of the necking angles when $\alpha$ is either 0 or $\tfrac12\pi$ is what would be expected
from symmetry considerations since the axes of anisotropy are then parallel to the
sides of the strip. The reason for the coincidence of the angles for a certain intermediate
$\alpha$ will become clear when we analyze the experiments theoretically.



Figure 2. Qualitative representation of Körber & Hoff's measured values of the necking angle
as a function of the angle $\alpha$ between strip axis and rolling direction. Full curve, across fibre;
dashed curve, with fibre.

To begin with, consider a flat strip of isotropic metal strained in tension. Necking 
will begin after some preliminary extension at a point where there is a slight non¬ 
uniformity in the strip, either geometrical or structural. In view of the property of 
characteristics as directions along which small disturbances propagate, it is natural 
to suppose that the line of the neck will coincide with one or other of the two cha¬ 
racteristics through the origin of the disturbance. If this origin lies not on the edge, 
but in the middle of the bar, a V -shaped neck is sometimes observed, with its branches 
coinciding with parts of both characteristics. We can immediately find the directions 
of the characteristics from equation (16) for a state of plane stress. Take the x axis 
in the direction of pulling and set F » 0 «* H « in view of the assumed isotropy. 

The directions of the characteristics are found to make an angle tan* 1 ^2, or approxi¬ 
mately 55°, with the strip axis. It has already been mentioned that the incremental 
extension in the characteristic directions is zero. The extension along the line of 
the neok is thus zero, and the neck forms by thinning in the thickness of the strip. 



A theory of the yielding and plastic flow of anisotropic metals 291 

Orowan, in unpublished work on the pulling of isotropic strip, takes as a starting-point
the assumption that necking will begin along a direction of zero extension;
the final result for the necking angle is therefore the same. Körber & Siebel (1928)
have also obtained this value by an entirely different argument which is, however,
unconvincing.

When the strip is anisotropic and the anisotropy is uniformly distributed, let the
$x$ principal axis of anisotropy be taken along the fibre or rolling direction (figure 3).
The $x$ axis makes an acute angle $\alpha$ with the strip axis or pulling direction. The axis
of $z$ is as usual normal to the plane of the strip (i.e. to the plane of the paper in
figure 3).

Figure 3. Co-ordinate system for analysis of the necking of flat strips in tension.

The angle $\theta$, measured from the axis in a sense away from the rolling direction, denotes the
inclination of a possible necking direction. It is the numerical value of $\theta$ that is
plotted in figure 2. Except for material that has received very little cold work by
rolling, the extension prior to necking is small compared with the rolling strains,
and so the state of anisotropy in the strip at the beginning of necking will be effectively
the same as in the rolled sheet. Let $F$, $G$, $H$, $L$, $M$, $N$, as in §2, denote the
anisotropic parameters specifying this state. If $\sigma$ is the tensile stress in the strip
when necking begins, the stress components referred to the $x$, $y$ axes are


$$\sigma_{xx} = \sigma\cos^2\alpha, \qquad \sigma_{yy} = \sigma\sin^2\alpha, \qquad \sigma_{xy} = \sigma\sin\alpha\cos\alpha.$$

Inserting these in equation (16) with $dy/dx = \tan(\theta+\alpha)$ we obtain after some
reduction:

$$a\tan^2\theta + b\tan\theta - c = 0, \qquad (17)$$

where

$$a = H + \sin^2\alpha\cos^2\alpha\,[(N-F-2H) + (N-G-2H)],$$

$$b = 2\sin\alpha\cos\alpha\,[(N-F-2H)\sin^2\alpha - (N-G-2H)\cos^2\alpha],$$

$$c = F\sin^2\alpha + G\cos^2\alpha + H + \sin^2\alpha\cos^2\alpha\,[(N-F-2H) + (N-G-2H)].$$


By using (6) $c$ can be shown to equal $1/\sigma^2$ and is therefore essentially positive for
all values of $\alpha$. $a$ is positive except when $2N < F + G$. Since $N = 3F = 3G$ for an
isotropic metal, the possibility of $a$ being negative is most unlikely and can be
disregarded. There are therefore in general two distinct necks with different inclinations
corresponding to the roots of this quadratic in $\tan\theta$. The roots are numerically
equal, but opposite in sign when $b = 0$, which is true for $\alpha = 0$, $\tfrac{1}{2}\pi$, and for an
intermediate value $\bar\alpha$ given by

$$\tan^2\bar\alpha = \frac{N-G-2H}{N-F-2H}. \qquad (18)$$


For these three values of $\alpha$ the two possible necks are symmetrically situated with
respect to the strip axis. Since the neck is a direction of zero extension the principal
axes of strain-increment are along and perpendicular to the axis for these three
$\alpha$-values, and so coincide with the principal stress axes. The theory thus predicts
the figure of eight pattern of figure 2. For $\bar\alpha$ to be real, $(N-G-2H)$ and $(N-F-2H)$
must have the same sign and this condition must presumably have been satisfied for
the materials used by Körber & Hoff. Further, if $(N-F-2H)$ and $(N-G-2H)$
are positive, the coefficient $b$ is negative for $\alpha < \bar\alpha$ and positive for $\alpha > \bar\alpha$. Hence the
sum $(-b/a)$ of the roots is positive for $\alpha < \bar\alpha$ and negative for $\alpha > \bar\alpha$. This means that
the angle for necking across the fibre is numerically larger than that for necking
with the fibre when $\alpha < \bar\alpha$, and vice versa when $\alpha > \bar\alpha$. This agrees with the relative
positions of the full and dashed curves in figure 2. If $(N-F-2H)$ and $(N-G-2H)$
are negative the curves are interchanged. We must therefore presume further that
the anisotropy of the materials used by Körber & Hoff is such that

$$N > F + 2H, \qquad N > G + 2H.$$

That there is nothing intrinsically impossible in these inequalities may be realized
by remembering that $N = F + 2H = G + 2H$ when there is rotational symmetry
of the anisotropy about the $z$ axis.
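The behaviour of the two roots of the quadratic (17), and the symmetric orientation of equation (18), can be explored numerically. The sketch below is illustrative only: the function names are hypothetical, the parameter values in any call are placeholders rather than measured data, and it assumes $a > 0$ and $c > 0$ as argued in the text.

```python
import math

def necking_angles_deg(F, G, H, N, alpha_deg):
    """Both roots of a*tan^2(theta) + b*tan(theta) - c = 0, equation (17), in degrees."""
    sn = math.sin(math.radians(alpha_deg))
    cs = math.cos(math.radians(alpha_deg))
    p = N - F - 2.0 * H
    q = N - G - 2.0 * H
    a = H + sn * sn * cs * cs * (p + q)
    b = 2.0 * sn * cs * (p * sn * sn - q * cs * cs)
    c = F * sn * sn + G * cs * cs + H + sn * sn * cs * cs * (p + q)
    disc = math.sqrt(b * b + 4.0 * a * c)  # real when a > 0 and c > 0 (assumed)
    t_pos = (-b + disc) / (2.0 * a)
    t_neg = (-b - disc) / (2.0 * a)
    return math.degrees(math.atan(t_pos)), math.degrees(math.atan(t_neg))

def symmetric_alpha_deg(F, G, H, N):
    """Intermediate angle of equation (18), or None when it is not real."""
    p = N - F - 2.0 * H
    q = N - G - 2.0 * H
    if p == 0.0 or q / p < 0.0:
        return None
    return math.degrees(math.atan(math.sqrt(q / p)))
```

For the isotropic choice $F = G = H$, $N = 3F$ the two roots are equal and opposite at every $\alpha$ and reduce to $\pm\tan^{-1}\sqrt{2}$, while `symmetric_alpha_deg` returns `None` because both brackets in (18) vanish.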

The final feature of figure 2 which must be verified theoretically is that the general
trend of Körber & Hoff's data was such that the figure of eight sloped downward
from $\alpha = 0$ to $\alpha = \tfrac{1}{2}\pi$. Suppose $\theta_1$ is the value of $\theta$ when $\alpha = 0$, and $\theta_2$ the value when
$\alpha = \tfrac{1}{2}\pi$. Then

$$\tan^2\theta_1 = 1 + \frac{G}{H}, \qquad \tan^2\theta_2 = 1 + \frac{F}{H}.$$

By using the equations (6) it can be shown that the following possibilities arise,
depending on the relative magnitude of the yield stresses $X$, $Y$, $Z$:

(i) If $Z > X, Y$ then $\theta_1 \lessgtr \theta_2$ according as $X \gtrless Y$. Both $\theta_1$ and $\theta_2$ are less than 55°.

(ii) If $Y > Z > X$ then $\theta_1 > 55° > \theta_2$.

(iii) If $X > Z > Y$ then $\theta_1 < 55° < \theta_2$.

(iv) If $X, Y > Z$, then, if $H$ is positive, $\theta_2 \gtrless \theta_1$ according as $X \gtrless Y$. Both $\theta_1$, $\theta_2$ are
greater than 55°. If $H$ is negative the $\theta$'s are imaginary and expansion occurs in
the transverse direction; according to the theory necking should be inhibited.
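The four cases can be checked by computing $\theta_1$ and $\theta_2$ directly from trial yield stresses. The sketch below assumes the usual relations of the theory between the parameters and the tensile yield stresses, $1/X^2 = G + H$, $1/Y^2 = H + F$, $1/Z^2 = F + G$, which are not restated in this section; the function name and the numerical inputs are illustrative only.

```python
import math

def end_angles_deg(X, Y, Z):
    """theta_1 (alpha = 0) and theta_2 (alpha = pi/2) in degrees, or None if H <= 0.

    Assumes 1/X^2 = G + H, 1/Y^2 = H + F, 1/Z^2 = F + G.
    """
    F = 0.5 * (1.0 / Y**2 + 1.0 / Z**2 - 1.0 / X**2)
    G = 0.5 * (1.0 / Z**2 + 1.0 / X**2 - 1.0 / Y**2)
    H = 0.5 * (1.0 / X**2 + 1.0 / Y**2 - 1.0 / Z**2)
    if H <= 0.0:
        return None  # no real necking direction, as in case (iv) with negative H
    theta_1 = math.degrees(math.atan(math.sqrt(1.0 + G / H)))
    theta_2 = math.degrees(math.atan(math.sqrt(1.0 + F / H)))
    return theta_1, theta_2
```

For example, trial values with $Y > Z > X$ give $\theta_1$ above and $\theta_2$ below $\tan^{-1}\sqrt{2}$, in line with case (ii).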

It is found for many rolled metals that after severe reductions the yield stress
in the rolling direction is less than that in the transverse direction. In particular
this was observed by Körber & Hoff for the metals used by them. In the present
notation $x$ is the rolling direction and so $Y > X$. The magnitude of $Z$ was not
measured. Now Körber & Hoff found $\theta_1 > \theta_2$, and this, according to the above
analysis, is required by $Y > X$. Their values of $\theta_1$, $\theta_2$ were distributed haphazardly
about 55° and so the relative magnitude of $Z$ must have varied similarly.

The theory is thus in agreement with the experimental data at all points where
a test is possible. Closer quantitative agreement could not be checked without
a knowledge of the absolute magnitudes of the anisotropic parameters.

6. The effect of anisotropy developed in a torsion test

Consider a torsion test performed on an initially isotropic, thin-walled cylinder.
Let us fix attention on a small element of the wall. In figure 4a the plane of the paper
is to be regarded as tangential to the cylinder wall; $AA'$ is the direction of shear
and $BB'$ is the axial direction.



Figure 4. Deformation in pure torsion. 


After twisting, the initially square element shears over into a parallelogram
(figure 4b). An arbitrary direction $LOL'$ fixed in the element rotates into a new
position and simultaneously suffers a resultant extension or compression. For a
given total shear it is clear that there will be some initial direction which undergoes
most compression, and also one which undergoes most extension. A fibre structure
will thus develop, and the principal axes of anisotropy in the plane of the wall at
any moment can be expected to coincide approximately with the final positions of
the directions of greatest extension and compression. These can be determined by
a simple construction (figure 5). $TT'$ is the direction of shear; $ON$, which can be
taken of unit length, is the perpendicular on $TT'$ from the centre $O$ of an element.
For a given total shear-strain $\gamma$ there are two directions which undergo no resultant
strain. One of these is clearly parallel to $OT$. If $DD' = \gamma$ and $D$, $D'$ are reflexions
in $ON$, then $OD$ is the initial position of the other and $OD'$ is its final position. This
direction undergoes compression and then extension. Now let $OC$, $OE$ be the internal
and external bisectors of the angle between $OT$ and $OD$. It is easy to prove
that $OC$ is the initial position of the direction of greatest compression and $OE$ is
that of the direction of greatest extension. These directions are mutually orthogonal.
The final position of $OC$ is $OC'$ where $C'$ is the reflexion of $E$ in $ON$; the final position
of $OE$ is $OE'$ where $E'$ is the reflexion of $C$ in $ON$. $OE'$, $OC'$ are of course the bisectors
of the angle between $OT'$ and $OD'$. Further, $CC' = DD' = EE' = \gamma$. We see therefore
that, as the twist increases, the principal axes of anisotropy $OE'$, $OC'$ rotate
from positions making $\tfrac{1}{4}\pi$ with $ON$, $TT'$ towards the limiting positions $ON$, $OT'$,
and further that they vary relatively to the element itself. The degree of the anisotropy
will also vary with progressive twisting, though not necessarily monotonically
because of a possible complex Bauschinger effect due to alternate compression and
extension.
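The rotation of the principal axes described above can also be obtained numerically. The sketch below does not reproduce the geometrical construction of figure 5; instead it assumes the standard finite-strain description of simple shear, with deformation gradient [[1, γ], [0, 1]] referred to the shear direction and its normal, and finds the final direction of greatest extension from the tensor $B = FF^{T}$. The function name is hypothetical.

```python
import math

def principal_directions_deg(gamma):
    """Final inclinations to the shear direction (degrees) of the fibres of
    greatest extension and greatest compression after a simple shear gamma > 0."""
    # B = F F^T for F = [[1, gamma], [0, 1]]
    bxx, bxy, byy = 1.0 + gamma * gamma, gamma, 1.0
    # principal angle of the larger eigenvalue of a symmetric 2x2 tensor
    extension = 0.5 * math.degrees(math.atan2(2.0 * bxy, bxx - byy))
    compression = extension + 90.0  # the two principal directions are orthogonal
    return extension, compression
```

The angle of the extension direction satisfies $\tan 2\phi = 2/\gamma$, so it starts near 45° for very small shear and tends to the shear direction as the twist increases, in agreement with the limiting positions stated in the text.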

Figure 5. Construction for directions of greatest extension and compression in pure torsion.

Figure 6. Co-ordinate system for analysis of the torsion of an anisotropic tube.

Suppose that at some stage during the torsion the $x$, $y$ axes of anisotropy make
an angle $\theta$ with the direction of twist and the cylinder axis (figure 6). As has been
shown $\theta$ increases steadily from $\tfrac{1}{4}\pi$ towards $\tfrac{1}{2}\pi$. The $x$, $y$ axes correspond respectively
to $C'O$ and $OE'$ in figure 5. $Oy$ thus marks the fibre direction. If $\tau$ is the shear stress,

$$\sigma_{xx} = -\sigma_{yy} = -\tau\sin 2\theta, \qquad \sigma_{xy} = \tau\cos 2\theta.$$

All other stress components are zero. The value of $\tau$ for which flow will continue is

$$\tau = [(F + G + 4H)\sin^2 2\theta + 2N\cos^2 2\theta]^{-1/2}. \qquad (19)$$





The anisotropic parameters continually change in magnitude during the twisting
because of work-hardening and varying anisotropy. The strain-increments are

$$\begin{aligned}
d\epsilon_{xx} &= -d\lambda\,(G + 2H)\,\tau\sin 2\theta,\\
d\epsilon_{yy} &= d\lambda\,(F + 2H)\,\tau\sin 2\theta,\\
d\epsilon_{zz} &= d\lambda\,(G - F)\,\tau\sin 2\theta,\\
d\epsilon_{xy} &= d\lambda\,N\,\tau\cos 2\theta.
\end{aligned} \qquad (20)$$

There is thus thinning or thickening of the cylinder wall. The strain-increment $d\epsilon_A$
in the axial direction is obtained by tensor transformation of the elements in (20):

$$d\epsilon_A = d\lambda\,\tau\sin 2\theta\,[(N - G - 2H)\sin^2\theta - (N - F - 2H)\cos^2\theta]. \qquad (21)$$

With an ideally isotropic cylinder both the wall thinning and axial extension are
of course zero.

Just after twisting is begun, when $\theta$ is a little larger than $\tfrac{1}{4}\pi$, the axial strain-increment
has the sign of $(F - G)$ or of $(X - Y)$. In §5 it was seen that in heavily
rolled metals the yield stress along the fibre is usually less than that across the fibre.
For small strains, when the anisotropy is still small, there seems to be no general
rule. The cylinder may therefore shorten or lengthen initially according to the metal
used. If for larger angles of twist we are entitled to assume as in §5 that $N > G + 2H$,
then $d\epsilon_A$ is finally positive. The most that can be said at present, then, is that
shortening or lengthening of a torsion specimen, or even reversal of axial strain, are
all a priori possible on the basis of the theory. Such changes in length have been
observed by Swift (1947).
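The step from the strain-increments (20) to the axial increment (21) can be verified numerically. The sketch below is a check under stated assumptions rather than part of the original analysis: the axial unit vector is taken as $(-\sin\theta, \cos\theta)$ in the $x$, $y$ axes (the orientation consistent with the stress components quoted before (19)), and the function name, the default $d\lambda = 1$ and any parameter values are illustrative.

```python
import math

def torsion_state(F, G, H, N, theta_deg, dlam=1.0):
    """Return (tau, axial increment by rotation of (20), axial increment by (21))."""
    th = math.radians(theta_deg)
    s2, c2 = math.sin(2.0 * th), math.cos(2.0 * th)
    # yield shear stress, equation (19)
    tau = ((F + G + 4.0 * H) * s2**2 + 2.0 * N * c2**2) ** -0.5
    # in-plane strain-increments, equations (20)
    exx = -dlam * (G + 2.0 * H) * tau * s2
    eyy = dlam * (F + 2.0 * H) * tau * s2
    exy = dlam * N * tau * c2
    # axial direction in the x, y axes (assumed orientation)
    ax, ay = -math.sin(th), math.cos(th)
    by_rotation = exx * ax**2 + eyy * ay**2 + 2.0 * exy * ax * ay
    by_eq21 = dlam * tau * s2 * ((N - G - 2.0 * H) * math.sin(th)**2
                                 - (N - F - 2.0 * H) * math.cos(th)**2)
    return tau, by_rotation, by_eq21
```

The two axial values agree for any parameter set, both vanish for the isotropic choice $F = G = H$, $N = 3F$, and at $\theta = 45°$ their common sign is that of $(F - G)$.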


7. The earing of deep-drawn cups

When a cup is deep drawn from a flat circular blank cut from rolled sheet, it is
often found that the height of the rim above the base is not uniform, as would be
expected in a symmetrical operation on an isotropic blank. Instead it is observed
that 'ears' (to use the technical terminology) form in positions symmetrically
situated with respect to the direction of rolling in the original sheet. Generally four
ears occur, either at the ends of the two diameters making 45° with the rolling direction,
or at the ends of the diameters making 0 and 90° with the rolling direction.
The positions and height of the ears which are observed in any given instance depend
among other things on the particular metal and on the prior mechanical and heat
treatment (see, for example, Wilson & Brick 1945). Both types of earing can be
produced in the same metal by suitably varying the treatment before drawing,
e.g. in copper and steel (Barrett 1943). With brass, six ears have also been observed
in the 0 and 60° positions (Cook 1937). In a few instances other earing positions
have been reported (e.g. Cook & Richards 1943). It is recognized that the presence
of earing is due to anisotropy in the rolled sheet, and several inconclusive attempts
have been made to correlate the observed behaviour with the crystal texture and
the mechanical properties of the material. The present approach is somewhat
different. The anisotropy is specified by the six parameters of the theory, whose
values are related in some complicated way to the previous treatment of the material.
The earing positions can, in principle, be calculated in terms of these parameters
and the stresses and strains characterizing the deep-drawing process.

Earing begins while the blank is being drawn towards the shoulder of the die, and
it is observed that the final positions of the ears coincide approximately with their
initial positions (Wilson & Brick 1945). It is probably sufficient, therefore, to analyze
the stress and strain distribution immediately after drawing begins, when the rim 
has just started to move towards the die aperture. A complete analysis of the whole 
process which allows adequately for the gradual formation of the ears is, in any event, 
hardly possible. In the initial stages of drawing, the action of the blank-holder must 
be considered in any calculation of the stresses. There are two main ways in which 
a blank-holder may be used: it may be held in a fixed position, or it may be pressed 
against the sheet by a constant load. Possibly a blank-holder may not even be used 
at all. It seems that the role of the blank-holder in connexion with earing has not 
been considered; the subsequent analysis indicates its importance. During the 
process the material eventually forming the wall of the cup is drawn towards the 
die aperture, undergoing circumferential compression and radial extension. A 
thickening of the sheet may also occur during this stage, depending on the constraint 
exerted by the blank-holder. Two extreme situations will be considered in the 
analysis. In the first the blank-holder is supposed fixed in a position such that the 
space between the holder and the die is equal to the original sheet thickness. In 
the second it is supposed either that no holder is used, or that the space between the 
holder and die is so much wider than the sheet thickness that negligible normal 
force is exerted in the early stages. If friction can be neglected, the first case corresponds
to a state of plane strain, and the second to a state of plane stress. The case
when a fixed load is applied to the blank-holder will lie somewhere between these 
extremes. 

It is assumed that the ears begin to form at those points on the rim where the
radial direction is one of the principal axes of the strain-increment. This seems
reasonable in view of the approximate symmetry of the deformation about the tip
of an ear. For consistency it must also be assumed that the hollows between the
ears begin to form at points having the same property. On the rim the circumferential
stress is the only non-zero stress component in the plane of the sheet.
Hence the positions of the ears and hollows correspond to the points at which the
principal axes of stress and strain-increment coincide.

Let axes of reference be chosen to coincide with the principal directions of anisotropy
in the sheet. The $z$ axis is taken normal to the blank, and the $x$ axis along the
direction of rolling. If the blank is drawn under conditions of plane strain, it follows
from the discussion in §4 that the ears and hollows can only form in the 0, 45 and 90°
positions. These are also the directions for which the uniaxial yield stress in plane
strain has maximum and minimum values. From elementary considerations it is
clear either that four ears form in the 0 and 90° positions with hollows in the 45°
positions, or that the reverse situation occurs. This will depend on the state of anisotropy.
If the drawing occurs under conditions of plane stress then the discussion
in §4 shows that the ears and hollows can form only in the 0 and 90° positions and
in positions making an angle $\phi$ with the rolling direction, where

$$\tan^2\phi = \frac{N - F - 2H}{N - G - 2H}. \qquad (22)$$

It must be remembered in deriving this equation that the state of stress on the rim
is a tangential compression, so that $\phi = 90° - \theta$, where $\theta$ is given by equation (14).
According to §4 the tangents at the points where the ears and hollows form are in
the directions for which the yield stress in uniaxial tension has stationary values.
Elementary considerations show, just as before, that there are four ears in the 0 and
90° positions, or in the $\phi$ positions. Notice that $\phi = 45°$ when $F = G$ (i.e. $X = Y$),
irrespective of the values of $N$ and $H$. If, however, $N$ is intermediate to $F + 2H$ and
$G + 2H$, $\phi$ is not real and only two ears form either in the 0 or the 90° positions.
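The plane-stress earing position of equation (22) is easily evaluated for trial parameters. The following sketch is illustrative only; the function name is hypothetical and the numerical inputs used in any call are placeholders, not values from the paper.

```python
import math

def earing_angle_deg(F, G, H, N):
    """Angle phi of equation (22) in degrees, or None when phi is not real."""
    num = N - F - 2.0 * H
    den = N - G - 2.0 * H
    if den == 0.0 or num / den < 0.0:
        return None  # N lies between F + 2H and G + 2H, or the ratio is undefined
    return math.degrees(math.atan(math.sqrt(num / den)))
```

With $F = G$ the ratio in (22) is unity and the function returns 45° whatever the values of $N$ and $H$ (provided $N \neq F + 2H$); with $N$ between $F + 2H$ and $G + 2H$ it returns `None`, corresponding to the two-ear case.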

It does not seem possible to distinguish between the possible arrangements of 
the ears in terms of the relative magnitudes of the anisotropic parameters, without 
a solution for the displacements on the rim. Such a solution has not yet been found. 
Failing this it is tempting to surmise that the ears and hollows form respectively at 
points where the tangents to the rim are in the directions of the minimum and 
maximum values of the uniaxial yield stresses. This could be tested by experiment. 
It would be essential to perform plane strain or plane stress tensile tests, according 
to the conditions of drawing. Summing up the above analysis, the theory indicates
that when the blank is drawn under conditions of plane strain, four ears will form in
the 0 and 90° positions or in the 45° positions. When ears are found in other positions,
this may be due either to friction or to the nature of the constraint applied by
the blank-holder.


The author is indebted to Dr E. Orowan, F.R.S., for suggesting the undertaking
of this research. The content of the paper owes much to many stimulating discussions 
with Dr Orowan during the course of the work. 

The author is also indebted to the Chief Scientific Officer, Ministry of Supply, for 
permission to publish this paper. 

References 

Barrett, C. S. 1943 The structure of metals, p. 443. McGraw-Hill.

Cook, M. 1937 J. Inst. Met. 60, 159.

Cook, M. & Richards, T. Ll. 1943 J. Inst. Met. 69, 201.

Cook, M. & Richards, T. Ll. 1947 J. Inst. Met. Paper 1061, 541.

Geiringer, H. 1937 Mémor. Sci. Math. 86.

Hill, R., Lee, E. H. & Tupper, S. J. 1947 Proc. Roy. Soc. A, 191, 278.

Körber, F. & Hoff, H. 1928 Mitt. K.-Wilh.-Inst. Eisenforsch. 10, 175.

Körber, F. & Siebel, E. 1928 Mitt. Eisenforsch. 10, 189.

Swift, H. W. 1947 Engineering, 163, 253.

Taylor, G. I. & Quinney, H. 1931 Phil. Trans. A, 230, 323.

von Mises, R. 1928 Z. angew. Math. Mech. 8, 161.

Wilson, F. H. & Brick, R. M. 1945 Metals Technol. 12.



CORRIGENDUM 

Proc. Roy. Soc. A, vol. 190, p. 370, equation (7) 
For A 8 read A a . 




Self-consistent field, with exchange, for nitrogen and sodium 
By D. R. Hartree, F.R.S. and the late W. Hartree
(Received 20 June 1947)


Wave functions for the normal configurations of neutral nitrogen and N⁻ have been calculated
by the method of the self-consistent field with exchange (Fock's equations). To the
accuracy of the approximation represented by these equations, the N⁻ ion would be unstable
and liable to auto-ionization, but it is estimated that a better approximation of the treatment
of a many-electron atom would give a small positive ionization potential for N⁻.

Revised wave functions for Na⁺ and the normal state of neutral Na have also been calculated.

Tables of results are given. 


[Note by D.R.H. At the outbreak of war in September 1939, calculation of wave
functions for a number of atoms, carried out by my father (W.H.), had been 
completed, but the results were still unpublished. Both he and I soon became 
occupied with other work, and the atomic structure calculations were laid aside. 
But to avoid the possibility of total loss of these results by enemy action, a copy
was sent to Professor J. C. Slater at M.I.T., with a suggestion that he should, if 
possible, take steps to get them published in the U.S.A. so as to make them 
generally available. Professor Slater entrusted to Dr Millard F. Manning the 
preparation of these results for publication, and most of them were published, 
with some additional results by Manning, in the Physical Review (Hartree, 
Hartree & Manning 1941 a, b, c). I would like to take this opportunity of expressing
my appreciation of the work of Dr Manning in this connexion. Dr Manning died in 
1942, leaving a few of these results still unpublished, and so they remained, since 
the scientific personnel of the U.S.A., as of Britain, was by then mainly engaged 
on work more directly related to the allied war effort. 

This paper presents the outstanding results for nitrogen (N⁻ and neutral) and
sodium (Na⁺). My father died in 1943 so that I have only his note-books to refer to
in writing up the results. But the work seems straightforward, with no unusual 
features, and there is little to do but record the results. 

Added in proof. Since writing this paper, I have found among my father's
papers some calculations (without exchange) for Au; the results will be published 
separately.] 

1. Nitrogen. Wave functions 

The normal electron configuration of neutral nitrogen (N⁰) is (1s)²(2s)²(2p)³,
giving terms ⁴S, ²D, ²P in that order (⁴S deepest); that of N⁻ is (1s)²(2s)²(2p)⁴
giving ³P, ¹D, ¹S. Apart from the value of the atomic number, the equations are the
same as for singly ionized oxygen and neutral oxygen respectively (Hartree, Hartree
& Swirles 1939, referred to as I). In the present work, superposition of configurations
was neglected, so that these equations are obtained by putting μ = 0 in equations
(6) and (7) of I. Then, for each of the configurations mentioned, the different states
are distinguished by the values of a single coefficient β, which has the following
values (see I, table 1):

N« N~ 

r -*-\ ‘ — " » 

<s »D *P *P »S 

ft -I —fa 0 -f -fa 0 
+ 

Results for oxygen (I, §5) show that the normalized radial wave functions P_N for
the different terms arising from a single configuration are nearly linear in β, so for
nitrogen the equations have only been solved for the extreme values of β for each
configuration. The results are given in table 1. If values of P_N are required for intermediate
values of β, they can be obtained by linear interpolation in β.
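The interpolation described here amounts to a pointwise linear blend of the two tabulated columns. A minimal sketch follows; the function name is hypothetical, and the β values and wave-function entries used in any call are placeholders chosen for illustration, not entries from table 1.

```python
def interpolate_in_beta(beta, beta_lo, p_lo, beta_hi, p_hi):
    """Linearly interpolate tabulated P_N values between two values of beta.

    p_lo and p_hi are sequences of P_N tabulated at the same radii for
    beta_lo and beta_hi respectively.
    """
    weight = (beta - beta_lo) / (beta_hi - beta_lo)
    return [a + weight * (b - a) for a, b in zip(p_lo, p_hi)]

# Placeholder inputs (not tabulated data): halfway between the two columns.
example = interpolate_in_beta(-0.5, -1.0, [0.25, 0.5], 0.0, [0.75, 1.0])
print(example)
```

Since the wave functions are only nearly linear in β, the interpolated values carry the same small error as that approximation.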

2. Nitrogen. Energy values 

The values of the various F and G integrals (see I, §2) and ε-parameters occurring
in the expression for the contribution to the energy from the (2s)(2p) shell are given
in table 2 (compare I, table 5); the coefficients with which these quantities occur in
this expression are given in I, table 2 (column headed E_A (μ = 0)). The values of the
ε-parameters in table 2 were obtained in the course of the determination of the wave
functions; the values of the F and G integrals were calculated from these wave
functions. For the intermediate values of β (those of the ²D state of N⁰ and the ¹D state of N⁻), values
of the F and G integrals and ε-parameters were estimated by linear interpolation
in β between the values in table 2.

From these quantities, the total energy of the (2s)(2p) shell was calculated; the
results are given in table 3. Since the values of some of the F integrals are multiplied
by factors up to 8, the third decimal of the individual tabulated values is hardly
significant, but it is retained since the calculated ratio of the inter-multiplet separations
is probably reliable to an accuracy represented by 2 units in this place.
Observed values of the ionization energy of the three states of neutral N are given in
table 3. No calculated values are available for comparison, since no calculations
were carried out for N⁺; the purpose of including these observed values is to give
a comparison between calculated and observed inter-multiplet separations.

The general character of the results for neutral N is very similar to that for O⁺
(see I, table 7); the calculated ²D−⁴S separation is slightly larger than that observed,
whereas the calculated ²P−²D separation is about 40 % larger than that observed,
so that the calculated value of the ratio (²P−²D)/(²D−⁴S) is 0.68, as compared with
the observed value 0.50; to the accuracy of the calculated values, these figures are
the same as for O⁺.
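The ratios of inter-multiplet separations can be recomputed from the entries of table 3. The sketch below is only an arithmetic check: the numbers are transcribed from the table as scanned, and the calculated ⁴S energy is taken as −9.651 atomic units, a reading that is an assumption (it is the value consistent with the energy difference quoted in the text). The dictionary names are illustrative.

```python
# Calculated total energies of the (2s)(2p) shell for neutral N (atomic units).
# The 4S entry is an assumed reading of the scanned table.
calculated = {"4S": -9.651, "2D": -9.559, "2P": -9.496}

# Observed ionization energies of the same three states (atomic units).
observed_ionization = {"4S": 0.535, "2D": 0.447, "2P": 0.403}

# Calculated separations are differences of total energies.
calc_ratio = ((calculated["2P"] - calculated["2D"])
              / (calculated["2D"] - calculated["4S"]))

# Observed separations are differences of ionization energies
# (a deeper state has the larger ionization energy).
obs_ratio = ((observed_ionization["2D"] - observed_ionization["2P"])
             / (observed_ionization["4S"] - observed_ionization["2D"]))

print(round(calc_ratio, 2), round(obs_ratio, 2))
```

On these readings the calculated ratio is close to 0.68 and the observed ratio is 0.50.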

The results in table 3 show that, to the accuracy of the approximation on which
these calculations are based, the N⁻ ion is unstable in the sense already defined elsewhere
(Hartree & Hartree 1938, §1), namely that it is possible to find a solution to
the equations, but the total energy of the configuration is greater than that for the
neutral atom, so that the ion would be liable to auto-ionization. However, the
calculated energy difference (N⁰, ⁴S) − (N⁻, ³P) is only −0.0641 atomic unit, and
the results for oxygen suggest that this calculated value is likely to be too low by
0.08 atomic unit and probably more. Hence it seems probable that the lowest state,
at least, of N⁻ is stable; whether the higher states arising from the normal configuration
are stable or not is quite uncertain.

These results emphasize the importance of using as good an approximation to the
general form of the wave function of a many-electron atom as is practicable, in
calculations referring to negative ions. Fock's equations for the self-consistent field
with exchange do not form a good enough approximation for the purpose; it is
probably necessary explicitly to include in some way the dependence of the wave
function on the mutual distances r_ij between the electrons, at least between those
occupying the outer wave functions such as the (2p) wave functions of O⁻ or N⁻.
The development of a means of doing this which leads to a practicable computing
process is very much required in this context.

3. Sodium (Na⁺)

Sodium was the first atom for which solutions of Fock's equations were evaluated,
this pioneer work being done by Fock & Petrashen (1934). Work on Na⁻ (Hartree &
Hartree 1938) gave results for the core wave functions [(1s), (2s) and (2p)] which
differed from Fock & Petrashen's wave functions for Na⁺ by considerably more than
was expected as a result of adding the (3s)² group to form Na⁻, and considerably
more, also, than the perturbation of the K⁺ core on adding the (4s)² group to form K⁻.
Further, the differences between Fock & Petrashen's results for Na⁺ and ours for
Na⁻ were rather irregular, and separate examination of the two sets of results traced
these irregularities to the Na⁺ results.

Fock & Petrashen evaluated the solutions of the inhomogeneous differential
equations for the radial wave functions with exchange by a method based on the
use of the Green's function. This, while of course formally correct, does not seem the
most suitable method for practical computation, and in view of the irregularities
already mentioned, it seemed advisable to recalculate Na⁺ by straightforward
numerical integration. This, it must be emphasized, implies no criticism of Fock &
Petrashen's work which, because of its pioneer character, was necessarily done
without previous experience of handling these equations, and was a considerable
achievement. Since then, however, much experience has been gained in the course
of evaluating solutions of these equations for other atoms, and it seemed advisable
to revise the results for Na⁺ by calculations done in the light of all this experience.

The results are given in table 4, and show only small departures (0.002 at most
in P_N) from those for Na⁻, though the values of ε are considerably different on account
of the contribution to the potential, in the region of the core of Na⁻, from the (3s)²
group. The differences from Fock & Petrashen's wave functions are, however, quite
appreciable, up to 0.017 in P(2s) and 0.014 in P(2p). In view of the discrepancies
from Fock & Petrashen's results, the work was done with particular care, and it is
believed that the present results are correct to the accuracy of the calculations.

The (3s) wave function for Na⁰ was also calculated, taking the core wave functions
tabulated in table 4; the result is included in table 4. It was found that to secure
orthogonality of P(3s) to P(2s) a small but appreciable non-zero value of the non-diagonal
parameter ε_{2s,3s} had to be taken (ε_{2s,3s} = 0.0045).

Tables of results are given. 


Table 1. Nitrogen. Normalized wave functions 


neutral N 


( --- 

fl~-t (Estate) /7=0(*P 


r 

P(U) 

P( 2s) 

P(2p) 

P12$) 

0*00 

0 

0 

0 

0 

0*01 

0-327 

0-072 

0*0009 

0*073 

0*02 

0*611 

0*134 

0*0037 

0*130 

0*03 

0*856 

0-187 

0*0080 

0*190 

0*04 

1064 

0*232 

0*0137 

0*235 

0*06 

1*389 

0*300 

0*029 

0-304 

0*08 

1*615 

0*343 

0*048 

0*347 

0*10 

1*761 

0*365 

0*070, 

0*370 

0*12 

1*845 

0*371 

0-095* 

0-370 

0*14 

1*880 

0*363 

0*122 

0*308 

0*16 

1*878 

0*344 

0*150 

0*349 

0*18 

1*848 

0*316 

0*179 

0*320 

0*20 

1*797 

0*281 

0*209 

0*285 

0*25 

1-613 

0*172 

0*283 

0*174 

0*30 

1*393 

0*047 

0*356 

0*047 

0*35 

1*173 

-0*083 

0*425 

-0*084 

0*40 

0-969 

-0*210 

0*488 

-0*212 

0*45 

0*789 

-0*328 

0*546 

-0*332 

0-50 

0*636 

-0*436 

0*597 

-0*441 

0*6 

0*404 

-0*614 

0*680 

-0*021 

0*7 

0-260 

-0*742 

0*739 

-0*750 

0*8 

0*153 


0*770 

-0*833 

0*9 

0*092 


0*795 

-0*878 

1*0 

0*055 

-0*889 

0*800 

-0*894 

M 

0*033 

-0*884 


-0*888 

1*2 

0*019 


0-777 

-0*865 

1*4 

0*007 

-0*788 


-0*788 

1*6 

0*003 


0*663 

-0*092 

1*8 

0*001 

-0*597 



2*0 

— 




22 

— 




2*4 

— 

-0*349 


-0*342 

2-6 

— 


0*340 


2*8 

— 


0*297 

-0-228 

3*0 

— 

-0*189 

0*254 

-0*184 


N" 


- % 

t —-- 



* "" ■\ 

state) 

/?= -f(*P state) 

ft sz 0 (*S state) 

P(2p) 

pm 

pm 

pm 

pm 

0 

0 

0 

0 

0 

0*0009 

0*070 

0*0008 

0*071 

0*0008 

0*0036 

0*130 

0-0032 

4 0*132 

0*0031 

0*0079 

0*182 

0*0070 

0*184 

0-0068 

0*0135 

0-225 

0*0120 

0*228 

0*0117 

0-0285 

0*291 

0-025 

0*294 

0*024, 

0*047 5 

0-333 

0*042 

0*336 

0-041 

0*069, 

0-355 

O*O01 5 

0*359 

' 0*000 

0*004 

0-360 

0*083 

0*304 

0*081, 

0 *120, 

0*353 

0-106 

0*357 

0-104 

0*148 

0*334 

0-181 

0-338 

0*128 

0-176, 

0*307 

0*150 

0-310 

0-153 

0*206 

0*273 

0*182 

0*276 

0*178 

0*270 

0*168 

0*247 

0*169 

0*242 

0*350 

0*046 

0*310 

0*040 

0*304 

0*417 

- 0*080 

0*370 

-0*081 

0*363 

0*479 

-0*203 

0-426 

-0*205 

0*417 

0*535 

-0*318 

0*476 

-0*821 

0*407 

0-585 

-0*423 

0*521 

-0*427 

0*611 

0*005 

-0*595 

0*595 

-0*602 

0*582 

0*722 

-0*720 

0*649 

-0*728 

0*634 

0*758 

-0*803 

0*084 

-0*811 

0*668 

0*776 

-0*849 


-0*850 

0*689 

0*781 

— 0*867 

0*715 

-0*874 

0*698 

0*775 

-0*864 

0*715 

-0*870 

0*698 

0*701 

-0*846 

0*708 


0*090 

0*717 

-0*780 


-0*782 

0*663 


-0*695 

0*638 

-0*093 

0*624 

0*596 


0*592 


0*582 


-0*520 

0*545 

-0*515 

0*538 

EZE9 


0*498 

-0*436 

0*496 

0*417 

-0*374 


-0*300 


0*366 

-0*314 

0*414 

-0*300 

0*413 


-0*263 

0*370 

-0*255 

0*333 

0*278 

-0*220 

0*342 

— 0*211 

0-852 





































Table 1 (cont.)


3- 2 
3*4 
3*0 
3*8 
4*0 

4- 5 
5*0 
5*5 
0*0 
0*5 
7*0 

8 

9 

10 

11 

12 

14 

16 

13 




neutral N 

A 




N~ 



/?=-f(«S State) 

(*P state) 

state) 

yj=0 ( l S state) 

P(IB) 

pm 

P(2p) 

pm 

pm 

pm 

P(2p) 

pm 

pm 

— 

— 0*154 

0*210 

-0*149 

0*241 

-0*183 

0*310 

-0*175 

0*323 

— 

-0*123 

0*184 

-0*119 

0*208 

-0*153 

0*282 

-0*145 

0*290 

— 

-0*099 

0*156 

-0*095 

0*180 

-0*127 

0*256 

-0*120 

0*272 

— . 

-0*079 

0*132 

-0*076 

0*155 

-0*100 

0*232 

-0*100 

0*250 

— 

-0*003 

0*111 

-0*060 

0*133 

-0*088 

0*210 

-0*083 

0*230 

— 

-0*030 

0*072 

-0*034 

0*090 

-0*055 

0*100 

-0*052 

0*189 

— 

-0*020 

0*040 

-0*019 

0*000 

-0*034 

0*131 

-0*032 

0*155 

— 

-0*011 

0*030 

-0*011 

0*040 

-0*021 

0*103 

-0*020 

0*127 

— 

-0*000 

0*019 

-0*006 

0*026 

-0*013 

0*082 

-0*013 

0*105 

— 

-0*003 

0*012 

-0*003 

0*017 

-0*008 

0*005 

-0*008 

0*087 

— 

-0*002 

0*008 

-0*002 

0*011 

-0*005 

0*062 

-0*005 

0*072 

— 

— 

0*003 

— 

0*005 

-0*002 

0*033 

-0*002 

0*049 

— 

— 

0*001 

— 

0*002 

-0*001 

0*021 

-0*001 

0*034 

— 

— 

— 

— 

0*001 

— 

0*013 

— 

0*024 

— 

— 

— 

— 

— 

— 

0*008 

— 

0*010 

— 

— 

— 

— 

— 

— 

0*005 

— 

0*011 

— 

— 

— 

— 

— 

— 

0*002 

—. 

0*005 

— 

— 

— 

— 

— 

— 

0-001 

— 

0002 

— 

— 

— 

— 

— 

—- 

— 

— 

0*001 


Table 2. Nitrogen. F and G integrals and ε values



N° 

A 


N" 

A 


t 

> 

r ‘ 

^ , 


4 S 

S P 

sp 

IS 


-i 

0 


0 

F_0(2s, 2s)

0*085 

0*690 

0*059 

0*006 

F_0(2s, 2p)

0*008 

0*058 

0*594 

0*583 

F_0(2p, 2p)

0*055 

0*030 

0*542 

0*518 

F_2(2p, 2p)

0*294 

0*280 

0*231 

0*217 

G_1(2s, 2p)

0*410 

0*401 

0*354 

0*344 

ε_{2s,2s}

1*890 

1*958 

1*161 

1*250 

ε_{2p,2p}

1*126 

0*941 

0*187 

0*107 


Table 3. Nitrogen. Energies 


calculated total energy observed ionization 
of (2s) (2p) shell energy 


N° 

4 S 

-9*051 


*D 

-9*559 


»P 

-9*496 


sp 

-9*587 


l D 

- 9*533 


l S 

-9*444 


0*535 

0*447 

0*403 


88 

44 



304 D. R. Hartree and the late W. Hartree 


Table 4. Sodium. Normalized wave functions 




Na + - 

A 


Xa» 


2 Ja + 

_A - ■- 

Na° 

r 

PM 

pm 

pm 

pm 

r 

pm 

P {2 p ) 

pm 

0-00 

0 

0 

0 

0 

3*2 

-0*013 

0*035 

0*508 

0*01 

0*629 

0*154 

0*004 

0*023 

3*4 

-0*009 

0*026 

0*510 

0*02 

1*128 

0*274 

0*014 

0*041 

3-6 

-0*006 

0*019 

0*508 

0*03 

1*617 

0*360 

0*030 

0*055 

3*8 

-0*004 

0*014 

0*502 

0*04 

1*814 

0*433 

0*050 

0*065 

4*0 

— 0*002* 

0*010 

0*493 

0*06 

2*192 

0*507 

0*102 

0*076 

4*5 

-0*001 

0*004, 

0*458 

0*08 

2*366 

0*517 

0*163 

0*077 

5*0 

— 

0*002 

0*414 

0*10 

2*377 

0*481 

0*231 

0*071 

5*5 

— 

0*001 

0*366 

0*12 

2*305 

0*412 

0*302 

0*060 

00 

— 

0 *000 5 

0*318 

0*14 

2*176 

0*320 

0*374 

0*040 

6*5 

— 

— 

0*272 

0*16 

2-012 

0*216 

0*444 

0*030 

7*0 

— 

— 

0*230 

0*18 

1*834 

0*102 

0*512 

0*013 

7*5 

— 

— 

0*192 

0*20 

1*052 

-0*015 

0*578 

- 0*005 

8*0 

— 

— 

0*160 

0*25 

1*227 

-0*301 

0*724 

-0*049 

9 


— 

0*108 

0*30 

0*878 

-0*666 

0*842 

-0*087 

10 

— 

— 

0*072 

0*36 

0*614 

-0*763 

0*933 

-0*117 

11 

— 

— 

0*047 

0*40 

0*422 

-0*923 

0*997 

-0*138 

12 

— 

— 

0*030 

0*45 

0*287 

-1*037 

1*039 

-0*152 

13 

— 

— 

0*019 

0*60 

0*194 

—1*112 

1*001 

-0*158 

14 

— 

— 

0*012 

0*6 

0*088 

— 1*169 

1*001 

-0*153 

10 

— 

— 

0*005 

0*7 

0*040 

-1*139 

1*021 

-0*131 

18 

— 

— 

0*002 

0*8 

0 *0; 8 

-1*001 

0*056 

-0*097 

20 


— 

0*001 

0*0 

0*008 

-0*968 

0*880 

-0*058 





1*0 

0*003 

-0*846 

0*799 

-0*015 





M 

0*001 

-0*736 

0*718 

+ 0*028 




Na° 

1*2 

— 

-0*032 

0*640 

0*072 


(— 

A 

, . ^ 


1*4 

_ 

-0*466 

0*601 

0*156 


pm 

P{ 2 e) P( 2 p) 

pm 

1*0 

_ 

-0*319 

0*385 

0*232 

r 

r 

r r * 

r 

1*8 

_ 

-0*220 

0*292 

0*297 

0*00 

70*25 

17 16 3892 

2*67 5 

2*0 

-- 

-0*149 

0*219 

0*354 

0*01 

02*94 

15-30 36-85 

2*30, 

2*2 


-0*101 

0*163 

0*400 

0*02 

50*41 

13-71 34-91 

2*06 

2*4 

_ 

-0*008 

0*121 

0*437 

0*03 

50*57 

12-21 33-09 

1*83 

2*0 

— 

-0*046 

0*089 

0*400 

0*04 

45*35 

10-84 31-38 

1*62 

2*8 

_ 

-0*030 

0*005 

0*487 





3*0 

— 

-0*020 

0*048 

0*501 

€ nl , Hi 

81*46 

0-16„ 3-6Q, 

0*301 


References

Fock, V. & Petrashen, M. 1934 Phys. Z. Sowjet. 6, 308.

Hartree, D. R. & Hartree, W. 1938 Proc. Camb. Phil. Soc. 84, 350.

Hartree, D. R., Hartree, W. & Manning, M. F. 1941a Phys. Rev. 59, 299.

Hartree, D. R., Hartree, W. & Manning, M. F. 1941b Phys. Rev. 59, 300.

Hartree, D. R., Hartree, W. & Manning, M. F. 1941c Phys. Rev. 60, 857.

Hartree, D. R., Hartree, W. & Swirles, B. 1939 Phil. Trans. A, 238, 229.



The accuracy of atomic co-ordinates derived from Fourier 
series in X-ray crystallography. V 

By A. D. Booth and K. H. V. Britten


(Communicated by O. N. Watson, F.R.S.—Received 1 October 1947)


In the first paper of this series, the effect of experimental errors in the Fourier coefficients
upon the derived atomic co-ordinates was investigated. The assumption was made that the
probable errors were independent of the magnitudes of their parent Bragg reflexions.

It has been suggested that a more accurate assumption would be to take the probable
error of any coefficient as being proportional to the magnitude of that coefficient. The present
paper develops the theory on this basis, and a solution in closed form is obtained.

The question of how the experimental error in the co-ordinates varies with the position
of the atom in relation to its neighbours is also investigated, and it is shown that the variation
is much smaller than that previously derived for finite summation errors.

1. In the first paper of this series (Booth 1947a) it was shown that if, in the usual
Fourier series

$$D(x,y,z) = \frac{1}{V}\sum_{h=-H}^{+H}\sum_{k=-K}^{+K}\sum_{l=-L}^{+L} |F(h,k,l)|\cos\left[2\pi\left(\frac{hx}{a}+\frac{ky}{b}+\frac{lz}{c}\right)-\alpha(h,k,l)\right] \tag{1.1}$$

for the electron density D(xyz) at any point (xyz) in a crystal lattice, the |F| as
determined by experiment are subject to a probable error Δε, the probable error in
the atomic co-ordinate is given by

$$\epsilon \simeq \frac{0.66\,\Delta\epsilon}{N\sqrt{V}}, \tag{1.2}$$

where N is the atomic number of the particular atom.

The treatment which led to the above result was intended to give only an upper
limit to the probable errors ε_m, etc., and whilst this is adequate in overall assessments
of the excellence of a set of atomic co-ordinates it is often desirable to know which
atoms, in any particular structure, are the most subject to experimental error.
In addition the treatment has been criticized for assuming that the experimental
error, Δε, is independent of the intensity of the parent plane.

It is the purpose of this paper to extend the treatment to the case in which errors
are assumed to be proportional to the |F| values of the parent reflexions, and also
to investigate the manner in which the error varies with the position of an atom in
the unit cell.


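2. It was shown, in the treatment leading to (1.2), that an exact expression
for ε_x is

$$\epsilon_x = \frac{1}{NVa}\left(\frac{\pi}{p}\right)^{5/2}\left\{\sum_{-H,-K,-L}^{+H,+K,+L} h^2\,(\Delta F)^2\sin^2(\theta-\alpha)\right\}^{1/2}, \tag{2.1}$$

where ΔF is the experimental error in the value of F(hkl), θ = 2π(hx/a + ky/b + lz/c),
α = α(h, k, l) is the phase angle of the plane (hkl), and the atomic density
distributions near to atoms are assumed to be given by

$$d(x,y,z) = N\left(\frac{p}{\pi}\right)^{3/2}\exp\left(-p\{(x-x_1)^2+(y-y_1)^2+(z-z_1)^2\}\right), \tag{2.2}$$

where N is the atomic number of the particular atom and p is approximately
constant for light atoms.

Similar expressions hold for ε_y and ε_z.

If the proportionality of errors and Fourier coefficient envisaged in § 1 is made,
so that

$$\Delta F = P\epsilon\,F, \tag{2.3}$$

where Pε is constant, (2.1) reduces to

$$\epsilon_x = \frac{P\epsilon}{NVa}\left(\frac{\pi}{p}\right)^{5/2}\left\{\sum_{-H,-K,-L}^{+H,+K,+L} h^2F^2\sin^2(\theta-\alpha)\right\}^{1/2}. \tag{2.4}$$

Before evaluating the summation, further reduction has to be made to eliminate
F and α. It was shown in part II that

$$\tan\alpha = \sum_{r=1}^{N} f_r\sin\theta_r \Big/ \sum_{r=1}^{N} f_r\cos\theta_r, \qquad F = \left\{\left(\sum_{r=1}^{N} f_r\sin\theta_r\right)^2 + \left(\sum_{r=1}^{N} f_r\cos\theta_r\right)^2\right\}^{1/2}, \tag{2.5}$$

where θ_r is the value of θ at the centre (x_r, y_r, z_r) of the rth atom. Whence

$$F\sin\alpha = \sum_{r=1}^{N} f_r\sin\theta_r, \qquad F\cos\alpha = \sum_{r=1}^{N} f_r\cos\theta_r, \tag{2.6}$$

where, for the distribution (2.2),

$$f_r = N_r\exp\left(-\frac{\pi^2}{p}\left[\frac{h^2}{a^2}+\frac{k^2}{b^2}+\frac{l^2}{c^2}\right]\right) \tag{2.7}$$

and the unit cell is assumed to contain N independent atoms.
Substitution from (2.6) in (2.4) yields

$${}_m\epsilon_x = \frac{P\epsilon}{N_mVa}\left(\frac{\pi}{p}\right)^{5/2}\left\{\sum_{-H,-K,-L}^{+H,+K,+L} h^2\left[\sum_{r=1}^{N} f_r\sin(\theta-\theta_r)\right]^2\right\}^{1/2}, \tag{2.8}$$

the suffix m indicating that the error attached to the mth atom is being considered.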
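The elimination of F and α rests on the identity F sin(θ − α) = Σ f_r sin(θ − θ_r), which follows from the two sums for F sin α and F cos α. The sketch below checks that identity numerically for Gaussian atoms. The cell edges, atomic positions and the value p = 4.69 are illustrative assumptions and are not taken from the paper; the function names are likewise hypothetical.

```python
import math

def theta(hkl, pos, cell):
    """Phase 2*pi*(h*x/a + k*y/b + l*z/c) of reflexion hkl at the point pos."""
    return 2.0 * math.pi * sum(i * x / edge for i, x, edge in zip(hkl, pos, cell))

def f_gauss(n_r, hkl, cell, p):
    """Scattering factor N_r*exp(-(pi^2/p)*(h^2/a^2 + k^2/b^2 + l^2/c^2)) of a Gaussian atom."""
    s2 = sum((i / edge) ** 2 for i, edge in zip(hkl, cell))
    return n_r * math.exp(-(math.pi ** 2 / p) * s2)

def amplitude_and_phase(hkl, atoms, cell, p):
    """Return (F, alpha) such that F*sin(alpha) and F*cos(alpha) equal the two atomic sums."""
    s = sum(f_gauss(n, hkl, cell, p) * math.sin(theta(hkl, r, cell)) for n, r in atoms)
    c = sum(f_gauss(n, hkl, cell, p) * math.cos(theta(hkl, r, cell)) for n, r in atoms)
    return math.hypot(s, c), math.atan2(s, c)

def both_sides(hkl, point, atoms, cell, p):
    """Return F*sin(theta - alpha) and sum_r f_r*sin(theta - theta_r), both evaluated at `point`."""
    big_f, alpha = amplitude_and_phase(hkl, atoms, cell, p)
    th = theta(hkl, point, cell)
    lhs = big_f * math.sin(th - alpha)
    rhs = sum(f_gauss(n, hkl, cell, p) * math.sin(th - theta(hkl, r, cell)) for n, r in atoms)
    return lhs, rhs

# Hypothetical orthogonal cell (edges in angstroms) and atoms (atomic number, position).
CELL = (6.0, 4.0, 12.0)
ATOMS = [(6, (0.5, 1.0, 2.0)), (8, (1.7, 2.2, 3.1)), (6, (4.0, 0.3, 9.0))]
P = 4.69  # assumed Gaussian parameter p, in reciprocal square angstroms
```

Calling `both_sides((2, 1, 3), ATOMS[0][1], ATOMS, CELL, P)` returns two numbers that should agree to rounding error, whichever reflexion and point are chosen.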

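In view of the rapid convergence of the function f_r defined by (2.7), the error
involved in taking the limits (HKL) of (2.8) as infinite will be small, and the
experimental errors will be nearly independent of the wave-length of the X-rays used to
obtain the experimental data. Thus, using (2.7),

$${}_m\epsilon_x = \frac{P\epsilon}{N_mVa}\left(\frac{\pi}{p}\right)^{5/2}\left\{\sum_{-\infty}^{+\infty} h^2\exp\left(-\frac{2\pi^2}{p}\left[\frac{h^2}{a^2}+\frac{k^2}{b^2}+\frac{l^2}{c^2}\right]\right)\left[\sum_{r=1}^{N} N_r\sin(\theta-\theta_r)\right]^2\right\}^{1/2}. \tag{2.9}$$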

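An approximate value for the sum of the series can be found in the form of an
integral. Denoting the expression within the braces { } by S,

$$S \simeq \iiint_{-\infty}^{+\infty} h^2\left[\sum_{r=1}^{N} N_r\sin(\theta-\theta_r)\right]^2\exp\left(-\frac{2\pi^2}{p}\left[\frac{h^2}{a^2}+\frac{k^2}{b^2}+\frac{l^2}{c^2}\right]\right)dh\,dk\,dl. \tag{2.10}$$

Now

$$\begin{aligned}\left[\sum_{r=1}^{N} N_r\sin(\theta-\theta_r)\right]^2 &= \sum_{r=1}^{N} N_r^2\sin^2(\theta-\theta_r) + 2\sum_{r>s} N_rN_s\sin(\theta-\theta_r)\sin(\theta-\theta_s)\\ &= \tfrac{1}{2}\sum_{r=1}^{N} N_r^2\,[1-\cos 2(\theta-\theta_r)] + \sum_{r>s} N_rN_s\,[\cos(\theta_r-\theta_s)-\cos(\theta-\theta_r+\theta-\theta_s)],\end{aligned} \tag{2.11}$$

from which it is seen that S (2.10) consists of four parts:

$$S = I_1 - I_2 + I_3 - I_4, \tag{2.12}$$

where, writing E for exp(−(2π²/p)[h²/a² + k²/b² + l²/c²]),

$$\begin{aligned} I_1 &= \tfrac{1}{2}\sum_{r=1}^{N} N_r^2\iiint_{-\infty}^{+\infty} h^2E\,dh\,dk\,dl,\\ I_2 &= \tfrac{1}{2}\sum_{r=1}^{N} N_r^2\iiint_{-\infty}^{+\infty} h^2\cos 4\pi\left[\frac{h}{a}(x-x_r)+\frac{k}{b}(y-y_r)+\frac{l}{c}(z-z_r)\right]E\,dh\,dk\,dl,\\ I_3 &= \sum_{r>s} N_rN_s\iiint_{-\infty}^{+\infty} h^2\cos 2\pi\left[\frac{h}{a}(x_r-x_s)+\frac{k}{b}(y_r-y_s)+\frac{l}{c}(z_r-z_s)\right]E\,dh\,dk\,dl,\\ I_4 &= \sum_{r>s} N_rN_s\iiint_{-\infty}^{+\infty} h^2\cos 2\pi\left[\frac{h}{a}(x-x_r+x-x_s)+\frac{k}{b}(y-y_r+y-y_s)+\frac{l}{c}(z-z_r+z-z_s)\right]E\,dh\,dk\,dl.\end{aligned} \tag{2.13}$$

3. It is seen that all the integrals (2.13) can be derived from

$$J = \iiint_{-\infty}^{+\infty} h^2\cos(Xh+Yk+Zl)\exp(-eh^2-fk^2-gl^2)\,dh\,dk\,dl. \tag{3.1}$$

Now

$$J = \int_{-\infty}^{+\infty}\exp(-gl^2)\int_{-\infty}^{+\infty}\exp(-fk^2)\int_{-\infty}^{+\infty} h^2\cos(Xh+\mu)\exp(-eh^2)\,dh\,dk\,dl, \tag{3.2}$$

where

$$\mu = Yk + Zl. \tag{3.3}$$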

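Considering the first integration,

$$\int_{-\infty}^{+\infty} h^2\cos(Xh+\mu)\exp(-eh^2)\,dh = \int_{-\infty}^{+\infty} h^2\cos Xh\,\cos\mu\,\exp(-eh^2)\,dh,$$

since the integral involving sin(Xh) is zero on account of the antisymmetry of the
integrand. Now

$$\int_{-\infty}^{+\infty}\cos(Xh)\exp(-eh^2)\,dh = \sqrt{\frac{\pi}{e}}\exp\left(-\frac{X^2}{4e}\right), \tag{3.4}$$

whence, differentiating both sides with respect to e,

$$\int_{-\infty}^{+\infty} h^2\cos(Xh)\exp(-eh^2)\,dh = \frac{1}{2}\sqrt{\frac{\pi}{e^3}}\left[1-\frac{X^2}{2e}\right]\exp\left(-\frac{X^2}{4e}\right). \tag{3.5}$$

Thus

$$J = \frac{1}{2}\sqrt{\frac{\pi}{e^3}}\left[1-\frac{X^2}{2e}\right]\exp\left(-\frac{X^2}{4e}\right)\iint_{-\infty}^{+\infty}\cos(Yk+Zl)\exp(-fk^2-gl^2)\,dk\,dl, \tag{3.6}$$

and, using (3.4) with suitable changes of symbols,

$$J = \frac{1}{2}\sqrt{\frac{\pi^3}{e^3fg}}\left[1-\frac{X^2}{2e}\right]\exp\left(-\frac{1}{4}\left[\frac{X^2}{e}+\frac{Y^2}{f}+\frac{Z^2}{g}\right]\right). \tag{3.7}$$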
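The one-dimensional step of this reduction, the integral of h² cos(Xh) exp(−eh²) obtained by differentiating (3.4) with respect to e, can be checked by direct quadrature. The sketch below compares a plain trapezoidal sum with the closed form ½ √(π/e³) [1 − X²/2e] exp(−X²/4e). The function names, the integration range and the step count are choices made for this illustration only.

```python
import math

def closed_form(x_coef, e):
    """(1/2)*sqrt(pi/e^3)*(1 - X^2/(2e))*exp(-X^2/(4e))."""
    return (0.5 * math.sqrt(math.pi / e ** 3)
            * (1.0 - x_coef ** 2 / (2.0 * e))
            * math.exp(-x_coef ** 2 / (4.0 * e)))

def trapezoid(x_coef, e, half_width=12.0, steps=20000):
    """Trapezoidal estimate of the integral of h^2*cos(X*h)*exp(-e*h^2) over a wide finite range."""
    limit = half_width / math.sqrt(e)   # the integrand is negligible beyond this point
    dh = 2.0 * limit / steps
    total = 0.0
    for i in range(steps + 1):
        h = -limit + i * dh
        weight = 0.5 if i in (0, steps) else 1.0
        total += weight * h * h * math.cos(x_coef * h) * math.exp(-e * h * h)
    return total * dh
```

For example, `trapezoid(1.3, 0.7)` and `closed_form(1.3, 0.7)` should agree closely; the same check with X = 0 reproduces the familiar value ½ √(π/e³).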

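4. Using the integral (3.7), with e = 2π²/pa², f = 2π²/pb², g = 2π²/pc² and V = abc,
the values of (2.13) can be written down:

$$\begin{aligned} I_1 &= \frac{a^2V}{4\pi}\left(\frac{p}{2\pi}\right)^{5/2}\sum_{r=1}^{N} N_r^2,\\ I_2 &= \frac{a^2V}{4\pi}\left(\frac{p}{2\pi}\right)^{5/2}\sum_{r=1}^{N} N_r^2\,[1-4p(x-x_r)^2]\exp\left(-2p[(x-x_r)^2+(y-y_r)^2+(z-z_r)^2]\right),\\ I_3 &= \frac{a^2V}{2\pi}\left(\frac{p}{2\pi}\right)^{5/2}\sum_{r>s} N_rN_s\,[1-p(x_r-x_s)^2]\exp\left(-\frac{p}{2}[(x_r-x_s)^2+(y_r-y_s)^2+(z_r-z_s)^2]\right),\\ I_4 &= \frac{a^2V}{2\pi}\left(\frac{p}{2\pi}\right)^{5/2}\sum_{r>s} N_rN_s\,[1-p(x-x_r+x-x_s)^2]\\ &\qquad\times\exp\left(-\frac{p}{2}[(x-x_r+x-x_s)^2+(y-y_r+y-y_s)^2+(z-z_r+z-z_s)^2]\right).\end{aligned} \tag{4.1}$$

Whence, from (4.1), (2.12), (2.10) and (2.9),

$$\begin{aligned}{}_m\epsilon_x = \pm\frac{P\epsilon}{2N_m\sqrt{\pi V}}\left(\frac{\pi}{2p}\right)^{5/4}\Bigg\{&\sum_{r=1}^{N} N_r^2\Big(1-[1-4p(x_m-x_r)^2]\exp\left(-2p[(x_m-x_r)^2+(y_m-y_r)^2+(z_m-z_r)^2]\right)\Big)\\ &+2\sum_{r>s} N_rN_s\Big([1-p(x_r-x_s)^2]\exp\left(-\tfrac{p}{2}[(x_r-x_s)^2+(y_r-y_s)^2+(z_r-z_s)^2]\right)\\ &\quad-[1-p(x_m-x_r+x_m-x_s)^2]\\ &\quad\times\exp\left(-\tfrac{p}{2}([x_m-x_r+x_m-x_s]^2+[y_m-y_r+y_m-y_s]^2+[z_m-z_r+z_m-z_s]^2)\right)\Big)\Bigg\}^{1/2},\end{aligned} \tag{4.2}$$

with exactly similar expressions for _mε_y and _mε_z.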



For the purpose of calculation it is more convenient to consider, not the
components of error, ε_x, ε_y and ε_z, but the total error ε_m, defined by

$$\epsilon_m^2 = {}_m\epsilon_x^2 + {}_m\epsilon_y^2 + {}_m\epsilon_z^2. \tag{4.3}$$

Equations (4.2) show that

$$\begin{aligned}\epsilon_m = \pm\frac{P\epsilon}{2N_m\sqrt{\pi V}}\left(\frac{\pi}{2p}\right)^{5/4}\Bigg\{&\sum_{r=1}^{N} N_r^2\Big(3-[3-4pR_{mr}^2]\exp(-2pR_{mr}^2)\Big)\\ &+2\sum_{r>s}\Big(N_rN_s[3-4p(\tfrac{1}{2}R_{rs})^2]\exp(-2p(\tfrac{1}{2}R_{rs})^2)\\ &\quad-N_rN_s[3-4pR_{mrs}^2]\exp(-2pR_{mrs}^2)\Big)\Bigg\}^{1/2},\end{aligned} \tag{4.4}$$

where

$$R_{pq}^2 = (x_p-x_q)^2+(y_p-y_q)^2+(z_p-z_q)^2,$$

$$R_{mrs}^2 = [x_m-\tfrac{1}{2}(x_r+x_s)]^2+[y_m-\tfrac{1}{2}(y_r+y_s)]^2+[z_m-\tfrac{1}{2}(z_r+z_s)]^2.$$

This suggestive notation shows that, on account of the rapid evanescence of the
exponentials with increasing values of R, only nearest neighbour atoms need be
considered in summations involving m.
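The expression in braces of (4.4) is straightforward to evaluate for a given set of atomic positions. The sketch below takes it in the form Σ N_r² (n − [n − 4pR_mr²] exp(−2pR_mr²)) + 2 Σ_{r>s} N_r N_s ([n − 4p(½R_rs)²] exp(−2p(½R_rs)²) − [n − 4pR_mrs²] exp(−2pR_mrs²)), with n = 3 for a three-dimensional synthesis. This reading of the formula, the function names and the value of p used in the example are assumptions of the illustration, not statements from the paper.

```python
import math
from itertools import combinations

def dist2(u, v):
    """Squared distance between two points given as coordinate tuples."""
    return sum((a - b) ** 2 for a, b in zip(u, v))

def brace_expression(m, atoms, p, n=3):
    """Expression in braces for atom index m; atoms is a list of (N_r, position) pairs."""
    x_m = atoms[m][1]
    total = 0.0
    # Single sum over all atoms; the term r = m vanishes because R_mm = 0.
    for n_r, x_r in atoms:
        r2 = dist2(x_m, x_r)
        total += n_r ** 2 * (n - (n - 4.0 * p * r2) * math.exp(-2.0 * p * r2))
    # Double sum over distinct pairs (r, s).
    for (n_r, x_r), (n_s, x_s) in combinations(atoms, 2):
        half2 = dist2(x_r, x_s) / 4.0                      # (R_rs / 2)^2
        midpoint = tuple(0.5 * (a + b) for a, b in zip(x_r, x_s))
        rm2 = dist2(x_m, midpoint)                         # R_mrs^2
        total += 2.0 * n_r * n_s * (
            (n - 4.0 * p * half2) * math.exp(-2.0 * p * half2)
            - (n - 4.0 * p * rm2) * math.exp(-2.0 * p * rm2)
        )
    return total
```

When all atoms are well separated the exponentials disappear and the function tends to n Σ' N_r², the primed sum omitting r = m. For four equal atoms of unit weight placed 10 units apart, `brace_expression(0, atoms, 4.69)` is therefore very nearly 9.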

5. An evaluation of the errors predicted by (4.4) for some actual molecular
configurations shows that, in three-dimensional synthesis where very close approach of
atomic maxima is prohibited, all terms in the summations, except that in R_mm, are
negligible. The simplified formula

$$\epsilon_m = \pm\frac{P\epsilon}{2\sqrt{\pi V}}\left(\frac{\pi}{2p}\right)^{5/4}\left\{3\sum_{r}{}' \omega_r^2\right\}^{1/2} \tag{5.1}$$

proves to be adequate. Here ω_r is the ‘weight’ (N_r/N_m) of the rth atom with respect
to the atom m under consideration, and the prime indicates that the term r = m is
to be omitted from the summation.

For structures containing only atoms of approximately equal atomic numbers
the formula

$$\epsilon = \pm\frac{P\epsilon}{2\sqrt{\pi V}}\left(\frac{\pi}{2p}\right)^{5/4}\left\{3(N-1)\right\}^{1/2} \tag{5.2}$$

is adequate. Thus, for a benzene ring configuration, the expression in braces { }
of (5.2) has the value 15, against the exact value, given by (4.4), of 14.05.

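6. For completeness, the modifications of (4.4) for two- and one-dimensional
summations have been derived, and it is found that all the results can be summarized
in the formula

$$\begin{aligned}\epsilon_m = \pm\frac{P\epsilon}{2N_m\sqrt{\pi v}}\left(\frac{\pi}{2p}\right)^{\frac{1}{4}(n+2)}\Bigg\{&\sum_{r=1}^{N} N_r^2\Big(n-[n-4pR_{mr}^2]\exp(-2pR_{mr}^2)\Big)\\ &+2\sum_{r>s}\Big(N_rN_s[n-4p(\tfrac{1}{2}R_{rs})^2]\exp(-2p(\tfrac{1}{2}R_{rs})^2)\\ &\quad-N_rN_s[n-4pR_{mrs}^2]\exp(-2pR_{mrs}^2)\Big)\Bigg\}^{1/2},\end{aligned} \tag{6.1}$$

where n = number of dimensions in summation,
v = generalized ‘volume’ of repeat unit,
  = volume for 3-dimensional summations,
  = area for 2-dimensional summations,
  = length for 1-dimensional summations.

In a like manner (5.1) and (5.2) can be extended, but, since in projection approach
to coincidence is possible, their use is limited and care must be exercised to ensure
their validity.

(5.1) becomes

$$\epsilon_m = \pm\frac{P\epsilon}{2\sqrt{\pi v}}\left(\frac{\pi}{2p}\right)^{\frac{1}{4}(n+2)}\left\{n\sum_{r}{}' \omega_r^2\right\}^{1/2} \tag{6.2}$$

and (5.2)

$$\epsilon = \pm\frac{P\epsilon}{2\sqrt{\pi v}}\left(\frac{\pi}{2p}\right)^{\frac{1}{4}(n+2)}\left\{n(N-1)\right\}^{1/2}. \tag{6.3}$$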


7. An attempt has been made to obtain a value for the most probable proportional
error, Pε, in the case of oxalic acid. A comparison of the experimental F(hkl) values
for this substance, obtained independently by Robertson & Woodward (1936) and
by Brill, Hermann & Peters (1942), gave a value

Pε = 0.30,

but the fit of the data to a normal error curve was much less satisfactory than that
obtained for the absolute error, which gave (Booth 1947b)

Δε = 0.82.

The errors in co-ordinates predicted by the absolute and proportional error formulae
respectively are, for oxalic acid,

ε = 0.010 Å, ε = 0.017 Å.

It is thus seen that the present treatment suggests errors approximately twice
as large as the more naive methods of part I. At the same time it must be
remembered that the method of summation, via an infinite integral, includes all terms
possibly included in the Fourier summation for electron density, i.e. makes the
limits H, K, L in (1.1) infinite. The same assumption in the simple treatment with
absolute errors would result in an infinite error in the atomic co-ordinate. The correct
solution undoubtedly lies somewhere between the two alternatives.

The method of the present paper is easily extended to any law of variation of
error with Bragg angle of reflexion, and should such a law be found, which was in
good accord with observation, it might prove worth while to attempt the integrations,
although the probability of a result in closed form is small and approximate
methods would have to be used.

References

Booth 1947a Proc. Roy. Soc. A, 188, 77.

Booth 1947b Proc. Roy. Soc. A, 190, 490.

Robertson & Woodward 1936 J. Chem. Soc. p. 1817.

Brill, Hermann & Peters 1942 Ann. Phys., Lpz., 42, 357.



A study of the interaction of radio waves 

By J. A. Ratcliffe and I. J. Shaw
Cavendish Laboratory, Cambridge


(Communicated by Sir Edward Appleton, F.R.S.—Received 6 November 1947)


Experiments are described in which the phenomenon of wave-interaction (‘Luxembourg
effect’) is used to provide information about the height at which radio waves of different
frequencies are absorbed in the ionosphere. It is first demonstrated by two crucial experiments
that the absorption mechanism suggested by Bailey & Martyn (1934a and b) is the true one.

Measurements of the phase of the modulation transferred from one wave to the other by the
non-linear absorption process in the ionosphere are described; and it is shown how, by
measuring this phase at different modulation frequencies, it is possible to locate the region
where the interaction occurs. The results of a series of experiments summarized in tables
2, 3a and 3b and figures 8 and 9 are discussed.

The conclusion is reached that the frequency with which electrons collide with neutral
molecules at a height of about 85 km. is of the order 5 × 10^5 sec.⁻¹, and that this is the
height near which waves of frequency 1 Mcyc./sec. and 200 kcyc./sec. are mainly
absorbed at night. Waves of frequency 90 and 68 kcyc./sec. are absorbed, and possibly also
reflected, below this level. With the approach of dawn the regions responsible for absorbing
1 Mcyc./sec. and 200 kcyc./sec. waves drift apart.

The theory of Bailey & Martyn (1934b) and Bailey (1937a) is related to modern theories
of ionospheric absorption and is restated with the standard nomenclature of Appleton’s
magneto-ionic theory.


1. Introduction

It is well known that a radio wave travelling through the ionosphere may, under
certain circumstances, interact with a second wave in such a way that a modulation
imposed on one of them becomes transferred to the other. This effect has been called
‘wave-interaction’ or, sometimes, the ‘Luxembourg effect’ because the first
observations of the effect were made with one set of waves sent out from the broadcasting
station at Luxembourg. In those examples of this phenomenon which were first
observed the interaction produced unwanted modulation superimposed on the
modulation of the radio station which it was desired to receive. It was therefore
natural to call the wave whose reception was desired the wanted wave, and that from
which the undesired modulation was transferred the interacting wave.
We shall preserve this nomenclature throughout this paper: it is illustrated
schematically in figure 1.

Many experiments have been made to investigate wave-interaction, and especially
to discover the factors which determine its magnitude (van der Pol & van der Mark
(1935); van der Pol (1935); Bäumler & Pfitzer (1935); Grosskopf (1938); and Huxley,
Foster & Newton (1947)). Interaction has so far been observed only with
wave-lengths in the broadcast band (200 to 1500 m.), where it has been found with wanted
and interacting stations both in the medium-wave band, both in the long-wave
band, or with one in each band. In the latter case the wanted wave may be either
in the medium-wave or the long-wave band.

To produce interaction, the ionosphere must behave as a ‘non-linear’
transmission medium, and Bailey & Martyn (1934a) have suggested that the absorption
of the waves provides the necessary non-linearity. They first put forward their
suggestion before ideas about the ionospheric absorption of waves were very clear;
their theory may now be restated as follows. The main absorption of the wanted
wave occurs over fairly well-defined portions of the trajectory which are at a level
different from that where the main bending takes place. This absorption depends
on the frequency with which electrons collide with neutral molecules, and anything
which increases the velocity of the electrons, and therefore the frequency of these
collisions, will increase the absorption. If now another wave (the interacting wave)
is absorbed in the same place the energy removed from it will increase the velocity
of the electrons,* and therefore the frequency of their collisions, with the result
that the wanted wave will be more strongly absorbed. In the presence of the
interacting wave the wanted wave will therefore be weaker. If the amplitude of the
interacting wave is now slowly varied periodically the velocity of the electrons will
follow in step, and the absorption of the wanted wave will also vary periodically:
in this way the modulation of the interacting wave will become superimposed on
the wanted wave. According to the theory of Bailey & Martyn this is the mechanism
of wave interaction.

* It is interesting to note that a 100 kW transmitter working on a frequency of 200 kcyc./sec.
is capable of increasing the temperature of the electrons in the appropriate part of the
ionosphere by about 2% (see § 6(a)).

Figure 1. Schematic diagram to illustrate nomenclature.

If the interacting wave were suddenly removed the velocities of the electrons
would fall exponentially to their normal values with a time-constant which would
depend on the energy lost by an electron at each collision. We shall call this
time-constant the ‘relaxation time’ of the electrons and shall denote it by τ; we shall
see later that its magnitude is of the order of 10⁻³ sec. If the modulation frequency
is increased so that it becomes comparable with (2πτ)⁻¹ then the electrons will begin
to be incapable of following the modulation and the interaction will become smaller.
For slow variations the electrons’ velocity will vary in phase with the modulation
of the interacting wave; as the frequency is increased, however, the electrons’
velocity will tend to lag in phase behind the modulation of the wave, and when the
frequency is great the phase-lag will be ½π.*
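The frequency dependence described here can be illustrated with the simplest model consistent with an exponential decay of time-constant τ, namely a first-order response dv/dt = (v_eq − v)/τ to the modulated heating. That model is an assumption of this sketch (the paper's own treatment is in appendix A and is not reproduced here); under it the relative amplitude is 1/√(1 + (2πfτ)²) and the phase lag is arctan(2πfτ).

```python
import math

def relaxation_response(f_mod, tau):
    """Relative amplitude and phase lag (radians) of a first-order system.

    f_mod is the modulation frequency in cyc./sec. and tau the relaxation
    time in seconds; both names are illustrative.
    """
    x = 2.0 * math.pi * f_mod * tau
    return 1.0 / math.sqrt(1.0 + x * x), math.atan(x)

def corner_frequency(tau):
    """Modulation frequency (2*pi*tau)^-1 at which the lag reaches a quarter of pi."""
    return 1.0 / (2.0 * math.pi * tau)
```

With τ of the order of 10⁻³ sec. the corner frequency is of the order of 160 cyc./sec.; well below it the lag is small, and well above it the lag approaches ½π while the transferred modulation falls off in inverse proportion to the frequency.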

The original papers of Bailey & Martyn contained the deductions mentioned above,
but the theory is here restated in detail in § 6 and appendix A in a form which relates
it more closely to modern theories of the ionosphere. If the theory is correct it might
be expected that an experimental study of wave interaction would yield valuable
information about the absorbing portions of the ionosphere, and in particular about
those parts of the ionosphere which absorb both waves strongly. It is the purpose
of this paper to describe experiments made with this object, and to discuss the
results obtained.

Before the present work was started the only experimental test of the theory of
Bailey & Martyn was one in which the amplitude of the transferred modulation was
measured at different frequencies (van der Pol & van der Mark 1935). It was found
that the shape of the curve relating amplitude and frequency agreed with the theory
but the test was admittedly somewhat indirect, and van der Pol stated (van der Pol
1934):

‘Finally in our opinion it cannot be considered to have been proved definitely
that the effect is of a typical non-linear nature. The possibility is still open for an
explanation on the basis of a Doppler effect, viz. the periodic raising of the
conducting layers of the ionosphere in the rhythm of the modulation of the unwanted
station and restored by the molecular agitation, thus causing a frequency or phase
modulation of the wanted wave, which may in turn cause an amplitude modulation
at the receiver due to the superposition of this frequency modulated wave with other
rays which have not been affected on their course.’

It therefore seemed desirable to make a more direct test of the absorption
hypothesis, and two such tests are described in § 3. The first of these consisted in switching
the interacting wave on and off and observing that the wanted wave was weaker in
the presence of the interacting wave. The second was suggested by Appleton (1938)
and involved a comparison, at low modulation-frequencies, of the phases of the
modulation received direct on the interacting wave and received after being
transferred to the wanted wave by the process of wave-interaction. Both these tests
supported the hypothesis of the absorption mechanism.

Once interaction had been demonstrated by switching the interacting wave on
and off it was no longer necessary to modulate that wave to make observations, and
it was therefore possible to use wave-lengths outside the broadcast band, where the
transmitting stations available are not usually capable of being modulated.
Experiments on these wave-lengths are described in § 5.

* For a proof of these statements see appendix A. 





When a modulated interacting wave was used interesting information was
obtained by varying the modulation frequency and comparing the phase of the
modulation received direct from the interacting station with that of the modulation
transferred to the wanted wave. As the frequency is changed the phase difference
changes for two different reasons: (a) the phase change associated with the relaxation
time occurs as previously described, and as analyzed in detail in appendix A; and
(b) a progressive phase change occurs associated with the different lengths of the
two paths labelled IAR and IBCR in figure 1. It has proved possible to separate
the two different effects and to deduce separately the relaxation time of the electrons
and the difference between the paths IBCR and IAR. The path WBCR, of the
wanted wave, was determined, in a separate experiment, by causing the transmitter
of the wanted wave to emit pulses of the type usual in ionospheric research, and it
was then possible to locate the region (B) of interaction. Experiments based on the
principles here outlined are described in § 4.
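One way of picturing how the two effects can be separated is to note that they depend differently on the modulation frequency f: a path difference d contributes a phase 2πfd/c, which grows linearly with f, whereas a first-order relaxation contributes arctan(2πfτ), which saturates. The sketch below fits both quantities to a set of measured phase differences by scanning trial values of τ and solving for d by least squares at each. The model, the function names and the fitting procedure are assumptions made for illustration and are not the authors' method; the constant antiphase between the two modulations and any phase wrapping are ignored.

```python
import math

C_KM_PER_SEC = 299792.458  # velocity of light, km/sec

def model_phase(f_mod, tau, path_km):
    """Assumed phase difference (radians): relaxation lag plus path-difference lag."""
    return math.atan(2.0 * math.pi * f_mod * tau) + 2.0 * math.pi * f_mod * path_km / C_KM_PER_SEC

def fit_tau_and_path(freqs, phases, tau_grid):
    """Return (tau, path_km) minimising the squared phase residuals over tau_grid."""
    best = None
    sum_f2 = sum(f * f for f in freqs)
    for tau in tau_grid:
        # Remove the trial relaxation lag; what remains should be linear in f.
        rest = [ph - math.atan(2.0 * math.pi * f * tau) for f, ph in zip(freqs, phases)]
        slope = sum(f * y for f, y in zip(freqs, rest)) / sum_f2   # least-squares line through the origin
        path_km = slope * C_KM_PER_SEC / (2.0 * math.pi)
        err = sum((y - slope * f) ** 2 for f, y in zip(freqs, rest))
        if best is None or err < best[0]:
            best = (err, tau, path_km)
    return best[1], best[2]
```

As a usage example, phases synthesised from `model_phase` with τ = 10⁻³ sec. and a path difference of 150 km. at modulation frequencies between 50 and 2000 cyc./sec. are recovered by `fit_tau_and_path` when the trial grid contains the true τ.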

In the estimation of the relaxation time of the electrons it is also of value to observe
the way in which the amplitude of the imposed modulation varies with frequency.
A special method developed for measuring this amplitude is described in § 5.

The evidence from the different types of experiment is collected and discussed
in § 7. In this discussion we shall make use of a quantity which we have called the
‘coefficient of transferred absorption’ which is a measure of the fractional decrease
which occurs in the field strength of the wanted wave when a steady interacting
wave is suddenly switched on.


2. Arrangement of transmitters and receivers

For experiments of this kind it is necessary to use transmitters of suitable
wave-length and considerable power situated in suitable places, and we were most
fortunate in obtaining the enthusiastic co-operation of the B.B.C., who provided the
necessary transmissions. We wish to record our thanks to the Chief Engineer and
all those on his staff who helped in the organization and carrying out of the
complicated series of experimental transmissions for which we asked and, in particular,
we should like to thank Mr H. L. Kirke, head of the B.B.C. Research Department,
for his willingness to help on all occasions. We also wish to thank the Chief Engineer
of the Post Office and his staff, and the Director of Scientific Research at the
Admiralty, for putting their most valuable facilities at our disposal. Without the
help of these organizations the series of experiments here described would have
been impossible.

The main series of observations was made at Cambridge (long. 00° 05' W., lat.
52° 10' N.). Dr L. G. H. Huxley made some additional observations on the same
transmissions at Birmingham and we have had the benefit of comparing his results
with ours. The transmitters which were used at different times are listed in table 1
and their positions indicated in the map of figure 2.





Table 1. Details of transmitters used

name or       place        long. and lat.          designation on   controlling  power   frequency      nature of
call sign                                          map of figure 2  authority    in kW   in kcyc./sec.  transmission

Ottringham    Ottringham   00° 04' W., 53° 42' N.  O                B.B.C.       167     167            modulated
Droitwich     Droitwich    02° 06' W., 52° 18' N.  D(200)           B.B.C.       150     200            modulated
Westerglen    Westerglen   03° 49' W., 55° 59' N.  W                B.B.C.       60      767            modulated
Lisnagarvey   Lisburn      06° 03' W., 54° 29' N.  L                B.B.C.       100     1050           modulated
Droitwich     Droitwich    02° 06' W., 52° 18' N.  D(1013)          B.B.C.       68      1013           modulated
GBY           Rugby        01° 20' W., 52° 15' N.  GBY              Post Office  80      68             single sideband telephony
GYA2          Cleethorpes  00° 10' W., 53° 30' N.  GYA2             Admiralty    20      90.2           keyed

Figure 2. Locations of transmitters and receivers.




3. Tests of the absorbing mechanism 

If wave interaction is explained by the absorption theory of Bailey & Martyn,
then when the interacting wave is strong we should expect the wanted wave to be
weak, and Appleton (1938) has pointed out that the modulation of the interacting
wave and the modulation imposed on the wanted wave should therefore be in
antiphase. It is shown in the next section that this antiphase relationship would
only be expected if the modulation frequency were low enough, and it is
demonstrated that a frequency of 50 cyc./sec. satisfies the necessary conditions. Special
attention has therefore been paid to the observation of the phases at this frequency.
On all occasions, and with all the combinations of wanted and interacting
transmitters which are described later, it has been found that the modulation received
direct from the interacting station is in opposite phase to that transferred to the
downcoming wave from the wanted station.* This observation provides one
fundamental check of the absorption theory of wave-interaction.


Figure 3. Effect of switching the interacting station on and off (traces: interacting wave;
wanted wave). In this record the wanted signal is fed to the recording galvanometer through
a condenser so that only the sudden changes of amplitude, at the instants of switching the
interacting wave on or off, are recorded.


A further very direct test of the absorption theory is provided by arranging that
the sender of the interacting wave is not modulated, but is simply switched on
and off. When it is on the wanted wave should be weaker and when it is off it should
be stronger. This test has been made with the two combinations of (a) Westerglen
(wanted) and Ottringham (interacting); and (b) Lisnagarvey (wanted) and
Droitwich 200 kcyc./sec. (interacting). The change of amplitude was measured by
observing the rectified current on a galvanometer arranged so that the greater part
of the standing current was ‘backed off’. The wanted wave was always fading and
it was necessary to keep the ‘backing off’ arrangement adjusted by hand so that the
reading was always on the scale of the galvanometer. There was a clearly marked
decrease of amplitude when the interacting station was switched on and an increase
when it was switched off, as required by the theory.

* In appendix B we discuss some complications which may arise if a ground wave is received
in strength comparable with the downcoming wave.




The change in amplitude ΔF of the radio-frequency e.m.f. was determined as a
fraction of the total e.m.f. F. We shall call the fraction ΔF/F the ‘coefficient of
transferred absorption’, and its magnitude will be further discussed in § 5.

Although the method of the ‘backed-off’ galvanometer proved most satisfactory
for measurement of the change of amplitude, the occurrence of fading rendered it
less useful for providing a permanent record of the effect. A photographic record
was therefore obtained by the use of a rapidly moving galvanometer which was fed
from the receiver through a condenser of such a value that the slower changes
representing fading were less evident in comparison with the rapid changes produced
by keying the interacting transmitter. A record obtained in this way is reproduced
in figure 3.
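The action of the series condenser can be pictured as that of a simple high-pass filter, which passes the sudden step produced by keying and largely suppresses the slow drift due to fading. The sketch below models it as an ideal first-order RC high-pass stage acting on sampled data. The filter model, the time-constant, the fading period and the step size are all illustrative assumptions and are not values from the paper.

```python
import math

def high_pass(samples, dt, rc):
    """Discrete first-order RC high-pass filter: y[i] = alpha*(y[i-1] + x[i] - x[i-1])."""
    alpha = rc / (rc + dt)
    out = [0.0]
    for prev, cur in zip(samples, samples[1:]):
        out.append(alpha * (out[-1] + cur - prev))
    return out

def keying_demo(dt=0.01, rc=0.05, fade_period=60.0, step=-0.05, t_step=5.0, t_end=10.0):
    """Return (|output| at the keying instant, largest |output| before it)."""
    n = int(round(t_end / dt)) + 1
    i_step = int(round(t_step / dt))
    # Slow sinusoidal fading of unit amplitude, with a small step when the
    # interacting transmitter is switched on at t_step.
    signal = [math.sin(2.0 * math.pi * i * dt / fade_period) + (step if i >= i_step else 0.0)
              for i in range(n)]
    out = high_pass(signal, dt, rc)
    return abs(out[i_step]), max(abs(v) for v in out[:i_step])
```

With these illustrative values the filtered record shows the 5 per cent step as a sharp excursion several times larger than the residue of the fading, although the fading itself is twenty times larger than the step in the unfiltered signal.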

The success of these two very direct tests of the absorption theory of interaction 
has satisfied us that the theory is correct, and has encouraged us to use it in making 
the detailed deductions explained in the rest of this paper. 

4. Observations of phase 

Since it was desired to compare the phase of the transferred modulation with 
that of the wave received over a known direct path from the interacting station, it 
was essential to ensure that the downcoming wave alone was received from the 
wanted station and the ground wave alone from the interacting station. In the two 
main series of experiments to be described in this section the transmitter of the 
wanted wave was either Westerglen (frequency 767 kcyc./sec., distance 480 km.) or 
Lisnagarvey (frequency 1050 kcyc./sec., distance 495 km.) and, since the experiments 
were all done during the night or at dawn, the downcoming wave was considerably 
stronger than the ground wave. It was therefore sufficient to receive it 
on a simple aerial; a loop was often used so that its directional properties could be 
used to reduce interference from other transmissions. In some subsidiary experiments 
the wanted station was nearer (e.g. Droitwich, distant 180 km., transmitting 
either on 1013 kcyc./sec. or on 200 kcyc./sec.) and in order to receive only the 
downcoming wave it was necessary to use a loop aerial adjusted, by day, to suppress 
the ground wave. 

In the main series of experiments the interacting transmitter was in the long-wave 
band and was distant 100 or 200 km. so that its ground wave was considerably 
stronger than the downcoming wave. A vertical aerial, which discriminates to some 
extent against the downcoming wave, was always used. No deductions about phase 
were made from any subsidiary experiments in which the interacting station produced 
a strong downcoming wave at the receiver. 

The two waves were received on two separate radio-frequency amplifiers (three 
stages of straight amplification) each followed by a diode detector and one stage of 
audio-frequency amplification. The audio-frequency amplifiers were designed so 
that their response was constant over the range 50 to 2000 cyc./sec. and so that the 
phase-shift introduced by them was small. Moreover, both amplifiers were similar 
so that any residual phase-shift would be the same in each. The outputs of the two 
amplifiers were applied to deflect the two traces of a Cossor double-beam cathode-ray 
oscilloscope and were viewed on a linear time-base synchronized to the modulation. 
The equipment was tested by the application of radio-frequency e.m.f.’s generated 
in local signal-generators, modulated from a common source, and the amplitude of 
the wave-trace on the oscilloscope was shown to be proportional to the modulation 
amplitude of the input and independent of the frequency. The two traces were shown 
to be in phase, and to remain in phase over the operational frequency range. Tests 
were also made to demonstrate that no interaction occurred between the two 
amplifiers, and possible interaction in the oscilloscope was guarded against. 

The method of experimentation usually adopted was to arrange for the wanted 
wave to be unmodulated, and for the interacting wave, usually with a power of 100 
or 150 kW, to be modulated to a depth of 80 % with a series of pure tones having the 
frequencies 50, 75, 100, 150, 200, 300, 400, 500, 750, 1000, 1250, 1500, 1750, 2000 
cyc./sec., the whole sequence of operations occupying about half an hour. The traces 
on the oscilloscope were either observed visually, or photographed and afterwards 
measured up, and the phase difference at each frequency was determined. A good 
measure of the phase difference could usually be obtained on all frequencies less 
than about 1000 cyc./sec. Examples of the type of trace observed are shown in 
figure 4. 

When the observed phase-difference was plotted against modulation frequency 
a curve of the type shown in figure 5 was obtained. This curve may be explained 
in terms of the ideas outlined in § 1 in the following way. We show in appendix A 
that, as a result of the finite ‘relaxation time’ $\tau$ of the ionospheric electrons, the 
transferred modulation at a frequency $f$ has a phase-lag $\phi_1 = \tan^{-1}(2\pi f\tau)$. If, also, 
there is a difference $d$ between the two paths IAR and IBCR of figure 1, there is an 
additional phase-lag given by $\phi_2 = 2\pi f d/c$. The total phase-lag $\phi$ is then the sum 
of the ‘electron phase-lag’ $\phi_1$ and the ‘path phase-lag’ $\phi_2$, so that 

$$\phi = \tan^{-1}(2\pi f\tau) + 2\pi f d/c.$$ 

When a set of experimental points had been plotted as in figure 5 the constants 
$\tau$ and $d$ were chosen so as to give a theoretical curve lying smoothly among the points. 
The curve in figure 5 is drawn for $\tau = 1.8 \times 10^{-3}$ sec. and $d = 222$ km. In figure 6 
the ‘path phase-lag’ appropriate to this distance has been subtracted from the 
experimentally observed phase-lags and the resulting ‘observed’ values of $\phi_1$ 
plotted as points. The curve represents $\phi_1 = \tan^{-1}(2\pi f\tau)$ for $\tau = 1.8 \times 10^{-3}$ sec. and 
the accuracy with which the points fall on it provides a measure of the reliability of 
the deductions. 
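
The two-term phase-lag expression can be evaluated numerically. The following is a minimal sketch only, assuming the values quoted for figure 5 ($\tau = 1.8 \times 10^{-3}$ sec., $d = 222$ km.) and a rounded velocity of light; the function names are illustrative and do not come from the paper.

```python
import math

# Velocity of light in km/sec (rounded modern value; an assumption of this sketch).
C_KM_PER_SEC = 2.998e5


def electron_phase_lag(f, tau):
    """Electron phase-lag phi_1 = arctan(2*pi*f*tau), in radians."""
    return math.atan(2.0 * math.pi * f * tau)


def path_phase_lag(f, d_km):
    """Path phase-lag phi_2 = 2*pi*f*d/c, in radians, with d in km."""
    return 2.0 * math.pi * f * d_km / C_KM_PER_SEC


def total_phase_lag(f, tau, d_km):
    """Total phase-lag phi = phi_1 + phi_2, in radians."""
    return electron_phase_lag(f, tau) + path_phase_lag(f, d_km)


# Values quoted in the text for the curve of figure 5.
TAU = 1.8e-3   # relaxation time, sec
D_KM = 222.0   # path difference, km

# Modulation frequencies used in the experiments (cyc./sec.).
for f in (50, 75, 100, 150, 200, 300, 400, 500, 750, 1000):
    phi = math.degrees(total_phase_lag(f, TAU, D_KM))
    phi1 = math.degrees(electron_phase_lag(f, TAU))
    print(f"f = {f:5d} cyc./sec.  phi_1 = {phi1:6.1f} deg  phi = {phi:6.1f} deg")
```

Subtracting `path_phase_lag` from a measured phase-lag gives the ‘observed’ electron phase-lag in the sense used for figure 6.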

J. A. Ratcliffe and I. J. Shaw 

The composition of the downcoming wave was frequently investigated, during 
a night’s observations, by arranging that pulses of duration about 100 μsec. were 
emitted from the wanted station and observed at the receiver on an oscilloscope 
with a time-base synchronized, by way of the national ‘grid’ system of electric 
supply, to the emission of the pulses. A measurement of the intervals between the 
pulses sufficed to decide whether a given pulse was reflected from region E or region F, 
but it was not usually possible to make measurements with sufficient accuracy to 
provide a reliable measure of the height of the reflexion; this is particularly the case 
because the difference of path between the different multiple reflexions arriving at 
oblique incidence is not very sensitive to variations of the height of reflexion. We 
have therefore usually assumed that reflexions from region E occurred at a height 
of 100 km. and those from region F at 250 km.; these represent average heights of 
reflexion for waves of this frequency incident obliquely, as deduced from the routine 
ionospheric observations carried out at the Radio Research Station, Slough (lat. 
00° 45' W., long. 51° 25' N.).* Our measurements of echo-patterns have fitted these 
values quite well. Comparison of successive reflexions from the same region also 
provided a measure of the absorption; this information will be used in the discussion 
contained in § 7. 

Figure 5. Experimental results obtained at 22.30 to 02.30 hr. g.m.t. on 4/5 May 1947, 
with Westerglen as wanted station and Ottringham as interacting station. 

Figure 6. Curve of electron phase-lag against frequency. Experimental points marked ○ 
are deduced from figure 5; the full line is the curve $\phi_1 = \tan^{-1}(2\pi f \times 1.8 \times 10^{-3})$. 

* We are much indebted to Dr R. L. Smith-Rose for supplying us with detailed information 
for the occasions of our experiments. 




Some observations of the pulse transmissions have been made in the presence of 
a modulated interacting wave and the modulation transferred to one single reflected 
pulse, generally the first from region E, has been observed. It is probable that useful 
information could be derived from further observations made in this way. 

In the theoretical interpretation of the curves of figures 5 and 6 we have 
assumed that there is one predominant downcoming wave from the wanted station 
and that all the interaction occurs at one point on its trajectory. The more complicated 
situation arising if these assumptions do not hold is analyzed in appendix B. 
On some occasions we have noticed that the phase of the transferred modulation 
alters during the 2 min. period of the observation; this is explained in the appendix 
as being due to the presence of two downcoming waves comparable in intensity. 
In general, however, the phase was constant during a period of observation, as we 
should be led to expect from the pulse transmissions which usually showed the first 
echo to be much stronger than the others. 





Figure 7. Possible locations of wave-interaction deduced from experiments, summarized 
in figures 5 and 6, which give an average path difference of 200 km. 

From curves such as that of figure 5 the path difference $d$ was deduced, and since 
the distance IAR in figure 1 was known, the distance IBCR was calculated. Let 
us call this distance $x$. A series of possible wave-trajectories from the wanted station 
to the receiver was then drawn, as indicated in figure 7, and points (such as 1, 2, 3) 
on these trajectories were found which gave the measured value of $x$. The points 
marked in figure 7 correspond to the experimental results summarized in figure 5, 
in which Westerglen was used as the wanted station and Ottringham as the interacting 
station. In choosing amongst these possible positions for the origin of the 
wave-interaction we are now guided by the expectation that the interaction will be 
greatest where there is the greatest amount of absorption common to the two waves, 
and this region is just below the point of reflexion in region E. In the case illustrated 
in figure 7 we therefore select the point 1 as the probable location of the interaction. 






Two series of experiments of the type outlined above have been carried out with 
the object of locating the region where the interaction occurs. In the first series the 
wanted wave was transmitted from Westerglen and the interacting wave from 
Ottringham. The results are summarized in table 2 and figure 8. In the second series 
the wanted wave was transmitted from Lisnagarvey and the interacting wave from 

Table 2. Results of experiments in which Westerglen sent the wanted 
WAVE AND OTTRINGHAM SENT THE INTERACTING WAVE 

number of                                height of      value of $G\nu$ ($= 1/\tau$)   value of $G\nu$ ($= 1/\tau$) 
experiment   date         time           interaction    from phase curves      from amplitude curves 
             (1947)       (g.m.t.)       region (km.)   (sec.$^{-1}$)              (sec.$^{-1}$) 

 1           10 January   01.00-02.30    95              623                   725* 
 2           11 January   01.00-02.30    95              895 
 3           4/5 May      22.30-00.30    90              555                   — 
 4           5/6 May      22.30-00.30    85             1260                   — 

* Average for two nights. 


Table 3a. Results of experiments in which Lisnagarvey sent the wanted 
WAVE AND DROITWICH (200 KCYC./SEC.) SENT THE INTERACTING WAVE 

number of                                height of      value of $G\nu$ ($= 1/\tau$)   value of $G\nu$ ($= 1/\tau$) 
experiment   date         time           interaction    from phase curves      from amplitude curves 
             (1947)       (g.m.t.)       region (km.)   (sec.$^{-1}$)              (sec.$^{-1}$) 

 5           15/16 June   22.30-02.30    77.5           2520                   1640† 
 6           16/17 June   22.30-02.30    83.5           1640 
 7           17/18 June   22.30-02.30    75.0           1260                   — 
 8           3/4 July     22.30-02.30    91.5           1320                   — 
 9           4/5 July     22.30-02.30    97.0           2890                   — 
10           5/6 July     22.30-02.30    78.0           1290                   — 
11           6/7 July     22.30-02.30    87.0           1010                   — 
12           6 August     23.00-23.30    84.0            890                   — 
13           7 August     00.10-00.30    80.0            720                   — 

† Average for three nights. 


Table 3b. Results of experiments in which Lisnagarvey sent the 
WANTED WAVE AND OTTRINGHAM THE INTERACTING WAVE 

number of                                height of      value of $G\nu$ ($= 1/\tau$)   value of $G\nu$ ($= 1/\tau$) 
experiment   date         time           interaction    from phase curves      from amplitude curves 
             (1947)       (g.m.t.)       region (km.)   (sec.$^{-1}$)              (sec.$^{-1}$) 

14           15/16 June   22.30-02.30    85             —                      — 
15           16/17 June   22.30-02.30    92             1570                   — 




Droitwich or Ottringham. The results are summarized in table 3 and figure 9. In 
tables 2 and 3 the results of experiments are expressed in terms of the quantity $1/\tau$, 
since it is shown, in equation (21) of appendix A, that this is equal to the theoretically 
important quantity $G\nu$. 

In the first series of experiments the interacting station was nearly on the straight 
line joining the receiver and the wanted station, in the second series the interacting 
station was some considerable distance from this line and the appropriate solid 
geometry had to be taken into account in calculating the location of the interaction. 

The results of the phase measurements, as summarized in tables 2 and 3 and 
figures 8 and 9, are discussed in § 7. 



Figure 8. Locations of observed interaction of Ottringham on Westerglen (see table 2). 



Figure 9. Locations of observed interaction of Ottringham and Droitwich on Lisnagarvey 
(see tables 3a and 3b). The points O′ and D′ represent the feet of the perpendiculars from 
Ottringham and Droitwich to the line joining Lisnagarvey and Cambridge. 

5. Magnitude of the interaction 

In order to measure the relative magnitude of the transferred modulation, a 
simple method was devised which gave good results even with a wave which exhibited 
marked fading. It was arranged that the wanted wave was weakly modulated with 
an amplitude of 5 or 10 % at a frequency somewhat different from the modulation 
frequency of the interacting station, so that, at the receiver, the normal modulation 
of the station was observable in addition to the modulation imposed by interaction. 
The trace on the oscilloscope then took the form shown in figure 10, in which the 
distance AB corresponds to the directly-impressed modulation and the distance 
CD to the transferred modulation. From oscillograms of this kind it was a simple 
matter to deduce the percentage modulation due to interaction. Although the 
percentage of direct modulation as observed at the receiver changed, due to interference 
effects between different reflected waves, and although the amount of interaction 
also varied as the wave faded, it was found that the mean readings taken in 
this way were fairly constant and appeared to be reliable. They also agreed with 
direct measurements made by recording simultaneously the rectified current, which 
gave a measure of the carrier intensity, and the amplitude of the oscilloscope trace, 
and then calibrating with a modulated e.m.f. from a signal generator. Measurements 
made in this way led to the results described in the following sub-sections. 


Figure 10. Photograph of oscilloscope trace to illustrate the method of measuring small 
amplitudes of interaction-modulation. The distance AB corresponds to 5 % direct modulation 
at 500 cyc./sec. on the wanted carrier from Lisnagarvey and the distance CD to the transferred 
modulation. The interacting station was Droitwich (200 kcyc./sec.) modulated at 
100 cyc./sec. 

(a) Magnitude of interaction as a function of modulation frequency 

The results of a typical series of measurements made as the modulation frequency 
was varied are shown in figure 11 and are similar to those obtained previously by 
other workers (van der Pol & van der Mark 1935; Huxley et al. 1947). 

Equation (20a) of appendix A shows that we should expect the coefficient of 
modulation ($m$) superimposed on the wanted wave to be given by the expression 

$$m = T\{2\mu/(1 + \tfrac{1}{2}\mu^2)\}\{1 + 4\pi^2 f^2\tau^2\}^{-1/2}. \qquad (20a)$$ 

In this expression $T$ is the ‘coefficient of transferred absorption’ defined by equation 
(8) of § 6; $\mu$ is the coefficient of modulation of the interacting wave modulated at 
frequency $f$; and $\tau$ is the relaxation time of the ionospheric electrons. 
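
Equation (20a) is easily evaluated for any modulation frequency. The sketch below is illustrative only: the function name and the sample values of $T$, $\mu$ and $\tau$ are assumptions chosen to be of the order quoted in this paper, not measured data.

```python
import math


def transferred_modulation(T, mu, f, tau):
    """Coefficient of transferred modulation m from equation (20a).

    T   : coefficient of transferred absorption (dimensionless)
    mu  : coefficient of modulation of the interacting wave (0 to 1)
    f   : modulation frequency, cyc./sec.
    tau : relaxation time of the ionospheric electrons, sec.
    """
    zero_frequency_value = T * 2.0 * mu / (1.0 + 0.5 * mu ** 2)
    return zero_frequency_value / math.sqrt(1.0 + (2.0 * math.pi * f * tau) ** 2)


# Illustrative (assumed) values: T = 0.04, 80 % modulation, tau = 6e-4 sec.
for f in (50, 100, 200, 400, 750, 1000, 2000):
    m = transferred_modulation(0.04, 0.8, f, 6.0e-4)
    print(f"f = {f:5d} cyc./sec.  m = {100.0 * m:5.2f} %")
```

The value at $f = 0$ is the quantity $T\{2\mu/(1 + \tfrac{1}{2}\mu^2)\}$, so a fitted curve extrapolated to zero frequency yields $T$ once $\mu$ is known.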

That the experimental points plotted in figure 11 show a reasonable agreement 
with theory is seen by comparing them with the theoretical curve which has been 
plotted for $\tau = 5.8 \times 10^{-4}$ sec. On all occasions when the intensity of the interaction 
has been determined as a function of frequency in this way the agreement with 
theory has been similar to that shown in figure 11 and the values obtained for the 
relaxation time $\tau$ are shown in tables 2 and 3. These values are discussed in § 7. 
An examination of all our results shows that the agreement with theory is not 
so good in the case of amplitude measurements as it is in the case of phase 
measurements. This difference, which may be seen by comparing figure 11 with 
figures 5 and 6, is difficult to explain. 



Figure 11. Percentage transferred modulation as a function of modulating frequency. 
Results obtained on 16/17 June, with Lisnagarvey sending the wanted wave and Droitwich the 
interacting wave. Experimental points ○; full curve, theoretical values for $\tau = 6.8 \times 10^{-4}$ sec. 

If, in a plot such as that of figure 11, the curve showing best theoretical fit is 
extrapolated to zero frequency we obtain a value for the quantity $T\{2\mu/(1 + \tfrac{1}{2}\mu^2)\}$ 
and hence, since $\mu$ is known, for $T$. It has been confirmed that the value of $T$ 
obtained in this way is in good agreement with the value obtained from measurements 
in which the unmodulated interacting carrier was switched on and off 
(see § 3). 

We have made enough measurements of modulation amplitude as a function of 
modulation frequency to satisfy ourselves that the results are approximately 
represented by equation (20a) and we shall usually express the absolute magnitude 
of the interaction effect by stating the value of the ‘coefficient of transferred absorption’, 
$T$, deduced from curves of the type shown in figure 11, or measured directly 
by switching the interacting station on and off. 

(b) Coefficient of transferred absorption as a function of the 
power of the interacting station 

The theory of Bailey & Martyn (1934b) would lead to the expectation that the 
‘coefficient of transferred absorption’ should be proportional to the radiated power of 
the interacting transmitter, but some tests by Grosskopf (1938) did not, apparently, 
confirm this expectation. We therefore conducted a special experiment in which the 
interacting wave was emitted from Ottringham and was modulated at a constant 
frequency of 400 cyc./sec. and the modulation impressed on a wanted wave emitted 
by Westerglen was measured, by the method described above, for different powers 
of the interacting transmitter. The results are shown in figure 12 and appear to 
agree well with the theoretical expectation, in contrast with the results of Grosskopf. 
We have little doubt that the facts are in agreement with theory in this respect, but 
cannot suggest why the other workers found a different result. 



Figure 12. Results of varying the power of the interacting station, obtained at 00.30 hr. 
g.m.t. on 5 to 9 May, with Westerglen sending the wanted wave and Ottringham the 
interacting. 


Table 4. Observed values of coefficients of transferred absorption 

         sender of wanted wave               sender of interacting wave          coefficient of transferred absorption 
serial   symbol on  frequency     power      symbol on  frequency     power                 reduced to        method of 
number   figure 2   (kcyc./sec.)  (kW)       figure 2   (kcyc./sec.)  (kW)       observed   100 kW, 150 km.   observation 

 1       W           767           60        O           167          167         0.08       0.05             modulation and keying 
 2       W           767           60        GYA2         90.2         20        <0.001     <0.005            keying 
 3       L          1050          100        D(200)      200          170         0.04       0.023            modulation 
 4       L          1050          100        D(1013)    1013           60         0.03       0.029            modulation 
 5       L          1050          100        GBY          68           80        <0.001     <0.002            keying and modulation 
 6       L          1050          100        O           167          167         0.01       0.011            modulation 
 7       D(200)      200          170        D(1013)    1013           60         0.02       0.024            modulation* 
 8       D(1013)    1013           60        D(200)      200          170         0.04       0.029            modulation* 
 9       D(200)      200          170        O           167          167         0.04       0.029            modulation* 
10       O           167          167        D(200)      200          170         0.02       0.014            modulation* 

* Loop: ground ray suppressed. 




(c) Coefficient of transferred absorption with different pairs of stations 

An attempt was made to measure the ‘coefficient of transferred absorption’ with 
different pairs of sending stations to investigate in detail the effect of the radio 
frequencies and geographical positions of the senders. The powers of the interacting 
stations were not the same, and in order to reduce the results to a common basis for 
comparison the measured coefficients were expressed, by using the results of 
subsection (b), in terms of an interacting transmitter of power 100 kW, distant 150 km. 
from the point of interaction. We shall refer to these modified coefficients as 
‘equivalent coefficients of transferred absorption’. The results obtained are shown in 
table 4. 
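
The reduction to a common basis can be sketched as follows. This is a hedged illustration, not the authors' stated procedure: it assumes that the coefficient is proportional to the radiated power (the result of subsection (b)) and, additionally, that the power flux at the point of interaction falls off as the inverse square of the distance from the interacting sender. The function name and the sample distance are hypothetical.

```python
def equivalent_coefficient(t_observed, power_kw, distance_km,
                           ref_power_kw=100.0, ref_distance_km=150.0):
    """Reduce an observed coefficient to a reference sender.

    Assumptions of this sketch (not taken from the paper's text):
      * coefficient proportional to radiated power;
      * power flux proportional to 1/distance**2.
    distance_km is the distance from the interacting sender to the
    point of interaction.
    """
    power_scale = ref_power_kw / power_kw
    distance_scale = (distance_km / ref_distance_km) ** 2
    return t_observed * power_scale * distance_scale


# Illustrative use: a 167 kW sender giving an observed coefficient of 0.08,
# with the point of interaction assumed (hypothetically) to be 150 km away.
print(equivalent_coefficient(0.08, 167.0, 150.0))
```

With these assumptions the first row of table 4 (observed 0.08 at 167 kW) reduces to a value close to the tabulated 0.05 when the distance is taken near 150 km.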

Most of the measurements were straightforward and were made by the methods 
described in previous paragraphs. In one or two cases, however, special arrangements 
were used; thus when the morse station GYA2 was used as sender of the 
interacting wave it was not possible to modulate it and the tests had to be made by 
keying it on and off. The station GBY is constructed for single sideband telephony 
and the tests in which it emitted the interacting wave were made in two different 
ways. In one an audio-frequency tone was injected into the speech circuit and was 
keyed on and off; this produced an intermittent sideband at one frequency which 
simulated a keyed C.W. signal. In the other method the audio tone was itself modulated 
at 50 cyc./sec. so that the single sideband emitted was effectively a single 
carrier modulated at 50 cyc./sec. 

The ultimate sensitivity of observation was set by the random noise received and 
when no interaction was observed an upper limit is given in table 4. The arrangement 
could have been made more sensitive by the use of narrow-band audio filters 
and it is proposed to use this method in future work. 



Figure 13. Decrease of interaction as dawn advances (abscissa: time, g.m.t.). Results of 7 July, 
with Lisnagarvey sending the wanted wave and Droitwich the interacting. Sunrise in the 
ionosphere at 100 km. was at 02.30 hr. g.m.t. 


(d) The effect of sunrise on the coefficient of transferred absorption 

It is well known that the intensity of downcoming waves decreases near sunrise 
due to increased absorption, and it is of interest to measure the coefficient of transferred 
absorption throughout this period. Experiments were made with Lisnagarvey 
as wanted station and Droitwich (200 kcyc./sec.) as interacting station during the 
sunrise period on 7 July 1947.* The wanted station was modulated to a depth of 
10 % (or sometimes 5 %) and the comparison method previously outlined was used. 
The results are shown in figure 13 from which it can be seen that the coefficient of 
transferred absorption was greatest during the night and became unmeasurably 
small during the hour before ground sunrise. 

6. Theoretical considerations 

We shall now consider, in the light of present-day theories of absorption, the 
relation between interaction phenomena and the absorption of waves passing 
through the ionosphere. We shall calculate the magnitude of interaction effects in 
two parts. In the first part, contained in this section, we shall calculate the interaction 
effect at zero modulation frequency (corresponding to the switching on and 
off of the interacting wave), and shall express it in terms of a coefficient of transferred 
absorption ($T$), which we have already defined. In the calculations we shall 
refer to our ideas about the absorption of waves, so that our calculations, although 
allied to those of Bailey & Martyn, will be expressed somewhat differently. The 
second part of the calculation concerns the effect of the modulation frequency and 
is exactly that previously given by Bailey & Martyn. For the sake of completeness 
it is included, with our terminology, in appendix A. 

We shall use the following nomenclature, in which we adhere to that used by 
Appleton in his theories of the ionosphere: 
$c$ = velocity of light. 
$e$ = electronic charge. 
$f$ = modulation frequency. 
$F$ = amplitude of received wave. 
$F_0$ = amplitude of received wave in absence of attenuation. 
$\Delta F$ = decrease in amplitude of received wave. 
$G$ = proportion of energy lost by an electron in a collision. 
$k$ = Boltzmann’s constant. 
$l$ = mean free path of electron. 
$m$ = mass of the electron. 
$N$ = number of electrons per c.c. 
$p$ = angular frequency of the carrier. 
$p_L = H_L e/mc$ = angular gyro-frequency of electrons in the longitudinal component 
of the earth’s magnetic field. 
$p_T = H_T e/mc$ = angular gyro-frequency of electrons in the transverse component 
of the earth’s magnetic field. 
$P$ = power per unit area in the wave. 
$T$ = absolute temperature. 
$u$ = velocity of the electron. 
$U$ = gas-kinetic energy of the electron. 
$\Delta U$ = increase in gas-kinetic energy in presence of wave. 
$W$ = work done by the wave on each electron per second. 
$x$ = distance along wave normal. 
$\alpha$ = a constant given by relation $\alpha\nu = \rho$. 
$\epsilon_0$ = permittivity of free space. 
$\kappa$ = absorption coefficient of wave per unit path. 
$\nu$ = frequency of collision of electrons with molecules. 
$\Delta\nu$ = change in collision frequency due to incident field. 
$\rho$ = attenuation of wave in ‘nepers’. 

* The exigencies of the broadcasting service, in conjunction with the use of Double British 
Summer Time, rendered it impossible to do these experiments except near midsummer. 


(a) The magnitude of wave-interaction 

The theory of ionospheric absorption is considerably complicated by the presence 
of the earth’s magnetic field, and for simplicity we first consider what would happen 
in the absence of that field. Investigations of absorption of radio waves in the 
ionosphere (Farmer & Ratcliffe 1935) have led to the view that the waves are most 
strongly absorbed at a level where the refractive index does not depart much from 
unity, and that then the absorption coefficient is given by: 

$$\kappa = \{2\pi N e^2/\epsilon_0 c m\}\{\nu/(p^2 + \nu^2)\}. \qquad (1)$$ 

There are also reasons for thinking that all the important absorption occurs at 
levels where $p^2 \gg \nu^2$, so that we may write approximately 

$$\kappa = \{2\pi N e^2/\epsilon_0 c m\}\{\nu/p^2\}. \qquad (2)$$ 

As it travels through the ionosphere the interacting wave is absorbed at each level 
as given by this expression, and the power lost by it represents a ‘heating up’ of 
the ionospheric electrons and an increase in their velocity. If $P$ represents the power 
per unit area in the wave at any given level in the ionosphere, then it is absorbed near 
that level according to the law $P = P_0 e^{-2\kappa x}$ so that the power absorbed in a slab of 
unit cross-section and of thickness $dx$ is given by 

$$-\frac{dP}{dx}\,dx = 2\kappa P\,dx.$$ 

But this slab contains $N\,dx$ electrons, so that the work ($W$) done per second on each 
electron is given by 

$$W = 2\kappa P/N,$$ 

and substitution from equation (2) gives us 

$$W = (4\pi e^2/\epsilon_0 m c)(\nu/p^2)P. \qquad (3)$$ 

This work increases the gas-kinetic energy of the electron from the value $U$, 
which is in equilibrium with the gas molecules, to the value $U + \Delta U$. It is known 
from theory and from laboratory experiments that the average energy lost per 
collision by an electron under these conditions is proportional to its excess energy, 
and can be represented by $G\,\Delta U$ where $G$ is a constant which has been estimated by 
Townsend and his school from experiments on the diffusion of slow electrons in 
gases (Townsend 1925). When equilibrium conditions have been set up under the 
application of a steady wave we can therefore express the fact that the work done 
on each electron per second is equal to the energy lost in the $\nu$ collisions made per 
second by writing 

$$W = \nu G\,\Delta U. \qquad (4)$$ 

We now wish to express energies in terms of collisional frequencies, so we relate the 
electron’s velocity $u$ to its collisional frequency $\nu$ and its mean free path $l$ by the 
expression $u = \nu l$. We then have $U = \tfrac{1}{2}mu^2 = \tfrac{1}{2}ml^2\nu^2$ and $\Delta U = ml^2\nu\,\Delta\nu$, where $\Delta\nu$ 
represents the increase of $\nu$ due to the interacting wave. Substitution in equation (4) 
leads to 

$$\Delta\nu = W/ml^2\nu^2 G. \qquad (5)$$ 

We now consider the passage of the wanted wave through the region which has 
been ‘heated up’ by the interacting wave. At each level in the ionosphere the 
absorption coefficient of the wanted wave is given by equation (2) and, if we assume 
that all the important absorption occurs near a fixed level, so that $\nu$ is roughly 
constant over the absorbing region, we may write, for the received amplitude of the 
wanted wave, 

$$F = F_0 e^{-\rho} = F_0 e^{-\alpha\nu}, \qquad (6)$$ 

where $\rho$ represents the attenuation of the wave measured in ‘nepers’ and $\alpha\nu = \rho$. 
If now the presence of an interacting wave increases the frequency of collision from 
$\nu$ to $\nu + \Delta\nu$, the received amplitude is reduced to the value 

$$F' = F_0 e^{-\alpha(\nu + \Delta\nu)},$$ 

and, if $\alpha\Delta\nu \ll 1$, we may expand the factor $e^{-\alpha\Delta\nu}$ and obtain 

$$F' = F_0 e^{-\alpha\nu}(1 - \alpha\Delta\nu) = F(1 - \alpha\Delta\nu).$$ 

If $\Delta F$ represents the decrease in the amplitude, $F$, of the wanted wave we therefore 
have 

$$\Delta F/F = \alpha\Delta\nu, \qquad (7)$$ 

or, after substitution from equations (3) and (5), 

$$T = \Delta F/F = \alpha(4\pi e^2/\epsilon_0 c m^2)(P/p^2)(l^2 G\nu)^{-1}. \qquad (8)$$ 

We shall call the quantity $\Delta F/F$ the ‘coefficient of transferred absorption’ and shall 
write it as $T$. It represents the magnitude of the interaction effect at zero frequency. 
It is shown in appendix A that the coefficient ($m$) of modulation transferred from the 
interacting to the wanted wave at modulation frequency $f$ is given by 

$$m = T\{2\mu/(1 + \tfrac{1}{2}\mu^2)\}\{1 + 4\pi^2 f^2\tau^2\}^{-1/2}, \qquad (20a)$$ 

where $\mu$ is the coefficient of modulation of the interacting wave and $\tau$ is the relaxation 
time of the ionospheric electrons. 

It will sometimes be more convenient to substitute for $\alpha$ from equation (6) and 
write the fractional decrease of amplitude in the form 

$$T = \Delta F/F = (\rho/\nu)(4\pi e^2/\epsilon_0 c m^2)(P/p^2)(l^2 G\nu)^{-1}. \qquad (9)$$ 

Another useful form of this expression containing the electron’s velocity $u$ is obtained 
by using the relation $u = \nu l$ and we then obtain 

$$T = \Delta F/F = \rho(4\pi e^2/\epsilon_0 c m^2)(P/p^2)(Gu^2)^{-1}, \qquad (10)$$ 

or, with $\tfrac{1}{2}mu^2 = \tfrac{3}{2}kT$, 

$$T = \Delta F/F = \rho(4\pi e^2/\epsilon_0 c m)(P/p^2)(3GkT)^{-1}. \qquad (11)$$ 
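
That forms (9), (10) and (11) are algebraically equivalent can be checked numerically. The sketch below uses arbitrary positive numbers with no physical units implied; all names and values are illustrative assumptions, and `prefactor` stands for the group $4\pi e^2/\epsilon_0 c m^2$.

```python
import math


def t_eq9(rho, nu, l, G, prefactor, P, p):
    """Equation (9): T = (rho/nu) * prefactor * (P/p^2) * (l^2 G nu)^-1."""
    return (rho / nu) * prefactor * (P / p ** 2) / (l ** 2 * G * nu)


def t_eq10(rho, u, G, prefactor, P, p):
    """Equation (10): T = rho * prefactor * (P/p^2) * (G u^2)^-1."""
    return rho * prefactor * (P / p ** 2) / (G * u ** 2)


def t_eq11(rho, kT, G, prefactor_times_m, P, p):
    """Equation (11): T = rho * (prefactor * m) * (P/p^2) * (3 G kT)^-1."""
    return rho * prefactor_times_m * (P / p ** 2) / (3.0 * G * kT)


# Arbitrary illustrative values (not physical data).
rho, nu, l, G, m = 3.0, 2.0e5, 0.5, 2.0e-3, 9.0e-28
prefactor, P, p = 1.0e-3, 4.0, 1.2e6

u = nu * l               # relation u = nu * l
kT = m * u ** 2 / 3.0    # from (1/2) m u^2 = (3/2) k T

t9 = t_eq9(rho, nu, l, G, prefactor, P, p)
t10 = t_eq10(rho, u, G, prefactor, P, p)
t11 = t_eq11(rho, kT, G, prefactor * m, P, p)
print(t9, t10, t11)
```

The three printed values agree to rounding error, as they must if the substitutions $u = \nu l$ and $\tfrac{1}{2}mu^2 = \tfrac{3}{2}kT$ have been carried through consistently.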

Equation (7) may be rewritten 

$$\Delta F/F = \alpha\Delta\nu = \rho\,\Delta\nu/\nu.$$ 

Measurement shows that when the interacting wave has a frequency of 200 kcyc./sec. 
and comes from a 100 kW sender at a distance of 150 km. $\Delta F/F$ is of the order of 
0.05 and with $\rho \approx 3$ we find $\Delta\nu/\nu$ is of the order 0.01. Hence the incidence of such 
a wave on the ionosphere increases the collisional frequency by about 1 %. It 
follows that the effective temperature of the electrons increases by about 2 % when 
this wave is incident. 
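
The order-of-magnitude estimate may be reproduced in a few lines. This is a sketch under the assumptions that the mean free path is unchanged, so that the temperature varies as $u^2$ and hence as $\nu^2$, and that $\rho$ is about 3; the function names are illustrative.

```python
def fractional_collision_increase(delta_f_over_f, rho):
    """Delta(nu)/nu from Delta(F)/F = rho * Delta(nu)/nu."""
    return delta_f_over_f / rho


def fractional_temperature_increase(dnu_over_nu):
    """Fractional rise in electron temperature, assuming T varies as nu**2
    (u = nu * l with the mean free path l held fixed)."""
    return (1.0 + dnu_over_nu) ** 2 - 1.0


dnu = fractional_collision_increase(0.05, 3.0)
dT = fractional_temperature_increase(dnu)
print(f"Delta(nu)/nu = {100.0 * dnu:.1f} %,  Delta(T)/T = {100.0 * dT:.1f} %")
```

For small changes the temperature rise is twice the fractional change in collision frequency, which is the factor of two quoted in the text.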


(b) Effect of earth's magnetic field 

To include the effect of the earth’s magnetic field in the calculations is, in general, 
complicated, but the salient points can be seen simply in terms of Appleton’s (1932) 
well-known magneto-ionic theory. The most important modification is to the 
absorption coefficient of the interacting wave as given by equation (2). This takes 
two forms according as the ‘quasi-longitudinal’ or the ‘quasi-transverse’ approximation 
(Booker 1935) is made, and for each approximation we have to consider 
separately the absorption of the ‘ordinary’ and the ‘extraordinary’ wave of the 
magneto-ionic theory. The results of this consideration are shown in table 5, which 
lists factors which replace $p^{-2}$ in equation (2) when the magnetic field is included. 

We first note that, for all directions of transmission, the extraordinary wave is 
strongly absorbed near the gyro-frequency ($p = p_L$ or $p_H$) so that we should expect 
to find a marked increase in the magnitude of wave-interaction when the frequency 
of the interacting wave approaches this value. Bailey (1937a, b and c) has previously 
discussed this phenomenon theoretically and observed it experimentally, and has 
named it ‘gyro-interaction’. We are not further concerned with gyro-interaction 
in this paper. 

We next note, from table 5, that as the frequency $p$ of the interacting wave is
decreased, the factor increases until, for cases (b), (c) and (e), $p$ is somewhat less
than the gyro-frequency, and then tends to a limiting value determined by the
magnitude of $p_L$ or $p_T$, or, roughly, the magnitude of $p_H$. In the case, however, of
the ordinary wave travelling in a direction which satisfies the quasi-transverse
approximations (case (d)), the factor, and therefore the magnitude of the wave-interaction,
continues to increase as the frequency is decreased. We should therefore
expect to find the magnitude of the interaction produced by a low-frequency wave
to depend on the direction of propagation of that wave.


Table 5. Factor to replace the factor $p^{-2}$ in equation (2)

       approximation                               factor

(a)    no steady magnetic field                    $p^{-2}$
(b)    quasi-longitudinal, ordinary wave           $(p + p_L)^{-2}$
(c)    quasi-longitudinal, extraordinary wave      $(p - p_L)^{-2}$
(d)    quasi-transverse, ordinary wave             $p^{-2}$
(e)    quasi-transverse, extraordinary wave        complicated expressions which can
                                                   be shown to give a maximum
                                                   absorption at the gyro-frequency,
                                                   and for low frequencies to tend
                                                   to $p_T^{-2}$
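
The behaviour of the factors in table 5 can be sketched numerically for cases (a) to (d). The value taken below for $p_L$ (about $8.2 \times 10^{6}$ radians per sec., i.e. a gyro-frequency of roughly 1.3 Mcyc./sec.) is an assumption made purely for illustration and is not quoted in the text; the function name is likewise hypothetical. Case (e) is omitted because no closed expression is given for it.

```python
import math

P_L = 8.2e6  # assumed angular gyro-frequency component, rad/sec (illustrative)


def absorption_factor(case, p, p_l=P_L):
    """Factor replacing p**-2 in equation (2), for cases (a)-(d) of table 5."""
    if case in ("a", "d"):
        return p ** -2
    if case == "b":
        return (p + p_l) ** -2
    if case == "c":
        return (p - p_l) ** -2
    raise ValueError("only cases 'a', 'b', 'c' and 'd' have a simple factor")


for f_kc in (1013.0, 200.0, 90.2, 68.0):
    p = 2.0 * math.pi * f_kc * 1e3
    row = [absorption_factor(c, p) for c in ("b", "c", "d")]
    print(f_kc, ["%.2e" % value for value in row])
```

With these assumptions the factors for cases (b) and (c) level off towards $p_L^{-2}$ as the frequency falls, whereas that for case (d) keeps growing, which is the contrast drawn in the text.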


The results of this theoretical section as summarized in equations (8), (9), (10)
and (11), which give the coefficient of transferred absorption in the absence of a
magnetic field, and table 5, which gives the effect of introducing a magnetic field,
will be used in the next section to discuss the results of the measurements described
in §5.


7. Discussion of results 
(a) The determination of $\nu$ from $G\nu$

We have previously given, in tables 2 and 3, the heights of the regions of interaction,
and the measured values of the electronic relaxation time $\tau$ in these regions.
It is shown in appendix A, equation (21), that $\tau = 1/G\nu$, so that as a result of the
experiments the value of $G\nu$ is known at a known height, and if the value of $G$ can
be assumed $\nu$ can be determined. The best value of $G$, deduced from measurements
in the laboratory, is taken by Bailey (1937a) to be $2.6 \times 10^{-3}$, and this, with the
results of tables 2 and 3, gives values for $\nu$ ranging from $2.1 \times 10^{5}$ to $11 \times 10^{5}$ with
a mean value $5 \times 10^{5}$ sec.$^{-1}$. The measured heights of the interaction vary from 75
to 97 km. with a mean of 86.4 km. It is clear from the tables that there is considerable
variation in the measured heights and relaxation times, and some of this variation
is undoubtedly due to experimental errors. A phase experiment of the type discussed
in §4 takes about 20 min. with the present technique, and since the complications
discussed in appendix B undoubtedly occur we do not feel justified in drawing more
than general conclusions from the results.
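
The conversion between relaxation time and collision frequency used here is a one-line calculation; the sketch below merely makes it explicit, assuming the relation $\tau = 1/G\nu$ of appendix A and the laboratory value $G = 2.6 \times 10^{-3}$ quoted above. The function names are illustrative.

```python
G = 2.6e-3  # fractional energy loss per collision, value quoted from Bailey


def collision_frequency(tau, g=G):
    """Collision frequency nu (per sec.) from relaxation time tau (sec.)."""
    return 1.0 / (g * tau)


def relaxation_time(nu, g=G):
    """Relaxation time tau (sec.) from collision frequency nu (per sec.)."""
    return 1.0 / (g * nu)


# The mean collision frequency of 5e5 per sec. corresponds to a
# relaxation time a little under one millisecond.
tau_mean = relaxation_time(5e5)
print(f"tau = {tau_mean:.2e} sec.")
```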

The broad conclusion, that $\nu$ has the value $5 \times 10^{5}$ sec.$^{-1}$ at a level of about 85 km.,
is in good accord with other data. In particular it agrees with that previously
found by Bailey (1935), who based his results on the measurements which van
der Pol & van der Mark (1935) made on the amplitude of the transferred
modulation.

(b) Comparison of theoretical and experimental values 
of the coefficients of transferred absorption 

The values of the ‘equivalent coefficients of transferred absorption’ calculated 
from equation (11), with suitable substitutions from table 5, are shown in table 6 
together with the observed values extracted from table 4. The value taken for p in 
the calculations is derived from observations of the pulses emitted by the wanted 
station. 


Table 6. Equivalent coefficients of transferred absorption for 100 kW
spherically radiated by an interacting station distant 150 km. from
the point of interaction

Calculations are from equation (11) and table 5, with the following values inserted:

$\rho = 3$, corresponding to a reflexion coefficient of 0.05 for the wanted wave at
oblique incidence.

$(2\pi e^{2}/\epsilon_{0} c m) = 5.3 \times 10^{-2}$ cm.$^{2}$ sec.$^{-1}$.

$P = 3.5 \times 10^{-4}$ erg cm.$^{-2}$ sec.$^{-1}$ for 100 kW spherically radiated to a distance of
150 km.

$G = 2.6 \times 10^{-3}$.

$T = 200°$ K.

$k = 1.37 \times 10^{-16}$ erg deg.$^{-1}$.


                                        calculated

frequency of        extraordinary,                       ordinary
interacting wave    quasi-longitudinal and       quasi-                  quasi-
(kcyc./sec.)        approx. quasi-transverse     longitudinal            transverse

1013                $1.4 \times 10^{-1}$         $2.4 \times 10^{-3}$    $1.3 \times 10^{-2}$
200                 $1.1 \times 10^{-2}$         $5.8 \times 10^{-3}$    $3.3 \times 10^{-1}$
90.2                $8.7 \times 10^{-3}$         $6.6 \times 10^{-3}$    1.6
68                  $8.4 \times 10^{-3}$         $6.9 \times 10^{-3}$    2.8


observed values from
table 4 (serial number of
experiment in brackets)

$2.5 \times 10^{-2}$ (4, 7)

$2.0 \times 10^{-2}$ (1, 3, 6, 8, 9, 10)
<5 x 10-* (2) 

<2 x 10” s (5) 
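
As a check on the quasi-transverse ordinary column of table 6, the sketch below evaluates equation (11) with the factor $p^{-2}$ and the constants listed in the table notes. It assumes that the tabulated constant is $2\pi e^{2}/\epsilon_{0}cm = 5.3 \times 10^{-2}$ in c.g.s. units, so that it must be doubled to form the $4\pi$ of equation (11); that reading of the notes, and the names used, are assumptions of this sketch.

```python
import math

RHO = 3.0          # absorption coefficient of the wanted wave
TWO_PI_E2 = 5.3e-2 # 2*pi*e^2/(eps0*c*m) in c.g.s. units (assumed reading)
POWER_FLUX = 3.5e-4  # erg cm^-2 sec^-1 (100 kW at 150 km)
G = 2.6e-3
K_BOLTZMANN = 1.37e-16  # erg per degree, value used in the paper
TEMPERATURE = 200.0     # degrees K


def transferred_absorption(freq_kc, factor=None):
    """Coefficient T of equation (11); `factor` replaces p**-2 (table 5)."""
    p = 2.0 * math.pi * freq_kc * 1e3
    if factor is None:
        factor = p ** -2  # no field, or quasi-transverse ordinary wave
    numerator = RHO * 2.0 * TWO_PI_E2 * POWER_FLUX * factor
    return numerator / (3.0 * G * K_BOLTZMANN * TEMPERATURE)


for f_kc in (1013.0, 200.0, 90.2, 68.0):
    print(f_kc, "%.2g" % transferred_absorption(f_kc))
```

Under these assumptions the computed values agree with the tabulated quasi-transverse ordinary entries to about the two figures given.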


In calculating from equation (11) we have assumed a knowledge of $G$ and of the
ionospheric temperature; we may, however, calculate from equation (9) and use the
measured value of $G\nu$ so as to avoid any assumption about the value of $G$. We then
have to assume reasonable values for $\nu$ and for the mean free path $l$ and we obtain
a result of the same order, as is indeed obvious if we remember that the measured
value of $G\nu$ agrees with accepted values of $G$ and $\nu$.



In attempting to relate the observed and experimental results we first recall 
that, in the theoretical calculations of table 6, we have assumed that the interacting 
wave arrives at the region of interaction with its power undiminished; if it suffers 
appreciable absorption before it gets there the interaction will be less than that 
calculated. 

We first consider the tabulated values of the coefficient of transferred absorption
when the frequency of the interacting wave is 200 kcyc./sec. since this represents
the case which we have studied in most detail in our experiments. We find that the
calculated values lie on each side of the observed value and differ from it by
a factor of less than 10. In view of the uncertainty concerning the precise magnitude
of $G$ this agreement seems satisfactory and we do not consider it profitable, at this
stage, to consider further the absolute magnitude of the coefficient. We do, however,
attach considerable significance to the relative magnitudes of the coefficient determined
with interacting waves of different frequency, and shall examine these in
more detail.

It is first interesting to note that the coefficient measured for experiment (1) of 
table 4 was about twice that measured in experiments (3), (6), (8), (9) and (10). 
This difference is probably attributable to the fact that, at the interaction point 
operative in the first experiment, the interacting wave was travelling more nearly 
transverse to the magnetic field, and its polarization corresponded to that of the 
‘ordinary’ wave of the magneto-ionic theory.

We next examine the figures for the two low-frequency interacting waves and
note that the observed coefficient of transferred modulation is less than 1/10 of that
observed with an interacting wave of frequency 200 kcyc./sec. We can at once reject
the possibility of interaction produced by an ordinary wave propagated with quasi-transverse
direction since this would lead to an excessive amount of interaction.
We next note that the calculated magnitudes of the interaction on this frequency
for the other directions of transmission are very little different from those calculated
for 200 kcyc./sec. We are driven to explain the much smaller observed interaction
on the lower frequencies by the assumption that the full power of the interacting
wave does not reach the region where the wanted wave is absorbed, either
because the interacting wave is reflected or is considerably absorbed below this
level.

In contrast to the case just considered it is next of special interest to inquire what
interaction occurs between two waves on nearly the same frequency, which would be
expected to be absorbed nearly at the same level in the ionosphere. Experiments
numbers 4, 9 and 10 of table 4 are of this type, and the results listed in table 6
opposite the frequencies 1013 and 200 kcyc./sec. summarize the observations. For
the frequency 1013 kcyc./sec. it is seen that the observed value is in good agreement
with the theoretical values deduced both for the extraordinary wave and for the
ordinary wave on the quasi-transverse assumption. On a frequency of 200 kcyc./sec.
the possible theoretical values embrace a wider range but even so there is no
disagreement between theory and experiment.




(c) The possibility of self-interaction 

In the foregoing sub-section it has been emphasized that the maximum amount 
of interaction can only take place if the interacting and the wanted waves are 
absorbed in the same place, and the small interaction when the two wave-lengths 
have been different has been ascribed to the fact that the regions of absorption do 
not completely overlap. It is therefore of interest to inquire what would happen if 
the two wave-lengths were exactly the same and if they were travelling in the same 
direction so that the two absorption coefficients were identical at each point. The 
wanted and the interacting wave would then be one and the same, and the phenomenon
could be called self-interaction. We believe that useful information could
be obtained by measurements of self-interaction.

(d) Effects at sunrise 

The coefficient of transferred absorption as given by equations (8), (9), (10) and
(11) is dependent on the electron density $N$ through the quantity $\rho$, which is the
absorption coefficient of the wanted wave, and is proportional to $N$; so that we might
expect the coefficient of transferred absorption to increase as $N$ increases with the
approach of day, and as the absorption of the wanted wave increases. Grosskopf
(1938) looked for an effect of this kind but did not observe it. In §5(d) we have
explained how we observed precisely the opposite effect, as illustrated in figure 13,
and it is now necessary to consider how this could happen.

We first notice that, although the increase of electron density in a certain neighbourhood
increases the total energy absorbed from the interacting wave it does not
alter the work done on each electron by that wave (see equation (3)). So far as this 
effect goes, therefore, the coefficient of transferred absorption is not altered by an 
increase of electron density. If, however, the interacting wave is separately absorbed 
at some other level before it reaches the level where the interaction occurs then, 
of course, the power P in equation (3) is decreased, and, with it, the coefficient 
of transferred absorption. Separate absorption of the interacting wave at some 
other level after it has produced interaction does not alter the interaction in 
any way. 

Turning now to consider the wanted wave, we have already noticed that an 
increase in its absorption in the region of interaction will increase the coefficient of 
transferred absorption. An increase of its absorption outside that region is simply 
equivalent to a decrease in the power emitted by the wanted sender and has no 
effect on the coefficient. 

There are therefore two separate tendencies, acting in opposite senses: 

(a) Increased absorption of the wanted wave in the region where interaction
occurs leads to an increased coefficient of transferred absorption.

(b) Increased absorption of the interacting wave before it reaches the region of
interaction leads to a decrease in the coefficient of transferred absorption.

Other changes of absorption do not alter the coefficient of transferred absorption. 




We must conclude that, during the period of sunrise, phenomenon (b) more than
compensates for phenomenon (a). A detailed discussion of the possibilities in the
light of theories of ionospheric absorption should provide useful information about 
changes in the distribution of electron density during the period of sunrise. 


8. Conclusions and future experiments 

We have shown that, if the magnitude of the quantity $G$ is assumed, measurements
of the phase of the transferred modulation lead to a knowledge of the frequency $\nu$
of collision between electrons and neutral molecules at a height which can be determined.
Our measurements have shown that the value of $\nu$ is about $5 \times 10^{5}$ sec.$^{-1}$ at
a height of 85 km. They also indicate that this is the height at which the majority
of the absorption occurs at night, and thereby confirm the suggestion that the main
absorption does not occur at the top of the trajectory. In this respect our results
differ from those of Bailey & Martyn (1934).

Measurements of the absolute magnitude of the transferred modulation during the
night showed reasonable agreement with theory in the cases where the wanted and
interacting waves were near the same frequency, e.g. experiments with serial
numbers 4 (1 Mcyc./sec.) and 9 and 10 (200 kcyc./sec.) (table 4). Since we expect the
absorption of two waves of the same frequency to occur near the same level we take
the agreement as providing additional evidence for the correctness of the assumed
value for $G$ and the value deduced for $\nu$.

We also find reasonable agreement with theory when one wave has a frequency
about 1 Mcyc./sec. and the other about 200 kcyc./sec. (experiments 1, 3, 6, 7 and 8).
This leads us to believe that the waves of these two frequencies are absorbed near 
the same levels during the night. 

With interacting waves of frequencies 90 and 68 kcyc./sec. the interaction is
considerably less than would be expected and it appears that these waves must be
reflected from heights less than 85 km., the level of absorption of the waves of higher
frequency.

We deduce from the decrease of interaction observed when dawn approaches that
the levels of absorption for 200 kcyc./sec. and 1 Mcyc./sec. separate considerably,
although they had been roughly coincident during the night. This is presumably 
due to the establishment during the dawn period of a less sharp gradient of ionization 
in the region of 85 km. 

The measuring apparatus used in these experiments has intentionally been kept
simple, and although well suited for establishing the main principles of the methods,
it suffered from the disadvantage that it required about half an hour to make a complete
phase experiment. During this time ionospheric conditions may have changed, so
that no great accuracy is claimed for the values of $G\nu$ or of height given in tables 2
and 3. We intend to modify the apparatus so that measurements can be made while
the modulation frequency is changed quickly.




In addition to those organizations whose help has been acknowledged in § 2, we 
wish to express our thanks to several others for help they have given. We wish to 
thank the Department of Scientific and Industrial Research which has supported 
the work with a generous grant. One of us (I. J. S.) is indebted to the Government of 
New Zealand for a scholarship which has enabled him to work in the Cavendish 
Laboratory on this subject. We are specially indebted to Mr S. W. H. W. Falloon
for considerable help with the apparatus and with the observations, to Mr K. Weekes 
and Mr J. W. Findlay for help in the observations of the pulse transmissions, and 
to Dr L. G. H. Huxley for allowing us to see the results of some observations which 
he made at Birmingham on the transmissions described in this paper. 


Appendix A 

The magnitude of wave-interaction as a function of modulation frequency

In this appendix we extend the discussion of §6 to include the case of a modulated
interacting wave. The analysis was given in the original paper of Bailey & Martyn
(1934b); we merely repeat it here in order to relate it to the nomenclature which we
have used.

If there is a temporal variation in the power, $W(t)$, expended on each electron
by the interacting wave, then the equation of energy balance (equation (4) of §6(a))
of the electrons becomes

$$W(t) = \frac{d}{dt}(\Delta U) + \nu G\,\Delta U, \qquad (12)$$

or, if we substitute for $\Delta U$ in terms of $\Delta\nu$ as in §6(a),

$$W(t)/m l^{2}\nu = \frac{d}{dt}(\Delta\nu) + \nu G\,\Delta\nu. \qquad (13)$$


The power, $W(t)$, expended on each electron per second is related to the power-flux
$P$ in the interacting wave by the expression

$$W(t) = C\,P(t),$$

where

$$C = (4\pi e^{2}/\epsilon_{0} m c)\,(\nu/p^{2}) \qquad (14)$$

(cf. equation (3)) and we shall write

$$P(t) = P_0(1 + M\cos\omega t) \qquad (15)$$

to represent a modulated wave.

If we now substitute from equations (14) and (15) into (13) and solve for $\Delta\nu$
we obtain

$$\Delta\nu = (CP_0/m l^{2}\nu^{2}G)\,[1 + M\{1 + (\omega/G\nu)^{2}\}^{-1/2}\cos(\omega t - \phi)], \qquad (16)$$

where $\phi = \tan^{-1}(\omega/G\nu)$.

Substitution of this result into equation (7) together with expression (14) for $C$ gives

$$\Delta F/F = \rho\,(4\pi e^{2}/\epsilon_{0} c m^{2})\,(P_0/p^{2})\,(l^{2}\nu\,G\nu)^{-1}\,[1 + M\{1 + (\omega/G\nu)^{2}\}^{-1/2}\cos(\omega t - \phi)]. \qquad (17)$$





If the wave amplitude is sinusoidally modulated with a coefficient $\mu$ there will be
an octave component in the modulation of the power, but if this is neglected (as it
was in our experiments) and we consider only the fundamental component we have

$$M = 2\mu/(1 + \tfrac{1}{2}\mu^{2}). \qquad (18)$$

It should be noted that, although $\mu$ is always less than unity, $M$ may be greater than
unity. In our experiments, for example, $\mu$ was 0.8 and $M$ was therefore 1.21.
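
The relation between the amplitude-modulation coefficient and the power-modulation coefficient can be verified numerically. The sketch below assumes that the power is proportional to the square of the amplitude $(1 + \mu\cos\omega t)$ and extracts the fundamental component relative to the mean power; the function names are illustrative.

```python
import math


def power_modulation_coefficient(mu):
    """Closed form M = 2*mu / (1 + mu**2 / 2) for amplitude modulation mu."""
    return 2.0 * mu / (1.0 + 0.5 * mu * mu)


def numerical_fundamental(mu, samples=1000):
    """Fundamental Fourier component of (1 + mu*cos x)**2 over its mean."""
    total = 0.0
    fundamental = 0.0
    for k in range(samples):
        x = 2.0 * math.pi * k / samples
        power = (1.0 + mu * math.cos(x)) ** 2
        total += power
        fundamental += power * math.cos(x)
    mean = total / samples
    return (2.0 * fundamental / samples) / mean


print(round(power_modulation_coefficient(0.8), 2))
print(round(numerical_fundamental(0.8), 2))
```

For $\mu = 0.8$ both routes give a value close to the 1.21 quoted in the text, the octave term having been discarded in each case.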

If we combine expression (18) with expression (17) and substitute from equation
(8) for the coefficient of transferred absorption $T$, we find that

$$\Delta F/F = T[1 + \{2\mu/(1 + \tfrac{1}{2}\mu^{2})\}\{1 + (\omega/G\nu)^{2}\}^{-1/2}\cos(\omega t - \phi)]. \qquad (19)$$

Equation (19) shows that, if $T \ll 1$, the wanted wave is modulated with a modulation
coefficient $m$ given by

$$m = T\{2\mu/(1 + \tfrac{1}{2}\mu^{2})\}\{1 + (\omega/G\nu)^{2}\}^{-1/2}, \qquad (20)$$

and the phase of this transferred modulation lags on that of the interacting modulation
by the angle $\phi = \tan^{-1}(\omega/G\nu)$.
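
The dependence of the transferred modulation on the modulation frequency can be sketched as follows. The function below evaluates the modulation coefficient and phase lag of the preceding paragraph; the numerical inputs in the example ($T = 0.02$, $G\nu = 1300$ sec.$^{-1}$) are illustrative assumptions of the order of the values discussed in the paper, and the function name is hypothetical.

```python
import math


def transferred_modulation(T, mu, omega, g_nu):
    """Return (m, phi): transferred modulation coefficient and phase lag.

    T     : steady coefficient of transferred absorption (T << 1 assumed)
    mu    : amplitude-modulation coefficient of the interacting wave
    omega : angular modulation frequency, rad/sec
    g_nu  : product G*nu, per sec. (reciprocal of the relaxation time)
    """
    M = 2.0 * mu / (1.0 + 0.5 * mu * mu)
    x = omega / g_nu
    m = T * M / math.sqrt(1.0 + x * x)
    phi = math.atan(x)
    return m, phi


for f_mod in (50.0, 200.0, 1000.0):
    m, phi = transferred_modulation(0.02, 0.8, 2.0 * math.pi * f_mod, 1300.0)
    print(f_mod, "%.4f" % m, "%.1f deg" % math.degrees(phi))
```

The amplitude falls and the lag grows towards 90° as the modulation frequency is raised, and the lag equals 45° when $\omega = G\nu$, which is what allows $G\nu$ to be read off from a phase experiment.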

The quantity $1/G\nu$ plays an important part in the phenomena discussed in this
appendix and it is of interest to give it a physical meaning. Reference to equation
(12) shows that, if the interacting wave is suddenly removed the electrons will lose
their excess energy $\Delta U$ as given by

$$\frac{d}{dt}(\Delta U) + \nu G\,\Delta U = 0,$$

or

$$\Delta U = (\Delta U)_0\,e^{-G\nu t} = (\Delta U)_0\,e^{-t/\tau}, \qquad (21)$$

where $\tau = 1/G\nu$ can be called the ‘relaxation time’ of the electrons. We shall often
use equation (20) written as

$$m = T\{2\mu/(1 + \tfrac{1}{2}\mu^{2})\}\{1 + \omega^{2}\tau^{2}\}^{-1/2}. \qquad (20a)$$

Appendix B 

Effects of transmission over multiple paths 

In this appendix we consider how the following complicating processes will 
modify the amplitude and phase of the transferred modulation measured at the 
receiver: 

(a) Two ionospheric waves reach the receiver by different paths and each has 
modulation transferred to it at some point along its trajectory. We shall restrict 
ourselves to the case where one of the waves is much weaker than the other. 

(b) A single ionospheric wave has modulation transferred to it at two different
points along its trajectory. We shall restrict ourselves to the case where one modulation
is much weaker than the other.










and we then find for the resultant e.m.f. the expression

$$\left[A_1^{2}(1 + m_1\cos\theta_1)^{2} + A_2^{2}(1 + m_2\cos\theta_2)^{2} + 2A_1A_2(1 + m_1\cos\theta_1)(1 + m_2\cos\theta_2)\cos\psi\right]^{1/2}\cos(pt + \chi), \qquad (24)$$

where $\chi$ is a phase angle depending on the other variables. If we assume $m_1$ and
$m_2 \ll 1$, the amplitude $A$ of this resultant radio-frequency e.m.f. may now be
expressed as

$$A = A_0\left[1 + (A_1/A_0)^{2}\,m_1\cos\theta_1 + (A_2/A_0)^{2}\,m_2\cos\theta_2 + (A_1A_2/A_0^{2})(m_1\cos\theta_1 + m_2\cos\theta_2)\cos\psi\right], \qquad (25)$$

where

$$A_0^{2} = A_1^{2} + A_2^{2} + 2A_1A_2\cos\psi.$$

Figure 15. To illustrate meaning of equation (25).

The last three terms inside the bracket represent the modulation of the wave
which, in general, may be quite complicated. In order to simplify matters we shall
assume that $A_2$ is small compared with $A_1$, so that the expression in the bracket
reduces approximately to

$$1 + m_1\cos\theta_1 + (A_2/A_1)\,m_2\cos\psi\cos\theta_2. \qquad (26)$$

The modulation then consists of two superimposed sinusoidal modulations with
amplitudes $m_1$ and $(A_2/A_1)\,m_2\cos\psi$ and with a phase difference given by

$$\theta_1 - \theta_2 = \omega(d_2 - d_1)/c.$$








The resultant of these two simultaneous modulations is most simply visualized by
a vector diagram such as that of figure 16 in which the vectors are supposed to
rotate with the frequency of the modulation. It is then clear that the phase of the
resultant modulation will differ by the angle $\beta$ from the phase of the modulation
due to $A_1$ alone. The magnitude of this phase difference depends on the quantity
$A_2 m_2\cos\psi/A_1 m_1$ and on the phase difference $(\theta_1 - \theta_2)$ between the modulations
received separately by the two paths. Small changes in the ionosphere will cause
$\psi$ to assume a series of random values and will lead to changes in $A_1$ and $A_2$; as these
changes occur the amplitude and phase of the received modulation will assume
values represented by vectors in the shaded triangle $ABC$. In any experiments,
therefore, there will be an element of random ‘scatter’ amongst a set of points plotted
to represent the phase or amplitude of the modulation. If the modulation frequency
is altered as the experiments proceed, the magnitude of $\theta_1 - \theta_2 = \omega(d_2 - d_1)/c$ will
alter, and when $\theta_2 - \theta_1 = 0$ the variations of $A_2 m_2\cos\psi/A_1 m_1$ will not affect the
phase of the resultant modulation but will have maximum effect on the amplitude,
whereas when $\theta_2 - \theta_1 = \tfrac{1}{2}\pi$ there will be a marked effect on the phase, but little on
the amplitude. If interaction occurred, in the ideally simple way, at one point on
one ray, the theories which we have previously given would lead us to expect the
plots of experimental results to fall on curves typified by the solid lines of figure 16.
The occurrence of interaction on two transmission paths, between which there was
a variable phase, would cause the plots of experimental results to be scattered
about these lines, in such a way as to lie always in the regions shown shaded.

Figure 16. The expected effect of interaction on two simultaneously-received downcoming
waves. The solid line represents the theory for interaction at one point on a single downcoming
wave. For interaction on each of two downcoming waves the resulting curves lie somewhere
inside the shaded regions, the precise position depending upon the radio-frequency phases
of the two waves.

Figure 17. The expected effect of simultaneous interaction at two points on a single downcoming
wave. The dotted lines represent the theory for interaction at one point, and the solid
lines the results for interaction at two points.
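
The vector addition described above can be sketched with complex numbers. The code assumes the simplified form in which a strong modulation of amplitude $m_1$ is combined with a weak one of amplitude $(A_2/A_1)\,m_2\cos\psi$ lagging by $\delta = \theta_1 - \theta_2$; the function name, argument names and numerical values are illustrative assumptions.

```python
import cmath
import math


def resultant_modulation(m1, m2, amplitude_ratio, psi, delta):
    """Amplitude and phase (relative to the strong modulation) of the sum.

    amplitude_ratio : A2/A1, assumed small
    psi             : radio-frequency phase difference between the two waves
    delta           : theta_1 - theta_2, the modulation phase difference
    """
    weak = amplitude_ratio * m2 * math.cos(psi)
    total = m1 + weak * cmath.exp(-1j * delta)
    return abs(total), cmath.phase(total)


# Scatter produced by a varying radio-frequency phase psi at fixed delta.
for psi_deg in (0, 60, 120, 180):
    amp, beta = resultant_modulation(0.02, 0.02, 0.3, math.radians(psi_deg),
                                     math.pi / 2.0)
    print(psi_deg, "%.4f" % amp, "%.1f deg" % math.degrees(beta))
```

With $\delta = 0$ the weak term changes only the amplitude, and with $\delta = \tfrac{1}{2}\pi$ it changes mainly the phase, in line with the two limiting cases mentioned in the text.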

To consider case (b), where a single ionospheric wave has modulation transferred
to it at two different points along its trajectory, we use precisely the same analysis
as before and put $A_1 = A_2$, $m_1 \gg m_2$, and keep $\psi$ constant. If, now, $A_2/A_1$ also
remained constant we should find our experimental points lying on a curve such as
that of figure 17. There may be some scatter around this curve as a result of changes
in the magnitude of $A_2/A_1$.

We now consider case (c), where a single ionospheric wave, with modulation
transferred to it at a single point in its trajectory, is received simultaneously with
a ground wave which carries no transferred modulation. Our previous expressions
represent this case if we take $A_1 \gtrless A_2$ and $m_2 = 0$, and equation (25) then becomes

$$A = A_0\left[1 + \{(A_1/A_0)^{2} + (A_1A_2/A_0^{2})\cos\psi\}\,m_1\cos\theta_1\right].$$

This expression shows that the phase of the transferred modulation is not affected
by the presence of the ground wave except that it may be completely reversed if $\cos\psi$
is negative and $|A_2\cos\psi| > A_1$. It is important to remember this possibility when
interpreting the absolute phase of the transferred modulation in the presence of
a relatively strong ground wave received from the wanted transmitter.
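
The condition for phase reversal can be illustrated with a short sketch. It evaluates the factor multiplying $m_1\cos\theta_1$ when $m_2 = 0$, namely $(A_1/A_0)^{2} + (A_1A_2/A_0^{2})\cos\psi$ with $A_0^{2} = A_1^{2} + A_2^{2} + 2A_1A_2\cos\psi$; the function name and the numerical values are illustrative assumptions.

```python
import math


def ground_wave_factor(a1, a2, psi):
    """Factor multiplying m1*cos(theta1) in the presence of a ground wave.

    a1  : amplitude of the ionospheric wave carrying the modulation
    a2  : amplitude of the unmodulated ground wave
    psi : radio-frequency phase difference between the two waves
    A negative result means the transferred modulation appears reversed.
    """
    a0_squared = a1 * a1 + a2 * a2 + 2.0 * a1 * a2 * math.cos(psi)
    if a0_squared == 0.0:
        raise ValueError("the two waves cancel exactly; factor undefined")
    return (a1 * a1 + a1 * a2 * math.cos(psi)) / a0_squared


print(ground_wave_factor(1.0, 3.0, 0.0))      # waves in phase: positive
print(ground_wave_factor(1.0, 3.0, math.pi))  # strong ground wave in antiphase
```

The sign changes exactly when $A_2\cos\psi < -A_1$, which is the reversal condition stated above.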


References 

Appleton, E. V. 1932 J. Instn Elect. Engrs, 71, 430.

Appleton, E. V. 1938 U.R.S.I. Proc. (Venice and Rome), 5, 23.

Bailey, V. A. 1937a Phil. Mag. 23, 929.

Bailey, V. A. 1935 Nature, 135, 585.

Bailey, V. A. 1937b Nature, 139, 68.

Bailey, V. A. 1937c Nature, 139, 838.

Bailey, V. A. & Martyn, D. F. 1934a Nature, 133, 218.

Bailey, V. A. & Martyn, D. F. 1934b Phil. Mag. 18, 369.

Bäumler, M. & Pfitzer, W. 1935 Hochfrequenztech. u. Elektroakust. 46, 181.

Booker, H. G. 1935 Proc. Roy. Soc. A, 150, 267.

Farmer, F. T. & Ratcliffe, J. A. 1935 Proc. Roy. Soc. A, 151, 370.

Grosskopf, J. 1938 Hochfrequenztech. u. Elektroakust. 51, 18.

Huxley, L. G. H., Foster, H. G. & Newton, C. C. 1947 Nature, 159, 300.

van der Pol, B. 1934 U.R.S.I. Proc. (London) 4.

van der Pol, B. 1935 Tijdschr. Ned. Radiogenoot. 7, 93.

van der Pol, B. & van der Mark, J. 1935 Tijdschr. Ned. Radiogenoot. 7, 12.

Townsend, J. S. 1925 Motions of electrons in gases. Oxford University Press.



The excitation of nuclei by electrons 


By I. N. Sneddon and B. F. Touschek
Department of Natural Philosophy, the University of Glasgow

(Communicated by P. I. Dee, F.R.S. Received 30 November 1947)


The cross-section for the inelastic scattering of electrons by atomic nuclei is calculated
assuming the process to arise from the electromagnetic interaction between electrons and
protons. For fast electrons scattered by cadmium the cross-section so obtained is in fair
agreement with recent experimental estimates. Difficulties, due to the failure of Born's
approximation, arise when an attempt is made to apply the theory to the explanation of the
electron excitation curves recently observed by Wiedenbeck.


1. Introduction 

The purpose of this paper is to attempt the calculation of the cross-section of the
inelastic scattering of electrons by nuclei. This process may be symbolically represented
by the equation

$$e + (Z, A) \rightarrow e' + (Z, A)^{*}, \qquad (1)$$

where $e$ and $e'$ denote the incoming and the scattered electrons respectively and $(Z, A)$
denotes a nucleus of atomic weight $A$ and atomic number $Z$. The asterisk denotes
an excited state of the nucleus.

The process is assumed to arise from the electromagnetic interaction between the
electron and the nuclear charge. For the determination of this cross-section we shall
make use of the relativistically invariant theory of the retarded interaction of two
charged particles developed by Møller (1931).

The only alternative interaction between electrons and nuclei capable of leading
to a process of this kind is that derived on the basis of the Fermi theory of $\beta$-decay.
Since in the Fermi theory the first order process corresponds to the transformation
of an electron into a neutrino (charge is conserved by the simultaneous transformation
of a proton into a neutron), inelastic scattering must be a second order process.
For the mere excitation of nuclei, however, the first order process will already give
a non-vanishing result. The cross-section of this process of approximately $10^{-40}$ cm.$^{2}$
is doubtlessly too small to account for the observed cross-sections for electronic
excitation. The second order process, on the other hand, leads into the well known
divergence difficulties (for energies for which the first order process is allowed).
With a suitable cut-off procedure, however, the cross-section corresponding to the
second order process may be estimated as $10^{-64}$ cm.$^{2}$, where the factor $10^{-24}$ against
the first order process is due to the square of the ‘fine structure constant’ corresponding
to the neutrino-theory.

The actual method applied for the calculation of the cross-section is based on the
use of a formula derived by Bethe (193 a) from the Møller theory. This derivation,
however, is based on Born's approximation method and thus restricts the applicability
of its results to large energies of both incoming and outgoing electron and
small atomic numbers.

In order to simplify the calculation it is necessary to expand the integrand of
certain matrix elements in powers of the nuclear radius divided by the wave-length
which corresponds to the change of momentum of the electron. This simplification,
which has the advantage of allowing us to compute cross-sections in terms of known
experimental data, such as dipole and quadrupole moments, restricts the applicability of
the theory to electron-energies below approximately 50 MeV for medium nuclei.

The analysis of this paper has been carried out for electric dipole, electric quadrupole
and magnetic dipole transitions. In the case of the electric dipole the cross-section
comes out to be given by equation (27) below. Apart from a trivial numerical
discrepancy this formula is identical with that used by Mamasachlisov (1943) for
the determination of the cross-section of the electronic disintegration of 
The corresponding formulae for the cases of electric quadrupole and magnetic
dipole transitions are equations (35) and (40) below. In all cases, the cross-section
is a monotonic increasing function of the kinetic energy, as shown in figures 1
and 2.

This appears to be due to the assumption of vanishing nuclear charge necessary
to safeguard the applicability of Born's approximation. The electrons are represented
by plane Dirac waves and the influence of the central Coulomb field is
neglected. This seems to be a good approximation for high energies but certainly
fails near the threshold. This may be seen in the following way: The main contribution
to the cross-sections comes from those parts of the incoming and outgoing
electronic wave which are within a sphere with a radius equal to the wave-length
corresponding to the change of momentum and the nucleus as centre. The wave
function of the outgoing electron will be violently wrong within a sphere in which
its kinetic energy due to the Coulomb attraction is equal or larger than its kinetic
energy at infinity. The radius of this sphere is



where is the kinetic energy of the electron, and the condition for the applicability
of Born's approximation is $r \ll \lambda$, where $\lambda$ is the wave-length corresponding
to the change of momentum of the electron. The theory will, therefore, be unreliable


in the region 



where $p$ is the momentum of the incoming electron corresponding to the excitation
energy of the nucleus. For an atomic number in the neighbourhood of 50 and for
$p \sim 3mc$ the critical kinetic energy is approximately 70 kV.


† This paper came to our knowledge after the publication of a recent letter to the editor
of Nature (Touschek 1947). 





This breakdown of the method of approximation is well known to be responsible
for the interesting anomalies in the electron excitation curves of optical transitions
(Massey & Mohr 1931). Its effect in this analogy is a very rapid increase in the excitation
curve near the threshold leading to a maximum, the width of which is roughly
determined by the wave-length of the scattered electron. In the optical case, however,
the effect is complicated by exchange phenomena.

We do not attempt to calculate this feature of the excitation curves in the present
paper. In the limit of low energies it would be a sufficiently good approximation to
treat the nuclear transition as a point dipole or quadrupole but a better treatment of
the electron wave function would, of course, be required. We would merely point
out that an increase of the cross-section as compared with that determined by Born’s
approximation is to be expected. Formulae (27), (35) and (40) have the common
feature that the cross-section as a function of the energy of the scattered electron 
increases like J5£ lnt . As an indication of the order of magnitude of the correction 
due to the effects of the Coulomb field around the nucleus we may multiply the cross- 
section by a Gamow factor 



where is the velocity of the outgoing electron with c as unit. (This factor gives the 
value of the modulus of the wave function in a Coulomb field which asymptotically 
behaves like a plane wave with unit amplitude.) The influence of this correction is
to make the excitation curve start with a constant value from the threshold, which 
under suitable conditions might lead to a peak of the excitation curve. We have 
indicated this possible behaviour of the curve by the dotted line in figure 1. It will 
be noticed that the width of the peak corresponds to the critical energy defined by 
the breakdown of the Born approximation. 

The formulae for the cross-sections outside the critical region involve quantities 
which can be calculated readily by the use of the liquid-drop model of the nucleus. 
In this model the electric and magnetic dipole moments vanish and only quadrupole 
and higher electric transitions are possible. The cross-section for electron excitation 
of quadrupole transitions in cadmium calculated in this way is, however, roughly 
200 to 2000 times larger than the value estimated experimentally, but the disagreement is not too serious when we consider the limitations inherent in the use of the liquid-drop model.

2. General theory 


We wish to calculate the cross-section for the process described by equation (1). If we denote the energy of the incident electron by $\epsilon mc^2$, where $m$ is the mass of the electron, and that of the scattered electron by $\epsilon' mc^2$, then the conservation of energy gives

$$\epsilon = \epsilon' + \Delta + T, \qquad (2)$$

where $\Delta mc^2$ is the excitation energy imparted to the nucleus and $Tmc^2$ is the energy of recoil of the nucleus. Similarly if $\mathbf{p}mc$, $\mathbf{p}'mc$ are the momenta of the incident and



The excitation of nuclei by electrons 




scattered electrons and $\mathbf{k}mc$ is the momentum of the recoiling nucleus we obtain from the conservation of momentum the relation

$$\mathbf{p} = \mathbf{p}' + \mathbf{k}. \qquad (3)$$

Now if $MA$ is the mass of the nucleus the relation between the energy and momentum gives

$$T = mk^2/2MA \ll \Delta,$$

so that we may omit $T$ in equation (2) as long as $|\mathbf{k}|$ is not too large. Physically this implies that the nucleus is capable of absorbing any amount of momentum without affecting the energy balance.
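The size of the recoil term can be illustrated numerically. The following is a minimal sketch, assuming the recoil relation reads $T = mk^2/2MA$ in units of $mc^2$ and taking the electron-to-nucleon mass ratio as roughly 1/1836; the function name and the sample values ($k = 3$, $A = 112$, $\Delta = 3$) are illustrative choices, not figures taken from the paper.

```python
# Approximate electron-to-nucleon mass ratio m/M (an assumed round value).
ELECTRON_TO_NUCLEON_MASS = 1.0 / 1836.0


def recoil_energy(k, mass_number):
    """Recoil energy T in units of mc^2 for a momentum transfer k (units of mc)
    to a nucleus of mass number A, using T = m k^2 / (2 M A)."""
    return ELECTRON_TO_NUCLEON_MASS * k**2 / (2.0 * mass_number)


# Illustrative case: k = 3 (units of mc), A = 112, excitation energy Delta = 3.
T = recoil_energy(3.0, 112)
delta = 3.0
print(f"T = {T:.2e} mc^2, T/Delta = {T / delta:.2e}")
```

For these sample values the recoil energy is several orders of magnitude below the excitation energy, which is the sense in which $T$ may be dropped from equation (2).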

Bethe has calculated the cross-section for inelastic processes, which according to Møller's theory may be written in the form


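$$d\phi = 2r_0^2\,d\omega\,\frac{\epsilon\epsilon' p'}{p}\,\frac{\sum_{\sigma}\sum_{\sigma'}\overline{|P|^2}}{(k^2-\Delta^2)^2}, \qquad (4)$$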


where $r_0 = 2.8 \times 10^{-13}$ cm. is the radius of the electron and $d\omega$ is the element of solid angle into which the electron is ejected after inelastic scattering. The quantity $P$ is defined by the equation

$$P = \int \{\rho\rho_e - (\mathbf{j}\cdot\mathbf{j}_e)\}\,d\mathbf{r}, \qquad (5)$$

where $\rho$ and $\rho_e$ are the transition densities of the scattering and the scattered particles (that is the protons and the electrons) and $\mathbf{j}$ and $\mathbf{j}_e$ are the corresponding currents measured with $e$ as unit. The summations in equation (4) have to be carried out over the spins of the initial and final states of the electron, and $\overline{|P|^2}$ denotes the mean value of $|P|^2$ over all initial orientations of the nucleus. The factor 2 which occurs in equation (4) replaces Bethe's factor 4; the difference arises because Bethe averages over the spins of the final state whereas we sum over these spins in order to preserve a certain degree of symmetry between the incident and scattered electron.

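Since the process under consideration is due to electrons passing fairly far outside the nucleus we may represent the electrons by plane waves, so that

$$\rho_e = u'^{*}u\,e^{i(\mathbf{k}\cdot\mathbf{r})} = a_0\,e^{i(\mathbf{k}\cdot\mathbf{r})}, \qquad \mathbf{j}_e = u'^{*}\boldsymbol{\alpha}u\,e^{i(\mathbf{k}\cdot\mathbf{r})} = \mathbf{a}\,e^{i(\mathbf{k}\cdot\mathbf{r})}. \qquad (6)$$

In these equations $u$ denotes the spin amplitude and $\boldsymbol{\alpha}$ is the vector formed by the Dirac operators $\alpha_x$, $\alpha_y$, $\alpha_z$. It should also be noted that the distance $r$ is measured with the Compton wave-length ($137r_0$) as unit.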

The calculation of the total cross-section may be divided into three parts: the calculation of the average over the orientation of the nucleus, the spin summations and finally the integration over the directions of $\mathbf{p}'$. In order to carry out the first two steps of the calculation we note that the quantities $\rho$ and $\mathbf{j}$ are not independent of one another. If we write

$$\mathbf{J} = \int e^{i(\mathbf{k}\cdot\mathbf{r})}\,\mathbf{j}\,d\mathbf{r} \qquad (7)$$




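we may make use of the continuity equation

$$\operatorname{div}\mathbf{j} + \frac{\partial\rho}{\partial t} = 0$$

to express $\int\rho\,e^{i(\mathbf{k}\cdot\mathbf{r})}\,d\mathbf{r}$ in terms of $\mathbf{J}$. Considering that $\rho$ has the time dependent factor $e^{i\Delta t}$, integrating the continuity equation throughout space after multiplication by $e^{i(\mathbf{k}\cdot\mathbf{r})}$ and making use of Gauss's theorem we obtain

$$\int\rho\,e^{i(\mathbf{k}\cdot\mathbf{r})}\,d\mathbf{r} = \frac{1}{\Delta}(\mathbf{k}\cdot\mathbf{J}). \qquad (8)$$

Substituting from equation (8) into equations (6) and (5) we obtain

$$\Delta P = \{(a_0\mathbf{k} - \Delta\mathbf{a})\,\mathbf{J}\}, \qquad (9)$$

where the product involved is a scalar product. Writing for simplicity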


$$\mathbf{s}(\sigma,\sigma') = a_0\mathbf{k} - \Delta\mathbf{a} \qquad (10)$$

and putting

$$\sum_{\sigma}\sum_{\sigma'}\mathbf{s}^{*}\mathbf{s} = S, \qquad (11)$$

where now the product on the left-hand side of the equation is dyadic, we obtain from equation (9)

$$\Delta^2\sum_{\sigma}\sum_{\sigma'}|P|^2 = (\mathbf{J}^{*}\cdot S\cdot\mathbf{J}). \qquad (12)$$

It follows from its definition (11) that the quantity $S$ is a Hermitian tensor in three-dimensional space, i.e. $S_{ij}^{*} = S_{ji}$. The evaluation of the tensor $S$ is straightforward and will be carried out in the next section. The averaging over the initial orientations of the nucleus is a much more difficult problem; consequently we do not attempt to develop the general theory but instead restrict ourselves to certain simplified cases.

These simplifications arise in the usual way from the expansion of $e^{i(\mathbf{k}\cdot\mathbf{r})}$ in powers of $(\mathbf{k}\cdot\mathbf{r})$. The use of this expansion is equivalent to assuming that $(\mathbf{k}\cdot\mathbf{r}) \ll 1$ within the nucleus. This simplification restricts the applicability of our results to energies below 70 MeV. On the other hand it has the advantage of allowing us to relate cross-sections to other experimental data. The range beyond 70 MeV becomes accessible by the assumption of a free particle model for the nucleus; this is the procedure generally adopted when the de Broglie wave-length of the incident particle is comparable with the dimensions of the nucleus. In this range the excitation of the nucleus to discrete energy-levels is of less importance than processes involving the emission of a heavy particle from the nucleus. A special case of this type, the electronic disintegration of Be$^9$ with the emission of a neutron near the threshold of this process, has been considered by Mamasachlisov (1943). The general theory of such processes will be discussed in a later communication.

The special cases which we shall treat by this method are: (i) electric dipole transitions; (ii) electric quadrupole transitions; (iii) magnetic dipole transitions.

In the electric dipole case the vector $\mathbf{J}$ is defined by

$$\mathbf{J} = \int\mathbf{j}(\mathbf{r})\,d\mathbf{r} \qquad (13)$$




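and in the cases of the electric quadrupole and magnetic dipole it is defined by

$$\mathbf{J} = i\Delta T\mathbf{k}, \qquad (14)$$

where the tensor $T$ is given by the formula

$$i\Delta T = \int\mathbf{j}\,\mathbf{r}\,d\mathbf{r}, \qquad (15)$$

where the product of the vectors $\mathbf{j}$ and $\mathbf{r}$ is dyadic.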

In the case of electric quadrupole transitions $T$ must be a symmetric tensor. We restrict ourselves to the case in which $T$ in a suitable system of co-ordinates reduces to the form

$$T = \tau\begin{pmatrix} 0 & 1 & 0\\ 1 & 0 & 0\\ 0 & 0 & 0\end{pmatrix}, \qquad (16)$$

where $\tau$ is the electric quadrupole moment of the transition with $137^2r_0^2$ as unit. On the other hand for magnetic dipole transitions the tensor $T$ is antisymmetric; we consider the case

$$T = \mu\begin{pmatrix} 0 & 1 & 0\\ -1 & 0 & 0\\ 0 & 0 & 0\end{pmatrix}, \qquad (17)$$

where $\mu$ is the magnetic dipole moment of the transition with $137^2r_0^2/c$ as unit.

It should be observed that the choice of the tensor $T$ cannot be made arbitrarily but must be such as not to be in contradiction to the conditions

$$\int\rho\,d\mathbf{r} = 0, \qquad \int\rho\,\mathbf{r}\,d\mathbf{r} = 0,$$

the first of which expresses the orthogonality of the wave functions of the initial and final state of the nucleus, and the second of which expresses the impossibility of electric dipole transitions. It may be verified that the choice (16) and (17) of the tensor $T$ fulfils these conditions.

3. Determination of the tensor S

From the definition (11) and the equations (10) and (6) we obtain immediately for the tensor $S$

$$S = \sum_{\sigma}\sum_{\sigma'}(u^{*}\{\mathbf{k} - \Delta\boldsymbol{\alpha}\}u')(u'^{*}\{\mathbf{k} - \Delta\boldsymbol{\alpha}\}u), \qquad (18)$$

where the products of the vector-quantities are dyadic and the summations have to be carried out over all states of the electron with positive energy. We get rid of this troublesome restriction to positive energies by writing

$$u = \frac{1}{2\epsilon}Ku, \qquad (19)$$


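where the operators $K$ and $K'$ are defined by

$$K = (\boldsymbol{\alpha}\cdot\mathbf{p}) + \beta + \epsilon, \qquad K' = (\boldsymbol{\alpha}\cdot\mathbf{p}') + \beta + \epsilon' \qquad (20)$$

and give 0 when applied to the amplitudes of states with negative energy. The summations can then be carried out over all states, positive and negative, of the electron, and use can be made of the orthogonality relation

$$\sum_{\sigma}u_{\alpha}^{*}u_{\beta} = \delta_{\alpha\beta},$$

in which $\alpha$ and $\beta$ are the spinor indices. Carrying out this summation we obtain

$$\epsilon\epsilon' S = \tfrac14\operatorname{sp}\,(\mathbf{k} - \Delta\boldsymbol{\alpha})K'(\mathbf{k} - \Delta\boldsymbol{\alpha})K = A - B + C, \qquad (21)$$

where sp denotes the spur with respect to the spin indices and $A$, $B$ and $C$ are defined by the relations

$$A = \tfrac14\operatorname{sp}(\mathbf{k}K'\mathbf{k}K),$$
$$B = \tfrac14\Delta\operatorname{sp}(\mathbf{k}K'\boldsymbol{\alpha}K + \boldsymbol{\alpha}K'\mathbf{k}K),$$
$$C = \tfrac14\Delta^2\operatorname{sp}(\boldsymbol{\alpha}K'\boldsymbol{\alpha}K). \qquad (22)$$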


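Making use of the relations

$$\boldsymbol{\alpha}(\boldsymbol{\alpha}\cdot\mathbf{p}) + (\boldsymbol{\alpha}\cdot\mathbf{p})\boldsymbol{\alpha} = 2\mathbf{p}, \qquad \operatorname{sp}\alpha_i = \operatorname{sp}\beta = 0$$

we finally obtain

$$A = \mathbf{k}\mathbf{k}\,\{\epsilon\epsilon' + (\mathbf{p}\cdot\mathbf{p}') + 1\},$$
$$B = \Delta\{\epsilon'(\mathbf{k}\mathbf{p} + \mathbf{p}\mathbf{k}) + \epsilon(\mathbf{k}\mathbf{p}' + \mathbf{p}'\mathbf{k})\},$$
$$C = \Delta^2[I\{\epsilon\epsilon' - (\mathbf{p}\cdot\mathbf{p}') - 1\} + \mathbf{p}\mathbf{p}' + \mathbf{p}'\mathbf{p}], \qquad (23)$$

so that the tensor $S$ is determined.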


4. Electric dipole transitions 


Since the expression (13) is independent of $\mathbf{k}$ the summation over the directions of $\mathbf{j}$ becomes identical with the summation over the directions of $\mathbf{J}$, so that by equation (12)

$$\overline{\sum_{\sigma}\sum_{\sigma'}|P|^2} = \frac{1}{3\Delta^2}(S)\,|\mathbf{J}|^2 = \frac13(S)\,|\mathbf{x}|^2, \qquad (24)$$

where

$$\mathbf{x} = \int\rho\,\mathbf{r}\,d\mathbf{r}$$

is the geometrical dipole moment with $137r_0$ as unit, and $(S)$ denotes the spur of $S$, that is, $(S) = \sum_i S_{ii}$. In the derivation of equation (24) we make use of the theorem

$$\overline{\mathbf{e}\mathbf{e}} = \tfrac13 I,$$

where the bar denotes that we take the average over all directions of the arbitrary unit vector $\mathbf{e}$, and $I$ denotes the unit tensor.





It follows from equation (24) that for the dipole case we have to calculate $(S)$. This quantity is obtained from the expression for $S$ by writing scalar products instead of dyadic products and using the fact that $(I) = 3$. This gives

$$(A) = k^2\{\epsilon\epsilon' + (\mathbf{p}\cdot\mathbf{p}') + 1\},$$
$$(B) = 2\Delta\{\epsilon'(\mathbf{k}\cdot\mathbf{p}) + \epsilon(\mathbf{k}\cdot\mathbf{p}')\},$$
$$(C) = \Delta^2\{3[\epsilon\epsilon' - (\mathbf{p}\cdot\mathbf{p}') - 1] + 2(\mathbf{p}\cdot\mathbf{p}')\}. \qquad (25)$$

The occurrence of $(k^2 - \Delta^2)^2$ in the denominator of the expression on the right-hand side of equation (4) makes an expansion of $A - B + C$ in powers of $k^2 - \Delta^2$ desirable. This may readily be accomplished by replacing the scalar product $(\mathbf{p}\cdot\mathbf{p}')$ by $\tfrac12(p^2 + p'^2 - k^2)$. An elementary calculation then gives

$$(A) - (B) + (C) = -\tfrac12(k^2 - \Delta^2)^2 + (\epsilon^2 + \epsilon'^2)(k^2 - \Delta^2) - 2\Delta^2. \qquad (26)$$
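The reduction from the traces (25) to the closed form (26) can be checked numerically. The sketch below assumes the kinematics used in the text with the recoil term neglected ($\epsilon = \epsilon' + \Delta$, $\mathbf{k} = \mathbf{p} - \mathbf{p}'$, $p^2 = \epsilon^2 - 1$); the function names and the random sampling ranges are illustrative assumptions.

```python
import math
import random


def kinematics(eps_p, delta, cos_theta):
    """Return (eps, p.p', k^2) for scattered energy eps_p, excitation energy delta
    and scattering angle theta, with nuclear recoil neglected."""
    eps = eps_p + delta
    p = math.sqrt(eps**2 - 1.0)
    p_prime = math.sqrt(eps_p**2 - 1.0)
    p_dot = p * p_prime * cos_theta
    k2 = p**2 + p_prime**2 - 2.0 * p_dot
    return eps, p_dot, k2


def spur_from_traces(eps_p, delta, cos_theta):
    """(A) - (B) + (C) evaluated term by term from the traces (25)."""
    eps, p_dot, k2 = kinematics(eps_p, delta, cos_theta)
    k_dot_p = (eps**2 - 1.0) - p_dot        # k.p  with k = p - p'
    k_dot_pp = p_dot - (eps_p**2 - 1.0)     # k.p'
    a = k2 * (eps * eps_p + p_dot + 1.0)
    b = 2.0 * delta * (eps_p * k_dot_p + eps * k_dot_pp)
    c = delta**2 * (3.0 * (eps * eps_p - p_dot - 1.0) + 2.0 * p_dot)
    return a - b + c


def spur_closed_form(eps_p, delta, cos_theta):
    """Right-hand side of equation (26)."""
    eps, _, k2 = kinematics(eps_p, delta, cos_theta)
    x = k2 - delta**2
    return -0.5 * x**2 + (eps**2 + eps_p**2) * x - 2.0 * delta**2


random.seed(0)
worst = 0.0
for _ in range(1000):
    args = (random.uniform(1.0, 10.0), random.uniform(0.1, 10.0), random.uniform(-1.0, 1.0))
    lhs, rhs = spur_from_traces(*args), spur_closed_form(*args)
    worst = max(worst, abs(lhs - rhs) / max(1.0, abs(rhs)))
print("largest relative discrepancy:", worst)
```

The two evaluations agree to rounding error, which supports the signs and coefficients in (25) and (26) as written here.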



Figure 1. The variation of the cross-section for electric dipole transitions with excitation energy $\Delta$ and the kinetic energy of the scattered electron. The dotted curve in figure 1b corresponds to the multiplication of the cross-section by a Gamow factor as indicated in the introduction.


This expression does not contain the azimuthal angle so that the element of angular integration $d\omega$ becomes simply $2\pi\,d(\cos\theta)$. Now

$$d(\cos\theta) = -\frac{1}{2pp'}\,d(k^2)$$

and $k^2$ varies from $(p - p')^2$ to $(p + p')^2$, so that the integration over $\omega$ is straightforward and gives

f^H] < 27 > 

after some simple modifications based on the energy-momentum relation.





In the limiting case $p' \to 0$ we have $\epsilon = 1 + \Delta$ and $\epsilon' = 1$, so that equation (27)

reduces to »' T 2 1 

<f> _ 4^|x| 2 ^- + AJ (28) 

near the threshold of the reaction. On the other hand in the limiting case $p \to \infty$, $p' \approx p$, we have

^ « 87 "o i * I 8 [ lo &^~ 1 J • ( 29 ) 

The close resemblance of equation (29) to Bohr’s formula (Bohr 1915 ) for the 
ionization losses of a charged particle should be observed. 

The function $\phi/8\pi r_0^2|\mathbf{x}|^2$ was calculated for various values of the energy of the scattered electron and of the excitation energy $\Delta$ by means of equation (29). The results of these calculations are illustrated in figure 1.


5. Electric quadrupole and magnetic dipole transitions

If we insert from equation (14) into equation (12) we obtain in both the electric quadrupole and the magnetic dipole cases

$$\sum_{\sigma}\sum_{\sigma'}|P|^2 = (\mathbf{k}T^{\dagger}ST\mathbf{k}). \qquad (30)$$

If we assume that the tensor T is symmetric then the equation of continuity multi¬ 
plied by rr and integrated gives 

j*(Vj)rrdr « -i&jprrdr 

and by the symmetry property 

J(Vj)rrdr = -Jjrdr = -iAT, 

so that the symmetrical part of the tensor T represents the geometrical quadrupole 
moment. Consider the symmetric tensor 


$$T = \tau\begin{pmatrix} 0 & 1 & 0\\ 1 & 0 & 0\\ 0 & 0 & 0\end{pmatrix},$$

which is the simplest case of an electric quadrupole moment of magnitude $\tau$ measured with $137^2r_0^2$ as unit. We may write $T$ as the difference of two dyadic products in the

form fn wf* 

T ~ aa-bb, 

where, in the co-ordinate system we have selected, 


(i’i4 2’ 0 )’ 


bo that (a .b) » 0. Thus for Tk we obtain immediately 

Tk « a(a.k)-b(bk) *kxc, 





where $\mathbf{c} = \mathbf{a} \times \mathbf{b}$ is a unit vector. Inserting this expression into equation (30) we obtain therefore

isS|PI 1 - ((cxk)fl(cxk)). 

' cr cr' 

To obtain the average of this expression over the initial orientations of the nucleus 
we calculate c x k by putting 

k-(4,0,0). 


Hence 
so that 

and therefore 


cxk = (0,^, -c v ), 

4 s S I P I 3 - k\S v A + S a c\~ 2 S„c,c t ) 


' , 
• cr it 


S S \ P I* - k*{S yv + 8J = k*(S) - (kSk). 


r‘77 


(31) 


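The quantity $(S)$ has been calculated above in equation (26). The calculation of $(\mathbf{k}S\mathbf{k})$ proceeds along similar lines. From equations (23) we have

$$(\mathbf{k}A\mathbf{k}) = k^4\{\epsilon\epsilon' + (\mathbf{p}\cdot\mathbf{p}') + 1\},$$
$$(\mathbf{k}B\mathbf{k}) = 2\Delta k^2\{\epsilon'(\mathbf{k}\cdot\mathbf{p}) + \epsilon(\mathbf{k}\cdot\mathbf{p}')\},$$
$$(\mathbf{k}C\mathbf{k}) = \Delta^2\{k^2[\epsilon\epsilon' - (\mathbf{p}\cdot\mathbf{p}') - 1] + 2(\mathbf{p}\cdot\mathbf{k})(\mathbf{p}'\cdot\mathbf{k})\}, \qquad (32)$$

and therefore after carrying out the reductions similar to those in the dipole case we have

$$(\mathbf{k}S\mathbf{k}) = \frac{1}{2\epsilon\epsilon'}\{4\epsilon\epsilon'(k^2 - \Delta^2)^2 - (k^2 - \Delta^2)^3\}. \qquad (33)$$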
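As with the dipole case, the reduction leading to (33) can be checked numerically. This is a minimal sketch under the same assumptions as before (recoil neglected, $\epsilon = \epsilon' + \Delta$, $\mathbf{k} = \mathbf{p} - \mathbf{p}'$); it compares $\epsilon\epsilon'(\mathbf{k}S\mathbf{k})$ built from the three terms of (32) with the closed form, and the function names are illustrative.

```python
import math
import random


def ksk_from_terms(eps_p, delta, cos_theta):
    """eps*eps'*(kSk) = (kAk) - (kBk) + (kCk), term by term from equations (32)."""
    eps = eps_p + delta
    p2 = eps**2 - 1.0
    pp2 = eps_p**2 - 1.0
    p_dot = math.sqrt(p2 * pp2) * cos_theta   # p.p'
    k2 = p2 + pp2 - 2.0 * p_dot
    k_dot_p = p2 - p_dot                      # k.p  with k = p - p'
    k_dot_pp = p_dot - pp2                    # k.p'
    kak = k2**2 * (eps * eps_p + p_dot + 1.0)
    kbk = 2.0 * delta * k2 * (eps_p * k_dot_p + eps * k_dot_pp)
    kck = delta**2 * (k2 * (eps * eps_p - p_dot - 1.0) + 2.0 * k_dot_p * k_dot_pp)
    return kak - kbk + kck


def ksk_closed_form(eps_p, delta, cos_theta):
    """eps*eps'*(kSk) from equation (33): (1/2){4 eps eps' x^2 - x^3}, x = k^2 - Delta^2."""
    eps = eps_p + delta
    p2 = eps**2 - 1.0
    pp2 = eps_p**2 - 1.0
    k2 = p2 + pp2 - 2.0 * math.sqrt(p2 * pp2) * cos_theta
    x = k2 - delta**2
    return 0.5 * (4.0 * eps * eps_p * x**2 - x**3)


random.seed(0)
worst = 0.0
for _ in range(1000):
    args = (random.uniform(1.0, 10.0), random.uniform(0.1, 10.0), random.uniform(-1.0, 1.0))
    lhs, rhs = ksk_from_terms(*args), ksk_closed_form(*args)
    worst = max(worst, abs(lhs - rhs) / max(1.0, abs(rhs)))
print("largest relative discrepancy:", worst)
```

Agreement to rounding error indicates that the factor $1/2\epsilon\epsilon'$ and the powers of $k^2 - \Delta^2$ in (33) are consistent with (32).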

Substituting from equations (26) and (33) into equation (31) we have

$$\overline{\sum_{\sigma}\sum_{\sigma'}|P|^2} = \frac{\tau^2\Delta^2}{6\epsilon\epsilon'}\{(k^2 - \Delta^2)^2 + 2(p^2 + p'^2)(k^2 - \Delta^2) - 4\Delta^2\}. \qquad (34)$$

Inserting from this equation into equation (4) and integrating over the directions of the scattered electron we obtain finally for the cross-section in the case of electric quadrupole transitions




where the funotion F(e’, A) is defined by the equation 

It follows immediately from equations (35) and (36) that as p’ -* 0 


(35) 


( 36 ) 


in 


^_rJr*T*A(2A + A*)<p' 


and that asp, p’-+ oo, 0-*~^>u'''*A , log < (-^-j. 





Values of the function $F(\epsilon', \Delta)$ were calculated for various values of the energy $\epsilon'$ and of the excitation energy $\Delta$, by means of equation (36). The variation of this function with $\epsilon'$ and $\Delta$ is shown graphically in figure 2.



Figure 2. The variation of the function $F(\epsilon', \Delta)$ with $\epsilon'$ and $\Delta$. The cross-section in the electric quadrupole case is obtained by multiplying $F$ by the corresponding factor; in the magnetic dipole case the factor is $4\pi r_0^2\mu^2/3$.


In the case of magnetic dipole transitions the tensor $T$ occurring in equation (30) may be taken to have the form (17). Introducing unit vectors $\mathbf{e}_x$, $\mathbf{e}_y$ orthogonal to one another we may write

$$T = \mu(\mathbf{e}_x\mathbf{e}_y - \mathbf{e}_y\mathbf{e}_x),$$

from which it follows immediately that

$$T\mathbf{k} = \mu(\mathbf{e}_xk_y - \mathbf{e}_yk_x).$$

From the definition (11) of the tensor $S$ we then have



and averaging over the initial orientations in space of the nucleus we obtain 

£ £ \Pf - - xSx k). (38) 

<r <r' 


It follows from equation (23) that since both $A$ and $B$ contain the vector $\mathbf{k}$ in the dyadic products,

$$(\mathbf{k} \times A \times \mathbf{k}) = (\mathbf{k} \times B \times \mathbf{k}) = 0,$$

so that only the term $C$ gives a non-vanishing contribution. Furthermore

$$(\mathbf{k} \times I \times \mathbf{k}) = -2k^2,$$

so that with the definition (23) we have after a little reduction

SS HP? - ^V-A^2(p» +/ /«)(**-A*)-4A*}. 


(39) 




Comparing this formula with equation (34) we see that the integrations over the directions of the scattered electrons are the same as in the electric quadrupole case, leading to

$$\phi = \frac{4\pi}{3}r_0^2\mu^2F(\epsilon', \Delta), \qquad (40)$$

where $F(\epsilon', \Delta)$ is defined by equation (36).


6. Discussion 

It will be observed from figures 1 and 2 that in all three types of transitions considered here the cross-section increases with increasing kinetic energy of the scattered electron. Near the threshold energy the form of all the curves is parabolic, but for large values of the kinetic energy it is logarithmic. The difference between the curves for electric dipole transitions and those for higher order transitions arises from the difference of their dependence on the threshold energy. In the former case the cross-section decreases as the threshold energy $\Delta$ increases (the kinetic energy of the scattered electron being kept constant).

In the equations (28) and (35) the quantities $|\mathbf{x}|^2$ and $\tau^2$ still remain to be calculated. This can be most conveniently done by the use of the liquid-drop model (Bohr & Kalckar 1936; Bethe 1937). In this model of the nucleus the electric and magnetic dipole moments vanish and only quadrupole and higher order radiation can occur. The vanishing of the electric dipole moment arises from the fact that, in this model, the centre of gravity of the protons coincides with the centre of gravity of the nucleus as a whole. That there is no magnetic dipole moment is due to the lack of rotational components in the displacement current. A model which might give a magnetic dipole moment is the rigid rotator model recently proposed by Guggenheimer (1942).

In the liquid-drop model the frequency of the $l$th harmonic of the surface wave is given by the formula

where 0 is the surface energy, 9 64* MeV, f> is the density and S the surface of the 
nucleus. The mode l — 2 is the only one giving a contribution to the electric quad¬ 
rupole moment. The energy of the first excited level is w so that taking the nuclear 
radius to be 1-6 x 10 - 1 M* cm. we obtain 

$$\Delta = 38.74\,A^{-1/2} \qquad (41)$$

giving first excitation levels at 4.95 MeV for oxygen and 1.27 MeV for uranium. In the liquid-drop model the quadrupole moment of the nucleus is given by the equation


ZR„bA * 
5Ajj ’ 


(42) 


where b, the amplitude of the second harmonic is related to A by 

mc s A ~ 2Gb*jl5R i . (43) 

Combining equations (41), (42) and (43) we obtain 

r » l-22Z.4fc.10 - 6 


( 44 ) 





for the quadrupole moment corresponding to the transition from the ground state 
to the first excited state. Substituting from equation (44) into equation (35) we 
may calculate the cross-section for this process. 

For cadmium ($Z = 48$) we may compare the value 1.9 MeV for $mc^2\Delta$ given by equation (41) with the experimentally observed value 1.3 MeV. This gives us some measure of the validity of the use of the liquid-drop model and the accuracy to be expected from the calculations of the cross-sections based upon the use of this model. For cadmium equation (44) gives $\tau = 0.85 \times 10^{-3}$ and for a kinetic energy of the scattered electron equal to 100 kV and $\Delta = 3.8$ it follows from linear interpolation of the curves of figure 2 that $F(\epsilon', \Delta)$ is approximately equal to 10. As a consequence of equation (35) we find that the corresponding cross-section is of the order $2 \times 10^{-30}$ cm.$^2$.
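The level energies quoted from equation (41) can be reproduced with a few lines of arithmetic. The sketch below assumes that (41) reads $\Delta = 38.74\,A^{-1/2}$ in units of $mc^2$, takes $mc^2 = 0.511$ MeV, and uses $A = 112$ as a representative mass number for cadmium; that mass number and the function name are assumptions made for illustration.

```python
import math

ELECTRON_REST_ENERGY_MEV = 0.511  # mc^2 in MeV


def first_level_mev(mass_number):
    """First excited level of the liquid drop, assuming Delta = 38.74 / sqrt(A)
    in units of mc^2, converted to MeV."""
    delta = 38.74 / math.sqrt(mass_number)
    return delta * ELECTRON_REST_ENERGY_MEV


for name, a in (("oxygen", 16), ("cadmium (A = 112 assumed)", 112), ("uranium", 238)):
    print(f"{name}: {first_level_mev(a):.2f} MeV")
```

The oxygen and cadmium values agree with the 4.95 MeV and 1.9 MeV quoted in the text; the uranium value comes out near 1.28 MeV, within rounding of the quoted 1.27 MeV.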

This is in excess of the value $10^{-32}$ cm.$^2$ estimated by Collins & Waldmann (1940) for indium but not violently so when we consider the limitations inherent in the use of the liquid-drop model and the assumptions which Collins & Waldmann had to introduce about the efficiency of their counters in deriving the experimental estimate of the cross-section. On the other hand if we use Collins's estimate of the cross-section to calculate the quadrupole moment of the transition with the help of equation (35) we find that it comes out to be of the order $10^{-26}$ cm.$^2$, which is small but within the possible range for this quantity. Moreover there is no reason to believe that the process observed does not correspond to a higher order transition than the electric quadrupole. Indeed the cross-section for electric octupole transitions should be smaller by a factor of order $(137r_0/R)^2$ ($r_0$ = Lorentz radius of the electron, $R$ = radius of the nucleus), which in the case of indium is about 1600, leading to a cross-section in fair agreement with the estimate of Collins & Waldmann.

The other possible experimental test of the theory, the experiments on electron-excitation of cadmium by Wiedenbeck (1945), unfortunately concerns the region in which Born's approximation breaks down. The general trend of Wiedenbeck's excitation curves may, however, be well understood by means of the above considerations, though a direct comparison would require a more detailed theory of the behaviour of the excitation curves near the threshold.

References 


Bethe 1932 Z. Phys. 76, 293.
Bethe 1937 Rev. Mod. Phys. 9, 224.
Bohr 1915 Phil. Mag. (vi), 30, 581.
Bohr & Kalckar 1936 K. danske vidensk. Selsk. Skr. Mat.-fys. Medd. 14, no. 10.
Collins & Waldmann 1940 Phys. Rev. 57, 1088.
Grimsehl 1934 Lehrbuch d. Physik, 2, 2. Berlin: Teubner.
Guggenheimer 1942 Proc. Roy. Soc. A, 181, 109.
Mamasachlisov 1943 J. Phys. U.S.S.R. 7, 239.
Massey & Mohr 1931 Proc. Roy. Soc. A, 132, 605.
Møller 1931 Z. Phys. 70, 786.
Touschek 1947 Nature, 160, 500.
Wiedenbeck 1945 Phys. Rev. 67, 92.



The mechanism of the carbon dioxide-carbon reaction 

By J. Gadsby, F. J. Long, P. Sleightholm and K. W. Sykes 
Physical Chemistry Laboratory, Oxford 

(Communicated by C. N. Hinshelwood, F.R.S.—Received 27 November 1947 

—Read 29 April 1948) 

The detailed mechanism of the reaction between carbon dioxide and coconut shell charcoal 
has been studied by both flow and static methods. The temperature has been varied in the 
range 700 to 830° C and the pressures of the gases from 10 to 760 mm. 

The static method has been used to investigate the adsorption of the gases on the carbon 
surface during the course of the reaction, and thus to illustrate in a very direct way the 
general nature of the mechanism. Accurate measurements of the rate of reaction have been 
made by the flow method. At a given temperature the rate of reaction can be represented in 
terms of the partial pressures of the gases by an expression of the form 


$$\text{rate} = \frac{k_1p_{\mathrm{CO_2}}}{1 + k_2p_{\mathrm{CO}} + k_3p_{\mathrm{CO_2}}}.$$

The three separate constants have been evaluated and each has been found to vary exponentially with temperature. From these results the rates of the individual stages of the mechanism have been calculated.

The first stage is the decomposition of the carbon dioxide molecule into an atom of oxygen which is adsorbed by the carbon and a molecule of carbon monoxide which passes into the gas phase. Only certain sites on the charcoal surface take part in the reaction; they represent about 0.5 % of the total area and probably consist of some of the less firmly bound carbon atoms situated at lattice discontinuities. The rate of the first stage can be accounted for by assuming that reaction occurs in those collisions in which the combined energy of the active carbon atom and the incident carbon dioxide molecule exceeds 68 kcal. The second stage is the evaporation of the adsorbed oxygen atom, together with an atom of carbon from the solid, to form gaseous carbon monoxide; the activation energy is thought to be 38 kcal., and the low value of $10^7$ sec.$^{-1}$ obtained for the non-exponential factor is discussed.

The retarding effect of carbon monoxide is due to the adsorption of the gas on the reaction sites, the heat given out in the change being 46 kcal. On the basis of this value, which implies that the molecule is adsorbed chemically, it is possible to calculate theoretically the order of magnitude of the retardation constant, $k_2$.

Introduction 

This investigation forms part of a comprehensive study of the reactions of the 
steam-carbon system. The effect on the separate reactions of changes in the partial 
pressures of the gases has been described in a previous paper (Gadsby, Hinshelwood 
& Sykes 1946 ), in which the general interpretation of the main kinetic relationships 
was also discussed. In continuation of this work, the detailed mechanism of the 
reactions in which solid carbon is converted into gaseous carbon monoxide has now 
been studied. The present paper deals with the results obtained for the carbon 
dioxide-carbon reaction, and an accompanying one with those for the steam- 
carbon reaction. 

Outline of method 

The principle of the method is the complementary use of both the flow and the
static techniques. The variation of the rate of reaction with the partial pressures of 





the gases and with temperature is determined with a flow apparatus. This technique 
is the most accurate one for the purpose, since a steady state under any desired 
conditions can be maintained while the measurement is being made. The results, 
however, refer to a whole sequence of elementary processes and can be interpreted, 
as shown previously, in terms of a number of different kinetic schemes. In order to 
decide between the various possibilities, the adsorption of the gases during the 
course of the reaction is studied with a static apparatus. These experiments provide 
the information which is required to calculate the rates of the individual stages 
from the results obtained by the flow method. 

Both the carbon dioxide reaction and the steam reaction are investigated with 
samples of the same coconut shell charcoal. Although this material contains a small 
percentage of catalytic impurities, its behaviour has been shown to be similar to 
that of many different types of carbon (Gadsby et al. 1946). It is probable, therefore, that the impurities merely influence the rate of one or more stages of a kinetic
scheme characteristic of carbon in general. The properties of the impure substances 
are of considerable practical importance, and their study is also an essential step in 
the solution of the problem presented by the catalysis. 

Experimental technique 

The flow apparatus is similar to that described previously (Gadsby et al. 1946), 
but greater accuracy is obtained by the use of an improved method of temperature 
control (Gadsby & Sykes 1948), which regulates automatically the distribution 
of temperature in space as well as in time. In this way the temperature is kept 
uniform to within 2° C over the whole charcoal column and does not vary by 
more than ± 0.5° C at any point. During each run the controlling thermocouple is standardized against a platinum/platinum-rhodium one. Improvement has also been made in the calibration of the flowmeters, for which an accuracy of ± 0.3 %
is obtained. The amount of charcoal is varied from 35 g. at the lowest temperature 
to 8 g. at the highest temperature. This ensures that only a small fraction of the 
carbon dioxide reacts, and consequently that its partial pressure and the total rate 
of flow are approximately uniform throughout the reaction zone. To allow for the 
change during each experiment in the amount of charcoal present, the rate of 
reaction for a given composition of the gas stream is measured at least twice, and 
the results are adjusted by interpolation to refer to 85 % of the initial amount of 
charcoal. In all other respects the technique of the measurements is the same as in 
the previous investigation. 

In the static apparatus the charcoal is contained in a silica vessel heated in an 
electric furnace. Unless otherwise stated, the volume of the vessel is 290 c.c. and the amount of charcoal 10 g. The temperature is measured with a platinum/platinum-rhodium thermocouple, and is controlled to within ± 0.5° C by means of a platinum resistance thermometer in conjunction with a relay circuit of the type described by Coates (1944). Gases to be introduced are first admitted to a section of the




apparatus of known volume fitted with a mercury manometer; the amount subsequently transferred to the reaction vessel can then be calculated from the external decrease in pressure. Another mercury manometer measures the pressure in the reaction vessel, and the composition is determined by withdrawing samples which are analyzed with a Haldane or with a Bone & Wheeler apparatus. To study the course of the reaction, a series of experiments is made in which the same amount of gas is allowed to remain in contact with the charcoal for a series of different times. Between each experiment the reaction vessel is evacuated at the working temperature by a mercury vapour pump for a time sufficient to ensure reproducibility. For this purpose periods of from 10 min. to a few hours were found to be adequate.


Results 


(1) The flow method 

The general form of the results 

Measurements are made at six temperatures in the range 734 to 829° C. The variation of the rate of reaction with the partial pressure of carbon dioxide is determined by passing through the charcoal column a series of carbon dioxide-nitrogen mixtures, the rate of flow of nitrogen being chosen so as to keep the total rate of flow constant at 1200 c.c./min. The rate of reaction, defined as one-half the rate of formation of carbon monoxide per g. of charcoal, is plotted against the initial partial pressure of carbon dioxide on the left-hand side of figure 1. At each temperature the reaction is of fractional apparent order. To determine the effect of carbon monoxide, various small partial pressures of it are added, the partial pressure of carbon dioxide and the total rate of flow being kept constant by adjustment of the rate of flow of nitrogen. The conditions are such that the rate of the reverse reaction is negligible. The right-hand side of figure 1, where the rate of reaction is plotted against the initial partial pressure of carbon monoxide, shows that there is a strong retardation over the whole temperature range. It may be noted that the percentage reduction in the rate for a given increase in the partial pressure of carbon monoxide is more marked the lower the temperature.

The rate expression 

It will first be established that the results at any given temperature can be represented by an expression of the form

$$\text{rate} = \frac{k_1p_{\mathrm{CO_2}}}{1 + k_2p_{\mathrm{CO}} + k_3p_{\mathrm{CO_2}}}. \qquad (1)$$


It follows from the treatment outlined previously (Gadsby et al. 1946) that under certain conditions equation (1) leads to the relation

$$\frac{m}{x} = \frac{1 + k_3p^0_{\mathrm{CO_2}}}{k} + \frac{k_2\,\bar p_{\mathrm{CO}}}{k}. \qquad (2)$$






In the above, $x$ is the fraction of the carbon dioxide reacting, $m$ is the mass of the charcoal, the superscript zero denotes the initial value and $\bar p_{\mathrm{CO}}$ is the average partial pressure of carbon monoxide; $k$ is now defined by the equation

$$k = k_1RT/V, \qquad (3)$$



Figure 1. Apparent order of reaction with respect to carbon dioxide and retardation by carbon monoxide. Initial pressures of CO$_2$: 633 mm., 253 mm. and 127 mm.


where $V$ is the total rate of flow at the temperature $T$ of the reaction zone. The conditions which must be satisfied for equation (2) to be applicable are as follows:

(i) the total rate of flow is constant throughout the reaction zone; 

(ii) the rate of diffusion in the direction of flow is negligible compared with the 
rate of flow; 

(iii) In (1 — x) may be expanded with neglect of powers of x greater than the second; 

(iv) $2k_3p^0_{\mathrm{CO_2}} \gg 1$ and $2p^0_{\mathrm{CO_2}} > p^0_{\mathrm{CO}}$.

These requirements are fulfilled to a close approximation. The total rate of flow 
does not vary by more than 3.8 %, and the rates of diffusion can be shown by rough 
calculation to be less than 2.5 % of the rates of flow. The fraction of carbon dioxide 
reacting rarely exceeds 0.10 and is never greater than 0.17. The average value of 
$2k_2 p^0_{\mathrm{CO_2}}$ is of the order of 50 and $2p^0_{\mathrm{CO_2}}/p^0_{\mathrm{CO}}$ is not less than 2.8. 
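As a numerical illustration of equations (1) and (2), the following Python sketch evaluates the rate expression and the straight line relating m/x to the mean pressure of carbon monoxide. The function names and the sample constants are hypothetical, chosen only to show that the slope and intercept have the forms used in the test of the rate expression; they are not values from the measurements.

```python
def rate(p_co2, p_co, k1, k2, k3):
    """Equation (1): rate = k1*p_CO2 / (1 + k2*p_CO + k3*p_CO2)."""
    return k1 * p_co2 / (1.0 + k2 * p_co + k3 * p_co2)


def m_over_x(p0_co2, p_co_mean, k, k2, k3):
    """Equation (2): m/x = (1 + k3*p0_CO2)/k + k2*p_CO_mean/k."""
    return (1.0 + k3 * p0_co2) / k + k2 * p_co_mean / k


# Hypothetical constants (pressures in atm.), for illustration only.
k, k2, k3 = 0.05, 40.0, 2.0
p0 = 0.5

# m/x is linear in the mean CO pressure: slope k2/k, intercept (1 + k3*p0)/k.
y0 = m_over_x(p0, 0.00, k, k2, k3)
y1 = m_over_x(p0, 0.01, k, k2, k3)
slope = (y1 - y0) / 0.01
print(slope, k2 / k)              # slope of the line and k2/k
print(y0, (1.0 + k3 * p0) / k)    # intercept at zero CO pressure
```

Plotting m/x against the mean carbon monoxide pressure therefore gives k2/k from the slope, while the intercept, examined as a function of the initial carbon dioxide pressure, gives k and k3.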

Equation (2) may therefore be used to test the rate expression by plotting m/x 
against $\bar{p}_{\mathrm{CO}}$ for various values of $p^0_{\mathrm{CO_2}}$. The typical results shown in figure 2 are 




The mechanism of the carbon dioxide-carbon reaction 361 

well represented by straight lines as required by equation (2). The slope at a given 
temperature should be equal to $k_2/k$ and should thus be independent of $p^0_{\mathrm{CO_2}}$; the 
slight variation observed is not systematic and is probably due to lack of complete 
reproducibility from one experiment to another. In figure 3 the intercept $m/x_0$ at 



Figure 2. Test of rate expression: variation of carbon monoxide pressure. Initial pressure 
of CO2: O 833 mm.; 3 506 mm.; © 380 mm.; © 253 mm. and # 127 mm. 

$\bar{p}_{\mathrm{CO}} = 0$ is plotted against $p^0_{\mathrm{CO_2}}$. These intercepts are calculated by subtracting 
$k_2 \bar{p}_{\mathrm{CO}}/k$ from the values of m/x obtained in the experiments where the carbon dioxide 
pressure is varied without the addition of carbon monoxide. It can be seen that the 
linear relation 

$$\frac{m}{x_0} = \frac{1 + k_3 p^0_{\mathrm{CO_2}}}{k} \tag{4}$$

required by equation (2) is obeyed within the limits of error at each temperature. 





The rate of reaction can thus be represented closely by equation (1). The values 
of $k_1$ and $k_3$ are calculated from the slopes and intercepts of figure 3, and those of 
$k_2$ from the slopes of figure 2; all straight lines are drawn by the method of least 
squares, and the probable errors of the constants are estimated from the residuals 
in the usual way. In figure 4 the logarithms of $k_1$, $k_2$ and $k_3$ are plotted against the 



Figure 3. Test of rate expression: variation of carbon dioxide pressure. 


reciprocal of the absolute temperature, the radii of the circles being equal to the 
probable errors. The variation of all three constants with temperature is adequately 
represented by the straight lines drawn by the method of least squares. A concise 
summary of the measurements may thus be given as follows: 

$k_1 = k_{01} e^{-E_1/RT}$, where $k_{01} = 10^{8.8 \pm 0.6}$ g.mol. min.$^{-1}$ atm.$^{-1}$ g.$^{-1}$, 
and $E_1 = 58.8 \pm 2.0$ kcal. g.mol.$^{-1}$; 

$k_2 = k_{02} e^{+E_2/RT}$, where $k_{02} = 10^{-7.9 \pm 0.7}$ atm.$^{-1}$, 
and $E_2 = 45.5 \pm 3.5$ kcal. g.mol.$^{-1}$; 

$k_3 = k_{03} e^{-E_3/RT}$, where $k_{03} = 10^{6.5 \pm \text{[illegible]}}$ atm.$^{-1}$, 
and $E_3 = 30.1 \pm 3.6$ kcal. g.mol.$^{-1}$. 
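The summary above can be checked numerically with a short Python sketch that evaluates the three constants from their central values, ignoring the probable errors. The helper names are hypothetical, and the pre-exponential factor of $k_3$ is taken as $10^{6.5}$, an assumption obtained from the ratio of the factors $10^{8.8}$ and $10^{2.3}$ quoted in equations (12) and (13).

```python
import math

R = 1.987e-3  # gas constant, kcal. g.mol.^-1 K^-1


def arrhenius(log10_prefactor, energy_kcal, temp_c):
    """Return 10**log10_prefactor * exp(energy_kcal / (R*T)).

    A negative energy gives a constant that rises with temperature.
    """
    t_kelvin = temp_c + 273.15
    return 10.0 ** log10_prefactor * math.exp(energy_kcal / (R * t_kelvin))


def k1(temp_c):
    return arrhenius(8.8, -58.8, temp_c)   # g.mol. min^-1 atm^-1 g^-1


def k2(temp_c):
    return arrhenius(-7.9, +45.5, temp_c)  # atm^-1


def k3(temp_c):
    # Assumed prefactor 10**6.5 = 10**8.8 / 10**2.3 (equations (12), (13)).
    return arrhenius(6.5, -30.1, temp_c)   # atm^-1


for t in (734, 829):
    print(t, k1(t), k2(t), k3(t))
```

With these inputs $k_2$ falls by roughly a factor of seven and $k_3$ rises by a factor of three to four between 734 and 829° C, which is the behaviour quoted for coconut shell charcoal in the Discussion.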



Figure 4. Variation of $k_1$, $k_2$ and $k_3$ with temperature. Abscissa: $1/T \times 10^3$. 


The question of diffusion 

It will be shown that the above constants are independent of the total rate of flow, 
and thus that the reaction is not controlled by diffusion outside the charcoal granules. 
The experimental points in figure 5 represent the rate of reaction as a function of 
the total rate of flow. The curves are calculated from the rate expression assuming 
that $k_1$, $k_2$ and $k_3$ are constant, but allowing for the change in the partial pressure of 
carbon monoxide. They are obtained by solving for x as a function of V in the 
expression 

$$\frac{1}{xV} = \frac{1 + k_3 p^0_{\mathrm{CO_2}}}{m k_1 RT} + \frac{k_2 \bar{p}_{\mathrm{CO}}}{m k_1 RT}, \tag{5}$$

which follows from equations (2) and (3); the constant terms $(1 + k_3 p^0_{\mathrm{CO_2}})/mk_1RT$ 
and $k_2/mk_1RT$ are evaluated from the two rates of reaction at 1200 c.c./min. at 
each temperature. The agreement between the experimental points and the 
calculated curves shows that the observed effect is caused entirely by the change in 


the partial pressure of the carbon monoxide formed, resulting from the change in 
the rate of flow. The much smaller variation of the lower set of experimental points 
at each temperature, obtained in the presence of an added partial pressure of carbon 
monoxide, large compared with that formed in the reaction, strongly supports the 
above interpretation. Thus, as previously observed with the coal charcoal, diffusion 
outside the granules is not an important factor. This conclusion is consistent with 
the exponential variation of the rate of reaction with temperature, which also rules 
out the possibility of serious interference by diffusion inside the granules. 



Figure 5. Variation of total rate of flow. Comparison of experimental 
points with curves calculated from equation (5). 

(2) The static method 

The course of reaction 

Figure 6 shows the changes which occur both in the gas phase and on the carbon 
surface, when carbon dioxide is admitted to a vessel of volume 17[illegible] c.c. containing 
6 g. of charcoal. The initial pressure, $p^0_{\mathrm{CO_2}}$, is the pressure which the carbon dioxide 
would exert if neither adsorption nor reaction took place; it is calculated, on the basis 
of calibration experiments with argon, from the amount of gas introduced. The 
amount of oxygen adsorbed is expressed as the pressure which it would exert as 
carbon monoxide in the gas phase, and is taken to be $2p^0_{\mathrm{CO_2}} - (2p_{\mathrm{CO_2}} + p_{\mathrm{CO}})$. This 
stoichiometric definition makes no assumptions about the state of the oxygen on 
the surface and thus includes that present either as adsorbed carbon monoxide or 
in any other form. 
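The stoichiometric definition of the oxygen adsorbed amounts to an oxygen balance over the gas phase, which may be sketched in Python as follows. The function name and the pressures in the example are hypothetical and serve only to illustrate the arithmetic.

```python
def oxygen_adsorbed(p0_co2, p_co2, p_co):
    """Oxygen on the surface, as an equivalent pressure of CO (mm.).

    Each CO2 molecule introduced carries two oxygen atoms; each CO2 or CO
    molecule remaining in the gas phase accounts for two or one of them.
    """
    return 2.0 * p0_co2 - (2.0 * p_co2 + p_co)


# Hypothetical readings in mm., for illustration only.
print(oxygen_adsorbed(p0_co2=100.0, p_co2=60.0, p_co=50.0))  # 30.0
```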




Initially there is a rapid decrease in the pressure of carbon dioxide; it is 
accompanied by a slightly smaller increase in the amount of oxygen adsorbed and by a 
slightly larger increase in the pressure of carbon monoxide. The first stage is thus the 
reaction of the carbon dioxide with the surface to give an atom of oxygen which is 
adsorbed and a molecule of carbon monoxide which passes into the gas phase: 

$$\mathrm{CO_2} \rightarrow (\mathrm{O}) + \mathrm{CO}. \tag{6}$$



Figure 6. The course of reaction. A change of pressure of 1.0 mm. is equivalent in 
amount of reaction per g. of charcoal to a change of 0.81 mm. in figures 7, 8 and 9. 


This result is in agreement with the work of Langmuir (1915) with graphite, of 
Broom & Travers (1932) with coconut shell charcoal and of Semechkova & 
Frank-Kamenetzky (1940) with sugar charcoal. The second stage is presumably the 
evaporation of the oxygen as carbon monoxide: 

$$\mathrm{C} + (\mathrm{O}) \rightarrow \mathrm{CO}. \tag{7}$$

The rate of reaction might thus be expected to be proportional to the amount of 
oxygen adsorbed, once an approximately steady value of the latter has been 
established. However, it can be seen from figure 6 that, although the rate of reaction 
falls continually, the amount of oxygen adsorbed at first rises and then remains 
substantially constant. Further evidence for the behaviour of the oxygen is given in 
figure 7, where the reaction is followed much more nearly to completion. Two 
explanations may be proposed; the progressive adsorption of carbon monoxide may 
compensate for the expected decrease in the amount of oxygen which is transferred 
from carbon dioxide to the surface, or the reaction may take place on sites of 
successively decreasing reactivity. That the adsorption of carbon monoxide is an 
important factor is suggested by the following experiments. 



Figure 7. The amount of oxygen adsorbed during the reaction. Each curve is 
labelled with the initial pressure of carbon dioxide. 


Table 1 

    p_CO2 (mm.)    p_CO (mm.)    oxygen adsorbed (mm.)
        25            228                25
        58            263                25
        96            276                29
       125            304                35
        23            134                21
        53            169                26


Mixtures of carbon dioxide at various pressures and of carbon monoxide at a 
large constant pressure are introduced, and analyses are made after each mixture 
has been in contact with the charcoal for 20 min. Typical results obtained at 750° C 
are recorded in the first four rows of table 1. 

The wide variation in the pressure of carbon dioxide is not accompanied by a 
corresponding change in the amount of oxygen adsorbed, which follows more 
closely the smaller variation in the pressure of carbon monoxide. A large part of the 
oxygen on the surface during the later stages of the reaction is thus probably due 
to the adsorption of carbon monoxide. The figures in the last two rows were obtained 
without the addition of carbon monoxide. Comparison with the values above shows 
that, at constant pressure of carbon dioxide, the amount of oxygen adsorbed does 
not vary greatly with change in the pressure of carbon monoxide. This result can 
be understood on the basis of the competitive adsorption of oxygen from carbon 
dioxide and of carbon monoxide. It also implies that the alternative retardation 
mechanism, represented by the reverse of equation (6), does not in fact make any 
important contribution; otherwise increase in the pressure of carbon monoxide 
would lead to a marked decrease in the amount of oxygen adsorbed. It is probable, 
therefore, that the adsorption of carbon monoxide is the factor which causes the 
reaction to be retarded by that gas. 

The adsorption of carbon monoxide 

The following results show that the behaviour of carbon monoxide is consistent 
with the proposed mechanism of reaction. In figure 8 the decrease of pressure which 
occurs when the gas is introduced is plotted against time. Since only small amounts 



Figure 8. The rate of adsorption of carbon monoxide. O 850° C; • 800° C. 


of carbon dioxide are formed, the curves represent closely the extent to which carbon 
monoxide is adsorbed. Comparison with figure 6 shows that the adsorption is slow 
compared with the initial reaction of carbon dioxide. Within 10 min., however, the 
greater part of the adsorption has taken place, so that the process might be expected 
to be significant in the later stages of the reaction. The amount finally adsorbed, 
taken as the decrease of pressure less that of the carbon dioxide formed, is plotted 
in figure 9 against the pressure of the carbon monoxide remaining in the gas phase. 
It can be seen that the amount of oxygen on the surface in the later stages of the 
reaction (table 1) is similar to the amount of carbon monoxide which would be 




adsorbed. However, while the extent to which carbon monoxide retards the reaction 
decreases rapidly as the temperature is raised ($k_2$, figure 4), the adsorption isotherm 
is roughly independent of temperature. To avoid this difficulty, it may be supposed 
that, in addition to its reversible exothermic adsorption on the reaction sites, carbon 
monoxide is also adsorbed on less active parts of the surface. If equilibrium was not 
established with respect to these latter sites, the number occupied would increase 
with rise of temperature, and the total adsorption might thus remain approximately 
constant. 



Figure 9. The amount of carbon monoxide adsorbed. O 850° C; 0 800° C. Abscissa: $p_{\mathrm{CO}}$ (mm.). 


The total area of the charcoal has been estimated from the adsorption isotherm for 
water vapour at room temperature to be $3 \times 10^{6}$ cm.$^2$ g.$^{-1}$ (Kipling, unpublished 
work). To cover this area with a close-packed layer of carbon monoxide about 
$3 \times 10^{-3}$ g.mol. would be required. Up to $1.5 \times 10^{-5}$ g.mol. of carbon monoxide is 
adsorbed by 1 g. of charcoal (figure 9) and the largest amount of oxygen ever observed 
on the surface during the reaction (figure 6) is equivalent to $8.7 \times 10^{-6}$ g.mol. g.$^{-1}$. 
It is clear, therefore, that only a small fraction of the charcoal surface takes part in 
the reaction under these conditions. The estimation of the area represented by the 
active sites is complicated by the assumption that carbon monoxide may also be 
adsorbed on other parts of the surface, but 0.5 % of the total area is thought to be 
of the correct order of magnitude. 
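The estimate of the active fraction of the surface is a simple ratio, sketched below in Python. The exponents are assumed readings of the quantities quoted above ($3 \times 10^{-3}$ g.mol. for a complete layer, $1.5 \times 10^{-5}$ g.mol. g.$^{-1}$ for carbon monoxide and $8.7 \times 10^{-6}$ g.mol. g.$^{-1}$ for oxygen), and the variable names are hypothetical.

```python
monolayer = 3e-3       # g.mol. of CO needed to cover 1 g. of charcoal
co_adsorbed = 1.5e-5   # g.mol. g^-1, largest uptake of CO (figure 9)
oxygen_max = 8.7e-6    # g.mol. g^-1, largest oxygen coverage (figure 6)

# Fractions of a complete close-packed layer.
co_fraction = co_adsorbed / monolayer
oxygen_fraction = oxygen_max / monolayer
print(f"CO: {100 * co_fraction:.2f} % of a monolayer")
print(f"oxygen: {100 * oxygen_fraction:.2f} % of a monolayer")
```

On these readings the carbon monoxide uptake corresponds to about 0.5 % of a complete layer, which is the order of magnitude adopted for the active sites.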

The effect of argon and nitrogen 

The assumption that diffusion is not an important factor, even in the static 
apparatus, can be justified by the fact that the reaction is not retarded by an inert 
gas. Figure 10 shows that the addition of argon sufficient to increase the total 
pressure by a factor of four is without effect on the reaction. Although these 
experiments rule out the possibility that conditions outside the charcoal granules might 
have been different in the absence of the flow, they add no information to that 
provided by the temperature coefficient about the situation in the finer pores inside 
the granules. There the rate of diffusion is limited by collisions with the walls, since 
the diameter of the pores is of the order of $10^{-7}$ cm. (Kipling, unpublished work), 
and is thus smaller than the mean free path of about $10^{-4}$ cm. at a pressure of 200 mm. 
Analogous experiments with nitrogen, also illustrated in figure 10, show that this 
gas has no specific effect on the reaction, and so justify its use as the inactive carrier 
in the flow apparatus. 
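The comparison between the pore diameter and the mean free path can be illustrated with the elementary hard-sphere formula. The Python sketch below is an order-of-magnitude check only: the temperature of 1000° K and the collision diameter of $4 \times 10^{-8}$ cm. are assumed values, not taken from the paper.

```python
import math

K_B = 1.380649e-16   # Boltzmann constant, erg K^-1
MM_HG = 1333.22      # dyn cm^-2 per mm. of mercury


def mean_free_path(temp_k, pressure_mm, diameter_cm):
    """Hard-sphere mean free path kT / (sqrt(2) * pi * d^2 * p), in cm."""
    pressure = pressure_mm * MM_HG
    return K_B * temp_k / (math.sqrt(2.0) * math.pi * diameter_cm**2 * pressure)


# Assumed inputs: 1000 K and a collision diameter of 4e-8 cm.
path = mean_free_path(temp_k=1000.0, pressure_mm=200.0, diameter_cm=4e-8)
pore = 1e-7          # pore diameter quoted in the text, cm.
print(path, path / pore)
```

The path comes out close to $10^{-4}$ cm., several hundred times the pore diameter, so that collisions with the walls rather than with other molecules limit diffusion in the pores.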



Figure 10. The effect of argon and nitrogen. Carbon dioxide at an initial pressure of 110 mm. 
in the presence of # no added gas; 0 108 mm. argon; (J 209 mm. argon; O 294 mm. 
nitrogen. 

The rate of reaction 

Although the static method is unsuitable for accurate measurement of the rate 
of reaction, the following observations provide some further support for the 
conclusions which have already been reached. 

In table 2 the rate of reaction is compared with the values expected on the basis 
of the flow experiments. The first two columns give the pressures of carbon dioxide 
and carbon monoxide at various stages of the reaction represented in figure 6. The 
observed rates of reaction are obtained by drawing tangents to the graph of the 
total pressure as a function of time, and the calculated ones by substituting in 
equation (1) the pressures in the first two columns and the values of the constants 
obtained in the flow experiments. A small correction is also made for the reverse 
reaction. Although initially the observed rate exceeds the calculated one, there is 
good agreement in the later stages. However, since it is only adsorbed slowly, the 
carbon monoxide formed in the initial reaction of the carbon dioxide with the 
surface cannot exert its full retarding effect immediately; it is only in the later 
stages that a steady state should be approached. The alternative retardation 
mechanism, represented by the reverse of equation (6), would require the untenable 
hypothesis that the initial reaction of the carbon dioxide at first goes far beyond an 
equilibrium position to which it subsequently returns. If the initial transfer of the 
oxygen atom to the surface takes place in an end-on collision of the carbon dioxide 
molecule, there is, however, no reason why the carbon monoxide formed 
simultaneously should not pass directly into the gas phase and only be adsorbed 
subsequently. The possibility that the high initial rate of reaction is due to the greater 
intrinsic reactivity of the fresh carbon surface would appear to be ruled out by the 
following experiments. 

Table 2 

                                   rate of reaction
    p_CO2       p_CO         observed          calculated
    (mm.)       (mm.)      (mm. min.^-1)     (mm. min.^-1)
     308          92            36                23
     260         160            24                12
     232         200            13                 8.5
     212         240             7                 6.5
     192         280             4.5               4.5
     182         300             4                 4


Table 3 

                   k_1 x 10^4 (g.mol. min.^-1 atm.^-1 g.^-1)
    temp. (° C)        static          flow
       690              0.10           0.20
       709              0.36           0.36
       729              0.48           0.85
       750              1.40           1.55
       768              3.19           2.57
       788              4.62           4.47


The rate of increase of pressure is measured in the first few seconds after the 
introduction of carbon dioxide. The results should be equal to the rates of reaction 
at the pressures of carbon dioxide introduced, less corrections for the following 
factors. The pressure of the carbon dioxide is lowered by the initial transfer of oxygen 
to the surface, and the carbon monoxide so formed reduces the rate of increase of 
pressure both by its adsorption and by the consequent retardation. Now the 
corrections for all these factors tend to zero as the initial pressure is reduced; the rate 
of reaction, divided by the initial pressure of carbon dioxide and extrapolated to 
zero value of the latter, should thus be equal to the constant $k_1$ of equation (1). 
Table 3 shows that the values so obtained are in reasonable agreement with those 
determined by the flow method. 

(3) Summary of conclusions 

The experimental results can now be summarized and the rates of the individual 
stages of the reaction calculated from the measurements made by the flow method. 
The experiments with the static apparatus suggest that the important steps are the 
following: 


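$$\mathrm{CO_2} \xrightarrow{\;i_1\;} (\mathrm{O}) + \mathrm{CO}, \tag{8}$$

$$\mathrm{C} + (\mathrm{O}) \xrightarrow{\;j_3\;} \mathrm{CO}, \tag{9}$$

$$\mathrm{CO} \underset{j_2}{\overset{i_2}{\rightleftharpoons}} (\mathrm{CO}). \tag{10}$$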



The retardation by carbon monoxide is due to the adsorption of the gas on the 
reaction sites, which represent about 0.5 % of the total surface of the charcoal. 
Assuming that the sites are equivalent, independent and each capable of adsorbing 
one atom of oxygen or one molecule of carbon monoxide, it can be shown (Gadsby 
et al. 1946) that the rate of reaction in the steady state is given by 

$$\text{rate} = \frac{i_1 p_{\mathrm{CO_2}}}{1 + \dfrac{i_2}{j_2} p_{\mathrm{CO}} + \dfrac{i_1}{j_3} p_{\mathrm{CO_2}}}. \tag{11}$$

Equation (11) is of the same form as the rate expression, equation (1), and is thus 
consistent with the results of the flow experiments. If the constants in equation (11) 
are identified with the corresponding ones in equation (1), the following values are 
obtained for the rates of the individual stages of the reaction: 

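$$i_1 = k_1 = 10^{8.8 \pm 0.6}\, e^{-(58{,}800 \pm 2{,}000)/RT} \ \text{g.mol. min.}^{-1}\ \text{atm.}^{-1}\ \text{g.}^{-1}, \tag{12}$$

$$j_3 = k_1/k_3 = 10^{2.3 \pm 0.9}\, e^{-(28{,}700 \pm 4{,}100)/RT} \ \text{g.mol. min.}^{-1}\ \text{g.}^{-1}, \tag{13}$$

$$i_2/j_2 = k_2 = 10^{-7.9 \pm 0.7}\, e^{+(45{,}500 \pm 3{,}500)/RT} \ \text{atm.}^{-1}. \tag{14}$$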

The exponential terms in equations (12) and (13) give the activation energies of the 
corresponding processes. That in equation (14) represents the difference between 
the activation energies for adsorption and desorption, and thus the energy given out 
when carbon monoxide is adsorbed on the reaction sites. 
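Since $j_3$ is the ratio $k_1/k_3$, its activation energy is the difference of those of $k_1$ and $k_3$. The Python sketch below carries out this subtraction; combining the probable errors in quadrature is an assumption about how the quoted limits were obtained, and the variable names are hypothetical.

```python
import math

e1, err_e1 = 58.8, 2.0   # kcal., activation energy of k1 and its probable error
e3, err_e3 = 30.1, 3.6   # kcal., the same for k3

# j3 = k1/k3: the energies subtract; independent errors are assumed to
# combine in quadrature.
e_j3 = e1 - e3
err_e_j3 = math.hypot(err_e1, err_e3)
print(round(e_j3, 1), round(err_e_j3, 1))  # 28.7 4.1
```

The result, about 29 kcal., is the activation energy attributed to the conversion of adsorbed oxygen into gaseous carbon monoxide.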

To interpret the results on the basis of the alternative retardation mechanism 
(equation (27) Gadsby et al. 1946) would lead to the impossible activation energy of 
-17 kcal. for the reaction of carbon monoxide with adsorbed oxygen to reform 
carbon dioxide. The explanation of the retardation as due to the adsorption of 
carbon monoxide is thus to be preferred on energetic as well as on other grounds. 




The fact that the net energy required for the evaporation of carbon monoxide 
(46 kcal.) is greater than the observed activation energy (29 kcal.) for the conversion 
of adsorbed oxygen into gaseous carbon monoxide supports the distinction between 
adsorbed oxygen and adsorbed carbon monoxide. 

Discussion 

It is shown below that the main features of the reaction can be interpreted on the 
basis of simple theoretical principles. The general conclusions are compared with 
those reached by Semechkova & Frank-Kamenetzky (1940) about the reaction of 
carbon dioxide with sugar charcoal. 

The active sites. It is reasonable to suppose that the active sites consist of some 
of the less firmly bound carbon atoms, since the latter are best situated, not only to 
form the required bonds with oxygen or carbon monoxide, but also subsequently 
to detach themselves from the rest of the solid. Many of the surface atoms, as, for 
example, those in the middle of a layer of the graphite structure, must be exerting 
their full valency in combination with neighbouring atoms. In a micro-crystalline 
material, however, such lattice planes are of limited extent, and at their edges atoms 
forming fewer bonds will be found. If some of these are thought to be the active ones, 
the fact that they represent only a fraction of the total surface finds a natural 
explanation in terms of the structure of the charcoal. 

The first stage of the reaction. In the initial reaction of the carbon dioxide molecule 
with the surface an atom of oxygen is adsorbed and a molecule of carbon monoxide 
passes into the gas phase. It might have been expected, however, that the oxygen 
and the carbon monoxide would be adsorbed simultaneously. Carbon monoxide 
itself can be adsorbed alone, and the exothermicity of the change (40 kcal.) might 
well have facilitated the dissociation of the carbon dioxide; the fact that the active 
sites are those which do adsorb carbon monoxide strongly would also have been 
explained. Unless the special assumption that each active carbon atom is surrounded 
only by inactive ones is made, the experimental result implies that the carbon 
monoxide does not come into direct contact with the carbon, and consequently that 
the process takes place in an end-on collision of the carbon dioxide molecule with the 
surface. The simultaneous adsorption of the oxygen and the carbon monoxide would 
clearly require a broadside-on collision. That the end-on configuration is, in fact, 
the most favourable one for reaction is suggested by the work of Buschmann & 
Schafer (1941) on the dispersion of supersonic waves in carbon dioxide. It was found 
that translational energy is more readily transferred to the valency than to the 
deformation vibrations of the molecule. It is reasonable to suppose, therefore, that 
the end-on collisions are the most favourable for the conversion of translational 
energy into vibrational energy, and thus for the disruption of the molecule on 
reaction. 

The order of magnitude of the rate of this stage of the reaction can be accounted 
for theoretically as follows. It will be assumed that the change occurs when a carbon 
dioxide molecule collides with an active site, provided that energy greater than E* 
is available. For this purpose no allowance need be made, at least to a first 
approximation, for the possible effect of orientation. The nature of the process suggests 
that the activation energy may be shared between the translational motion of the 
carbon dioxide, the internal vibrations of the molecule and the vibration of the 
carbon atom to which the oxygen is transferred. It can be shown that the rate at 
which molecules of mass m and translational energy greater than E collide with 
a surface of area A is equal to 

$$\frac{Ap\,(1 + E/RT)\,e^{-E/RT}}{(2\pi mkT)^{1/2}},$$

where p is the pressure of the gas. A standard method (e.g. Fowler & Guggenheim 
1939, p. 497) then gives for the rate of reaction, when the activation energy E* is 
shared between the translational motion and s modes of vibration, 

$$\frac{Ap\,e^{-E^*/RT}}{(2\pi mkT)^{1/2}} \sum_{r=0}^{s+1} \frac{1}{r!}\left(\frac{E^*}{RT}\right)^{r}.$$

Here $E^*/RT$, which is of the order of 30, is much greater than s + 2, and thus a good 
approximation is 

$$\text{rate} = \frac{Ap}{(2\pi mkT)^{1/2}}\,\frac{1}{(s+1)!}\left(\frac{E^*}{RT}\right)^{s+1} e^{-E^*/RT}. \tag{15}$$
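The quality of this approximation can be judged by comparing the largest term of the sum over r with the whole sum. The Python sketch below does so for $E^*/RT = 30$ and s = 3 (the value of s adopted later in the text); the function names are hypothetical.

```python
import math


def full_sum(x, s):
    """Sum over r = 0 .. s+1 of x**r / r!, with x = E*/RT."""
    return sum(x**r / math.factorial(r) for r in range(s + 2))


def leading_term(x, s):
    """Largest term of the sum, x**(s+1) / (s+1)!, as kept in equation (15)."""
    return x ** (s + 1) / math.factorial(s + 1)


x, s = 30.0, 3
ratio = leading_term(x, s) / full_sum(x, s)
print(ratio)
```

For these inputs the retained term is 33750 out of a total of 38731, so that equation (15) accounts for about 87 % of the full expression, which is adequate for an estimate of the order of magnitude.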


Since the experimental results have been represented by an expression of the form 

$$\text{rate} = k_{01}\,p\,e^{-E_1/RT}, \tag{16}$$

the true activation energy E* should be equal to $E_1 + (s + \tfrac{3}{2})RT$, and the theoretical 
value of the non-exponential factor should be given by 

$$k_{01} = \frac{A}{(2\pi mkT)^{1/2}\,(s+1)!}\left[\frac{E_1 + (s + \tfrac{3}{2})RT}{RT}\right]^{s+1} e^{-(s + 3/2)}. \tag{17}$$


The value of s is taken to be 3, since the vibration of the carbon atom perpendicular 
to the surface and the two valency vibrations of the carbon dioxide are the modes 
which might reasonably be expected to contribute to the activation energy. A, the 
total effective area of the active sites, is assumed on the basis of the adsorption 
measurements to be 0.5 % of the total surface and thus to be $1.5 \times 10^{4}$ cm.$^2$ g.$^{-1}$. 
For $E_1$ the experimental value of 59 kcal. is substituted; as the mean temperature 
of the measurements is 782° C, E* is equal to 68 kcal. If the result is expressed in 
g.mol. min.$^{-1}$ atm.$^{-1}$ g.$^{-1}$, the calculated value of $k_{01}$ becomes $10^{8.0}$, which is to be 
compared with the observed value of $10^{8.8 \pm 0.6}$. The agreement obtained suggests 
that the rate of reaction is determined mainly by the energetic considerations 
outlined above, and can thus be understood in terms of familiar kinetic principles. 
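The theoretical value of $k_{01}$ can be reproduced approximately with the Python sketch below. It assumes an active area of $1.5 \times 10^{4}$ cm.$^2$ g.$^{-1}$ (0.5 % of $3 \times 10^{6}$ cm.$^2$ g.$^{-1}$), a true activation energy $E^* = E_1 + (s + \tfrac{3}{2})RT$, and hence a factor $e^{-(s+3/2)}$ in the non-exponential term; the function name and the modern values of the physical constants are not from the paper.

```python
import math

N_A = 6.02214e23     # molecules per g.mol.
K_B = 1.380649e-16   # Boltzmann constant, erg K^-1
R = 1.98720e-3       # kcal. g.mol.^-1 K^-1
ATM = 1.01325e6      # dyn cm^-2 per atm.


def k01_theory(area, e1, temp_k, s, molar_mass=44.01):
    """Non-exponential factor of equation (17), g.mol. min^-1 atm^-1 g^-1.

    area is the active area in cm^2 per g.; e1 is in kcal.
    The factor exp(-(s + 3/2)) arises from E* = E1 + (s + 3/2)RT.
    """
    m = molar_mass / N_A
    collisions = area * ATM / math.sqrt(2.0 * math.pi * m * K_B * temp_k)
    x = e1 / (R * temp_k) + s + 1.5              # E*/RT
    per_second = collisions * x ** (s + 1) / math.factorial(s + 1)
    per_second *= math.exp(-(s + 1.5))
    return per_second * 60.0 / N_A               # molecules sec^-1 to g.mol. min^-1


value = k01_theory(area=1.5e4, e1=59.0, temp_k=782.0 + 273.15, s=3)
print(math.log10(value))
```

Under these assumptions the logarithm of the calculated factor is close to 8, within two probable errors of the observed value of 8.8 ± 0.6.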

The second stage of the reaction. General theory suggests that the rate at which 
adsorbed oxygen forms gaseous carbon monoxide should be represented 
approximately by the expression 

$$j_3 = n\nu\, e^{-E_j/RT}. \tag{18}$$

The quantity $\nu$ is the frequency of the appropriate vibration, and $e^{-E_j/RT}$ the 
chance that the required energy is available; n represents the amount of carbon 
monoxide equivalent to the total number of active sites, because $j_3$ relates the rate 
of reaction to the fraction of the sites occupied. Since the observed activation energy 
of 29 kcal. is equal to $E_1 - E_3$, it is probable, in view of the correction made to $E_1$, 
that $E_j$ is greater by $(s + \tfrac{3}{2})RT$ and is thus equal to 38 kcal. The frequency $\nu$ should 
consequently be calculated from the relation 

$$j_{03} = n\nu\, e^{-(s + 3/2)}. \tag{19}$$

On substituting the observed value of the non-exponential factor $j_{03}$ recorded in 
equation (13), and the experimental value of $1.5 \times 10^{-5}$ g.mol. g.$^{-1}$ for n, it is found 
that $\nu$ is equal to $10^{7.3 \pm 0.9}$ sec.$^{-1}$. 
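The frequency can be recovered from the observed non-exponential factor by the arithmetic sketched below in Python. The sketch assumes that the factor of equation (13) is $10^{2.3}$ g.mol. min.$^{-1}$ g.$^{-1}$, that n is $1.5 \times 10^{-5}$ g.mol. g.$^{-1}$, that s = 3 as for the first stage, and that the correction to the activation energy introduces a factor $e^{-(s+3/2)}$; the variable names are hypothetical.

```python
import math

j03 = 10.0 ** 2.3   # g.mol. min^-1 g^-1, non-exponential factor of j3
n = 1.5e-5          # g.mol. g^-1, assumed amount equivalent to the active sites
s = 3               # number of vibrational modes, as for the first stage

# j03 = n * nu * exp(-(s + 3/2)); dividing by 60 converts min^-1 to sec^-1.
nu = j03 / 60.0 / n * math.exp(s + 1.5)
print(math.log10(nu))
```

The logarithm obtained is 7.3, in agreement with the value stated, and some six powers of ten below the frequency of a normal vibration.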

A simple kinetic treatment of the evaporation of solids (Frenkel 1946, p. 1) 
identifies $\nu$ with the frequency of the free vibrations performed by the surface atoms, 
which is presumably about $10^{13}$ sec.$^{-1}$. The application of the transition state theory to 
evaporation (Glasstone, Laidler & Eyring 1941, p. 354) gives the factor kT/h, which is 
also of the order of $10^{13}$ sec.$^{-1}$ at the temperatures under discussion. The calculations 
of Polanyi & Wigner (1928) and of Slater (1939) suggest that, in general, when one 
particular bond must be broken for reaction to occur, $\nu$ should be of the same order 
of magnitude as the frequency of the normal vibrations. Thus, although the 
experimental value is not a very accurate one, it would seem that some factor tending to 
reduce the rate of reaction remains to be taken into account. Low probabilities of 
reaction are often due to the necessity for a transition between two different types of 
electronic state. Another possible explanation may lie in the fact that the carbon 
atom carrying the oxygen atom is probably attached to at least two neighbouring 
carbon atoms. More than one bond must therefore be broken, and a relatively rare 
coincidence of the phases of a considerable number of vibrations might consequently 
be required. 

The retardation by carbon monoxide. The conclusion that the retardation is due 
to the adsorption of carbon monoxide is supported by the observed variation of k 2 
with temperature. From the latter it is deduced that the heat of adsorption is 
46 kcal., while theoretically a large energy change of this magnitude, which implies 
the formation of a chemical bond with the surface, is required for the gas to be 
strongly adsorbed. The problem of why the sites which react with carbon dioxide 
should necessarily take up carbon monoxide is discussed in the next paper; here it 
will be shown that, given the experimental heat of adsorption, k 2 is of a theoretically 
reasonable order of magnitude. 

The kinetic derivation of the Langmuir isotherm leads to the expression 

$$\theta = \frac{(i_2/j_2)\,p}{1 + (i_2/j_2)\,p} \tag{20}$$

for the fraction ($\theta$) of the active sites occupied at a given pressure (p), where $i_2$ 
and $j_2$ are the rate constants for adsorption and desorption respectively. As $k_2$ has 
already been identified with the ratio $i_2/j_2$ (equation (14)), it should thus be equal 
to the characteristic constant of the Langmuir isotherm. Statistical mechanics 
gives for the non-exponential part of the constant 

$$k_{02} = \frac{h^3}{(2\pi mkT)^{3/2}\,kT}\;\frac{h^2}{8\pi^2 IkT}\;\frac{f_v(\text{ads.})}{f_v(\text{gas})}, \tag{21}$$


where m is the mass and I the moment of inertia of the carbon monoxide molecule, 
h is Planck's constant and $f_v$ is the vibrational partition function (Fowler & 
Guggenheim 1939, p. 427). The product of the vibrational partition functions is unlikely to 
differ from unity by much more than a factor of 10 and is thus not a very important 
term; it can, with sufficient accuracy, be written $(1 - e^{-h\nu_a/kT})^{-5}$. The frequency of the 
isolated molecule is assumed to be unchanged on adsorption and the five additional 
frequencies are given an average value, $\nu_a$, of 1000 cm.$^{-1}$. Since the increase of the 
vibrational term and the decrease of the other terms with temperature roughly 
cancel, no correction is made for the temperature dependence of $k_{02}$. If the result is 
expressed in atm.$^{-1}$, the calculated value of $k_{02}$ becomes $10^{-10.1}$, which is roughly 
similar to the observed value of $10^{-7.9 \pm 0.7}$. The measure of agreement obtained 
provides further support for the proposed mechanism of reaction. 
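Equation (21) can be evaluated directly, as in the Python sketch below. The temperature is taken as the mean temperature of the measurements, 782° C; the moment of inertia of carbon monoxide ($1.45 \times 10^{-39}$ g. cm.$^2$), the modern values of the physical constants and the function name are assumptions that do not appear in the paper.

```python
import math

H = 6.62607e-27      # Planck constant, erg sec.
K_B = 1.380649e-16   # Boltzmann constant, erg K^-1
C = 2.99792e10       # velocity of light, cm. sec^-1
N_A = 6.02214e23     # molecules per g.mol.
ATM = 1.01325e6      # dyn cm^-2 per atm.


def k02_theory(temp_k, wavenumber=1000.0, inertia=1.45e-39, molar_mass=28.01):
    """Non-exponential factor of equation (21), in atm^-1.

    The vibrational ratio is written (1 - exp(-h*nu_a/kT))**-5.
    inertia (g. cm^2) is an assumed value for carbon monoxide.
    """
    kt = K_B * temp_k
    m = molar_mass / N_A
    translation = H**3 / ((2.0 * math.pi * m * kt) ** 1.5 * kt)  # cm^3 erg^-1
    rotation = H**2 / (8.0 * math.pi**2 * inertia * kt)
    vibration = (1.0 - math.exp(-H * C * wavenumber / kt)) ** -5
    return translation * rotation * vibration * ATM


value = k02_theory(temp_k=782.0 + 273.15)
print(math.log10(value))
```

With these inputs the logarithm is close to -10.1, the calculated value given in the text.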

Comparison with previous work. The reaction of carbon dioxide with pure sugar 
charcoal, with sugar charcoal containing iron and aluminium and with coke has been 
studied by Semechkova & Frank-Kamenetzky (1940), who used the static method. 
The fact that the behaviour of all these materials was found to follow a rate expression 
of the same form as equation (1) and could be interpreted by a mechanism similar 
to that deduced for coconut shell charcoal supports the suggestion of a kinetic scheme 
characteristic of carbon in general. A detailed comparison with the present 
investigation reveals, however, a number of features which require further comment. 

Although the possibility that only certain sites are active was not mentioned by 
Semechkova & Frank-Kamenetzky, it is not excluded by their experimental results. 
The gas adsorbed was estimated to cover an area similar to that found for the total 
surface by means of methylene blue, but it is now known (e.g. Brunauer 1944, p. 277) 
that the dye adsorption method usually gives values which are much too low. In the 
scheme proposed for the reaction mechanism, no distinction was made between 
adsorbed oxygen and adsorbed carbon monoxide. The effect of carbon monoxide 
was not, in fact, determined directly, so it was assumed, in order to evaluate the 
rate constants $k_1$ and $k_3$, that 

$$k_2 p_{\mathrm{CO}} + k_3 p_{\mathrm{CO_2}} \approx k_3 p_{\mathrm{CO_2}} \tag{22}$$

at comparable values of $p_{\mathrm{CO}}$ and $p_{\mathrm{CO_2}}$. For coconut shell charcoal from 734 to 
829° C, $k_2$ decreases from 93 to 13 atm.$^{-1}$, while $k_3$ increases from 1.0 to 3.8 atm.$^{-1}$; 
the application of equation (22) would thus lead to erroneous values for the constants. 
With another type of charcoal, however, the carbon monoxide term might be less 
important. 

For the first stage of the reaction Semechkova & Frank-Kamenetzky found an 
activation energy of 27 kcal. and a steric factor of $10^{-\text{[illegible]}}$. Since practically identical 




values were obtained for all the materials studied, it was argued that the process is 
not affected by the state of the carbon surface, and therefore that an energy of 
27 kcal. possessed solely by the carbon dioxide molecule should invariably be 
required. The present investigation gives an activation energy of 68 kcal., which is 
thought to be shared among all the degrees of freedom concerned; moreover, no 
steric factor need be introduced. There is much greater similarity between the 
results obtained for the second stage of the reaction. The observed activation energy 
is 61 kcal. for pure sugar charcoal (0.07 % ash) and 23 kcal. for a sample containing 
7 % of iron; the corresponding non-exponential factors are $10^{9}$ and $10^{\text{[illegible]}}$ sec.$^{-1}$ 
respectively. This range of values includes the figures of 29 kcal. and $10^{7}$ sec.$^{-1}$ now 
obtained for a coconut shell charcoal of which the iron content (0.15 %) is 
intermediate between the above extremes. In view of the marked influence of the 
impurities, however, it is clear that the satisfactory correlation of the detailed 
behaviour of different types of carbon must await the elucidation of the fundamental 
features of the catalysis. 

The authors are grateful to Professor C. N. Hinshelwood, F.R.S., for valuable 
advice and criticism. This work forms part of the programme of the Fuel Research 
Board of the Department of Scientific and Industrial Research, and this paper is 
published by permission of the Department. One of the authors (J.G.) was a member 
of the Department's staff. The illustrations are Crown copyright and are published 
by permission of the Controller of His Majesty's Stationery Office. 


References 

Broom, W. E. J. & Travers, M. W. 1932 Proc. Roy. Soc. A, 135, 512.
Brunauer, S. 1944 The adsorption of gases and vapours. Vol. 1, Physical adsorption. Oxford University Press.
Buschmann, K. F. & Schafer, K. 1941 Z. phys. Chem. B, 50, 73.
Coates, G. E. 1944 J. Sci. Instrum. 21, 86.
Fowler, R. H. & Guggenheim, E. A. 1939 Statistical thermodynamics. Cambridge University Press.
Frenkel, J. 1946 Kinetic theory of liquids. Oxford University Press.
Gadsby, J., Hinshelwood, C. N. & Sykes, K. W. 1946 Proc. Roy. Soc. A, 187, 129.
Gadsby, J. & Sykes, K. W. 1948 Proc. Roy. Soc. A, 193, 400.
Glasstone, S., Laidler, K. J. & Eyring, H. 1941 The theory of rate processes. New York: McGraw-Hill.
Langmuir, I. 1915 J. Amer. Chem. Soc. 37, 1139.
Polanyi, M. & Wigner, E. 1928 Z. phys. Chem. A, 139, 439.
Semechkova, A. F. & Frank-Kamenetzky, D. A. 1940 Acta Physicochim. U.R.S.S. 12, 879.
Slater, N. B. 1939 Proc. Camb. Phil. Soc. 35, 56.



The mechanism of the steam-carbon reaction 

By F. J. Long and K. W. Sykes, Physical Chemistry Laboratory, Oxford

(Communicated by C. N. Hinshelwood, F.R.S. - Received 27 November 1947 - Read 29 April 1948)


The detailed mechanism of the reaction between steam and coconut shell charcoal has been
studied by the method described in the preceding paper. The temperature has been varied in
the range 680 to 800° C and the pressures of the gases from 10 to 760 mm.

Steam first reacts with the carbon to give oxygen and hydrogen atoms separately adsorbed
on neighbouring sites. An initial dissociation into an adsorbed hydrogen atom and an adsorbed
hydroxyl radical is probably followed by the more rapid transfer of the second hydrogen atom
to the carbon. Only about 2 % of the total surface takes part in the reaction; those sites are
distinct from the smaller group which reacts with carbon dioxide, but they are also thought to
be atoms at the edges of lattice planes. The rate of the first stage can be accounted for by
assuming that reaction occurs in those collisions in which the combined energy of the incident
steam molecule and the two active carbon atoms exceeds 75 kcal. Adsorbed hydrogen evaporates
rapidly, but in the steady state much remains on the surface. A close correlation has
been observed between the fraction of the active sites occupied by hydrogen and the extent
to which the reaction is retarded by that gas. Adsorbed oxygen reacts much more slowly to
form gaseous carbon monoxide; the latter, which has no retarding effect, is not appreciably
adsorbed by the sites accessible to steam. The activation energy for the conversion of an
adsorbed oxygen atom into gaseous carbon monoxide is found to be 55 kcal., and the
non-exponential factor to be $10^{11.0 \pm 1.7}$ sec.$^{-1}$, which may be compared with the
value of $10^{13}$ sec.$^{-1}$ predicted by simple theory.

As the active carbon atoms are thought to be exerting less than their maximum valency, it 
is suggested that the two types differ in the number of extra bonds which they can form. 
Energetic considerations show that whereas those which can form a single bond should react 
with steam, only the relatively few capable of forming a double bond should react with 
carbon dioxide. This theory also explains why hydrogen is strongly adsorbed by both the 
steam and the carbon dioxide sites, but carbon monoxide only by the latter type. 

The relation of these views to outstanding problems of the oxygen-carbon and nitrous 
oxide-carbon reactions is discussed, and an explanation of the main kinetic features of those 
processes is given. 


Introduction 


It has previously been shown (Gadsby, Hinshelwood & Sykes 1946) that carbon
monoxide is the primary product of the action of steam on charcoal, and that the
rate of reaction depends on the partial pressures of steam and hydrogen according
to the relation

$$\text{rate} = \frac{k_1 p_{H_2O}}{1 + k_2 p_{H_2} + k_3 p_{H_2O}}. \qquad (1)$$


In the present paper an investigation of the detailed mechanism of the process is 
described. The method is analogous to that explained in the preceding paper on the 
carbon dioxide reaction; the adsorption of the gases is studied by the static technique 
and the variation of the rate of reaction with temperature is measured by the flow 
technique. In the discussion an attempt is made to account for the mechanisms of 
both the steam and carbon dioxide reactions by the same fundamental considerations. 


Experimental technique 

The improved flow apparatus in which the spatial distribution of temperature is 
controlled automatically is used without further change. 

In the static apparatus provision is made to introduce a known amount of steam 
and to analyze the resulting gaseous mixture. Each experiment is made with 10 g. 
of charcoal in a reaction vessel of volume 290 c.c. Water, freed from dissolved air by 
repeated freezing and evacuation, is vaporized into a storage vessel immersed in an 
oil thermostat at 75° C and fitted with a mercury manometer; the decrease in pressure
of the latter measures the amount of steam subsequently transferred to the
reaction vessel. All manometers and other tubing outside the thermostat are kept
at about 80° C by electrical heating. Trial experiments in the absence of charcoal
showed that up to 260 mm. of steam could be admitted to the storage vessel and
shared to give 170 mm. in the reaction vessel without detectable condensation. Gas
samples are withdrawn through a phosphorus pentoxide tube into an evacuated
receiver. The total amount of gas extracted is measured by the decrease of pressure
in the reaction vessel; the amount left after the removal of the steam by the phosphorus
pentoxide is proportional to the pressure developed in the receiver, which is
observed with respect to vacuum by means of a butyl phthalate manometer. By
comparison with calibration experiments, in which argon is withdrawn from the
reaction vessel, a reasonable estimate of the steam pressure can easily be obtained.
The remaining mixture of hydrogen, carbon monoxide and carbon dioxide is 
analyzed with a Haldane apparatus. 

All other features of the experimental technique are as described in the preceding 
paper. Since it has been shown that the carbon dioxide reaction is not appreciably 
affected by diffusion in either the flow or the static experiments, it will be assumed 
that under similar conditions the same is true for the steam reaction, the rate of 
which is of the same order of magnitude. 

Results 

(1) The static method 

The course of reaction 

Figure 1 illustrates the results of a series of experiments in which the same amount 
of steam is allowed to remain in contact with the charcoal for a series of different 
times; the curves are typical of those obtained at pressures of 50 to 150 mm. in the
temperature range 700 to 800° C. In the upper part of the figure the partial pressures
of the gases are plotted against time. The initial pressure, $p^0_{H_2O}$, is the pressure
which the steam would exert if neither adsorption nor reaction occurred, and is
calculated, on the basis of calibration experiments with argon, from the amount of
steam introduced. The lower part of the figure shows the corresponding changes in
the amounts of oxygen and hydrogen adsorbed, taken respectively to be

$$p^0_{H_2O} - (p_{H_2O} + p_{CO} + 2p_{CO_2}) \quad \text{and} \quad p^0_{H_2O} - (p_{H_2O} + p_{H_2}).$$




The amounts adsorbed are thus expressed as the pressures which they would exert 
in the gas phase, the hydrogen as such and the oxygen as carbon monoxide. These 
stoichiometric definitions make no assumptions about the state of the oxygen and 
hydrogen on the surface, and thus include that present either as steam or in any 
other form. 
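
The stoichiometric definitions above can be illustrated by a minimal sketch. It assumes that the amount of hydrogen adsorbed is the initial steam pressure less the steam and hydrogen remaining in the gas phase, and that the oxygen adsorbed is the initial steam pressure less the steam, carbon monoxide and twice the carbon dioxide in the gas phase. The function name and the pressure readings are hypothetical and are not taken from the experiments.

```python
def adsorbed_amounts(p0_h2o, p_h2o, p_h2, p_co, p_co2):
    """Return (oxygen, hydrogen) adsorbed, in the same pressure units as the inputs.

    Oxygen is counted as carbon monoxide and hydrogen as H2, so one molecule
    of CO2 in the gas phase accounts for two equivalents of oxygen.
    """
    oxygen = p0_h2o - (p_h2o + p_co + 2.0 * p_co2)
    hydrogen = p0_h2o - (p_h2o + p_h2)
    return oxygen, hydrogen


# Hypothetical gas-phase readings in mm. (illustrative values only).
oxygen, hydrogen = adsorbed_amounts(
    p0_h2o=100.0, p_h2o=40.0, p_h2=35.0, p_co=20.0, p_co2=5.0
)
print(oxygen, hydrogen)
```

If steam as such were the only adsorbed species, the two returned quantities would be equal at all times; any difference between them therefore points to dissociation on the surface.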

At first the pressure falls rapidly as steam is adsorbed. If the steam molecule as 
such was the only adsorbed species, the amount of oxygen adsorbed would be equal 
to the amount of hydrogen adsorbed throughout the reaction. Figure 1 shows, 
however, that the two quantities are by no means equal and that they vary with 
time in entirely different ways. Although it is clear that much of the surface is 
covered by hydrogen in the later stages, the possibility of explaining the results in 
terms of steam and hydrogen as the adsorbed species is ruled out by the observation 
that initially the oxygen is in excess of the hydrogen. It is concluded, therefore, that 
steam is dissociated into oxygen and hydrogen separately attached to neighbouring 
sites. This decomposition of the steam molecule may be regarded as a natural result 
of the fact that adsorption at high temperatures requires the formation of chemical
bonds with the surface. The simplest, and consequently the most probable,
mechanism is the initial formation of an adsorbed hydrogen atom and an adsorbed
hydroxyl radical, followed by the transfer of a hydrogen atom from the hydroxyl
to the carbon; the first stage of the reaction may thus be written:

$$\mathrm{H_2O \rightleftharpoons (OH)(H) \rightleftharpoons (O)(H_2)}. \qquad (2)$$

Hydrogen evaporates rapidly, as shown by the sharp initial rise in its partial 
pressure, but a steady state in which a considerable proportion remains on the 
surface is soon reached. The continual increase during the later stages in the amount 
of hydrogen adsorbed for a given partial pressure suggests that this gas progressively 
occupies the sites left vacant by the reaction of the adsorbed oxygen. It is therefore 
very probable that the adsorption of hydrogen is the factor which causes the reaction 
to be retarded by that gas; the suggestion that the adsorption of steam requires its 
dissociation into oxygen and hydrogen explains why steam and hydrogen compete 
for the same sites. The part played by hydrogen can thus be expressed by writing:

$$\mathrm{H_2 \rightleftharpoons (H_2)}. \qquad (3)$$

The adsorbed oxygen leaves the surface much more slowly, as can be seen from
the relative pressures of carbon monoxide and hydrogen during the first few minutes.
In the end, however, it is almost completely converted into gaseous carbon monoxide,
together with a small proportion of carbon dioxide, and the amount adsorbed falls
to less than 2 mm. Figure 9 of the preceding paper shows that when carbon monoxide
is present alone, at a pressure of 62 mm., the final value reached in figure 1, the
amount adsorbed is 12 mm. It must thus be supposed that the sites which normally
take up carbon monoxide are now preferentially occupied by hydrogen. The fact
that under the present conditions carbon monoxide has little tendency to return to
the surface explains the observation (Gadsby, Hinshelwood & Sykes 1946) that this
gas does not retard the steam reaction. The results also support the previous conclusion
that the carbon dioxide is formed secondarily from carbon monoxide and
steam. At these pressures the latter reaction is of approximately the first order with
respect to each gas, so the proportion of carbon dioxide rises as the pressure of
carbon monoxide increases and then falls on account of the decrease in the steam
pressure. The remaining stage of the reaction can thus be represented by the equation

$$\mathrm{C + (O) \rightarrow CO}. \qquad (4)$$

Figure 1. The course of reaction at an initial steam pressure of 100 mm. at 750° C:
partial pressures in the gas phase and adsorption on the carbon surface.




It can be shown that the mechanism proposed above is consistent with the
dependence of the amounts adsorbed during the course of the reaction on temperature
and pressure. The changes which occur when steam is introduced at a
series of different pressures are illustrated in figure 2, where the amounts of oxygen
and hydrogen found on the surface are plotted against time in the upper and lower
portions of the diagram respectively. Both these quantities increase as the initial
pressure is raised, the hydrogen almost proportionally and the oxygen to a slightly
smaller extent. Such variations are to be expected, since the reaction is of nearly
the first order with respect to steam at these pressures and is progressively retarded
as the pressure of hydrogen is increased.

Figure 2. Adsorption on the carbon surface during reaction at a series of different steam
pressures at 750° C. Initial pressure of steam: ○ 150 mm.; ◐ 100 mm.; ● 55 mm.



Figure 3. Adsorption on the carbon surface during reaction at a series of different
temperatures. Initial steam pressure 150 mm. ○ 800° C; ◐ 750° C; ● 700° C.


Figure 3 shows, in a similar way, the effect of introducing a given pressure of
steam at a series of different temperatures. Although an increase of temperature from
700 to 800° C raises the initial rate of adsorption, it does not alter very markedly
the amounts on the surface during the greater part of the reaction. However, when
allowance is made for the decrease of the overall pressure of steam caused by the
increase in the rate of reaction, the amount of oxygen adsorbed for a given pressure
of steam is found to be greater the higher the temperature. This conclusion is in
good agreement with the flow experiments, which yield a higher activation energy
for the initial transfer of oxygen to the surface (75 kcal.) than for its subsequent
evaporation as carbon monoxide (55 kcal.). It can also be seen from figure 3 that the
amount of hydrogen finally adsorbed is roughly independent of temperature; this
observation is consistent with the lack of variation revealed by the flow experiments
in the extent to which the reaction is retarded by hydrogen ($k_2$, figure 9).

The adsorption of hydrogen 

A known quantity of the gas is introduced, and the difference between the observed 
pressure and the calculated initial value, which represents the amount adsorbed, 
is plotted against time in figure 4. No correction need be made for the comparatively
slow formation of methane, since even the equilibrium proportion does not exceed 
1 % under the present conditions. It is clear that hydrogen itself is strongly adsorbed 
at a rate similar to that of steam, as required by the proposed mechanism of reaction. 
The typical carbon monoxide curve, which is included for comparison, emphasizes 
that this latter gas combines with the charcoal, not only much less rapidly, but also 
to a considerably smaller extent. 



Figure 4. The rate of adsorption of hydrogen. ○ 800° C; ● 760° C.

In figure 5 the amount of hydrogen adsorbed after a period of 20 min. is plotted
against the pressure of that remaining in the gas phase. Although during this time
the rate of adsorption falls to a very small fraction of its initial value, equilibrium is
not reached. For the present purpose, however, the subsequent changes can be
neglected because they are too slow to play any appreciable part in the reactions
under consideration. To show that the experimental points can be represented
adequately by the Langmuir isotherm, the curves are calculated from the equation

$$\text{amount adsorbed} = q = \frac{q_0 b p_{H_2}}{1 + b p_{H_2}}. \qquad (5)$$

The values of the constants $q_0$ and $b$, which are obtained from the straight line drawn
by the method of least squares on the graph of $p_{H_2}/q$ against $p_{H_2}$, are recorded in
table 1; the probable errors are calculated from the residuals in the usual way.
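
The least-squares procedure just described can be sketched as follows. The sketch assumes the usual linear form of the Langmuir isotherm, $p/q = 1/(q_0 b) + p/q_0$, so that the slope of $p/q$ against $p$ gives $1/q_0$ and the ratio of slope to intercept gives $b$. The data points are synthetic, generated from the 750° C constants of table 1 purely for illustration; they are not the measured points of figure 5, and the function names are hypothetical.

```python
def langmuir(p, q0, b):
    """Langmuir isotherm: amount adsorbed at pressure p."""
    return q0 * b * p / (1.0 + b * p)


def fit_langmuir(pressures, amounts):
    """Fit q0 and b from a least-squares line through the points (p, p/q)."""
    xs = list(pressures)
    ys = [p / q for p, q in zip(pressures, amounts)]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx                      # equals 1 / q0
    intercept = mean_y - slope * mean_x    # equals 1 / (q0 * b)
    return 1.0 / slope, slope / intercept


# Synthetic points (pressures in atm.) from q0 = 3.8e-5 g.mol. per g., b = 27 atm.^-1.
pressures = [0.02, 0.05, 0.10, 0.20, 0.40]
amounts = [langmuir(p, 3.8e-5, 27.0) for p in pressures]
q0_fit, b_fit = fit_langmuir(pressures, amounts)
print(q0_fit, b_fit)
```

With noise-free synthetic data the fit simply returns the constants used to generate the points; with real measurements the scatter of the residuals about the line is what would give the probable errors.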



Figure 5. The amount of hydrogen adsorbed. Comparison of experimental points with
curves calculated according to equation (5) from the values of the constants given in table 1.
○ 850° C; ◐ 800° C; ● 750° C.

Table 1

temp.      $q_0$                                  $b$
(° C)      (g.mol. g.$^{-1}$ × 10$^{5}$)          (atm.$^{-1}$)
750        3.8 ± 0.2                              27 ± 3
800        4.0 ± 0.2                              42 ± 8
850        4.5 ± 0.1                              50 ± 7

For a given pressure of hydrogen, the amount adsorbed under these conditions
is about 90 % of that found on the surface during the steam reaction. There, however,
equilibrium is approached from the other side, as the hydrogen is originally formed
in the adsorbed state. The quantity $q_0$ represents the limiting amount adsorbed at
high pressures and is thus a measure of the total number of sites available. It tends
to increase slightly as the temperature is raised, but is roughly equal to $4 \times 10^{-5}$
g.mol./g. of charcoal. The pronounced curvature of the isotherms suggests that the
hydrogen adsorbed at these pressures occupies a considerable fraction of the active
surface, and is consequently capable of causing the observed retardation. As the
adsorption of a gas is accompanied by a marked decrease of entropy, the process
will take place to an appreciable extent at high temperatures only if it is strongly
exothermic; the constant $b$, which is a measure of the tendency for adsorption, should
thus decrease exponentially as the temperature is raised. Here, however, $b$ tends to
vary in the opposite sense, so it must be supposed that a true steady state is not
established. This explanation is supported by the work of Barrer (1935), who discovered
that the time required for equilibrium to be reached is as long as 2 or 3 days;
from the temperature dependence of the results so obtained, the heat given out on
adsorption was calculated to be 50 kcal. The magnitude of this value shows clearly
that the strong adsorption of the hydrogen molecule is due to its dissociation into
atoms, each of which forms a chemical bond with a previously unsaturated carbon
atom. Barrer also found that only a small fraction of the total surface is involved,
and that the Langmuir isotherm is obeyed; another paper (Barrer 1936a) discusses
the conditions under which adsorption with dissociation leads to an isotherm of
this type.
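
The size of the discrepancy can be made concrete with a rough sketch. It assumes, as an illustration only, that at true equilibrium $b$ would vary as $e^{Q/RT}$ with $Q$ equal to the 50 kcal. quoted from Barrer, and compares the expected change between 750 and 800° C with the change recorded in table 1. The function name and the use of $R = 1.987$ cal. per degree per g.mol. are choices made here, not part of the original analysis.

```python
import math

R_CAL = 1.987  # gas constant, cal. per degree per g.mol.


def expected_b_ratio(heat_cal, t1_celsius, t2_celsius):
    """Ratio b(T2)/b(T1) expected if b varied as exp(Q/RT) at equilibrium."""
    t1 = t1_celsius + 273.15
    t2 = t2_celsius + 273.15
    return math.exp(heat_cal / R_CAL * (1.0 / t2 - 1.0 / t1))


ratio_expected = expected_b_ratio(50_000.0, 750.0, 800.0)
ratio_observed = 42.0 / 27.0  # values of b at 800 and 750 deg. C from table 1
print(ratio_expected, ratio_observed)
```

On this assumption $b$ would be expected to fall to roughly a third of its value over the 50° interval, whereas the measured constant rises, which is the sense of the argument that a true steady state is not established in 20 min.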

The active sites 

Carbon dioxide was found to react not with the whole of the charcoal surface, but
only with certain active sites representing about 0.5 % of the total area. The properties
attributed to those sites will not, however, account for the results described
above. For example, carbon monoxide retards the action of carbon dioxide by being
strongly adsorbed and yet has no appreciable effect on that of steam. It is concluded
that steam reacts predominantly with a different type of site, which evidently has
little tendency either to take up carbon monoxide or to be attacked by carbon
dioxide. Hydrogen, of which up to $4 \times 10^{-5}$ g.mol. g.$^{-1}$ can be adsorbed as compared
with only $1.5 \times 10^{-5}$ g.mol. g.$^{-1}$ of carbon monoxide, must be supposed to occupy
both groups of sites, in order to explain why the carbon monoxide formed in the
steam reaction does not return to the surface to any appreciable extent. This conclusion
is supported by the previous observation (Gadsby et al. 1946) that the
carbon dioxide reaction is retarded by hydrogen as well as by carbon monoxide.
Since considerations to be discussed later suggest that one type of site can accommodate
either two atoms of hydrogen or a molecule of carbon monoxide and the
other only one atom of hydrogen, the above figures lead to an estimate of 3.3 for the
ratio of the numbers of sites in the two separate groups.
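
One way of arriving at the figure of 3.3 is sketched below. The bookkeeping is an interpretation of the sentence above rather than a calculation given in the paper: each carbon dioxide site is assumed to hold one molecule of carbon monoxide or two atoms of hydrogen, and each steam site one atom of hydrogen. The function name is hypothetical.

```python
def site_ratio(h2_capacity, co_capacity):
    """Ratio of steam sites to carbon dioxide sites from the limiting adsorptions.

    h2_capacity and co_capacity are the limiting amounts adsorbed (g.mol. per g.).
    """
    hydrogen_atoms = 2.0 * h2_capacity          # g.atoms of hydrogen held in all
    co2_sites = co_capacity                     # one CO molecule per site
    atoms_on_co2_sites = 2.0 * co2_sites        # two hydrogen atoms per such site
    steam_sites = hydrogen_atoms - atoms_on_co2_sites  # one atom per steam site
    return steam_sites / co2_sites


ratio = site_ratio(h2_capacity=4.0e-5, co_capacity=1.5e-5)
print(round(ratio, 1))
```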

If a given fraction of one set of sites is occupied during the steam reaction, then
approximately the same fraction of the other set should be occupied under analogous
conditions in the carbon dioxide reaction; this conclusion follows from the similarity
of the kinetic expressions. The maximum values of the total amounts of gas adsorbed
in the course of the two reactions are compared in table 2.

These quantities are not equal to each other, but the last column shows that their
ratio is roughly independent of both temperature and pressure; they can thus be
considered to represent similar fractions of two different groups of sites. The average
value of the ratio recorded in table 2 being 4.3, roughly 3.3 g.mol. are adsorbed on
the steam sites for every 1 g.mol. adsorbed on the carbon dioxide sites. Since about
half the former amount consists of hydrogen, one molecule of which requires two
sites, the carbon atoms accessible to steam appear to outnumber those accessible
to carbon dioxide by a factor of 5.0. Comparison with the estimate of 3.3, obtained
above, suggests that the correct value is about 4, so it is concluded that the sites
which react specifically with steam account for approximately 2 % of the charcoal
surface.




Table 2

                                  total amount adsorbed
temp.     initial pressure     H$_2$O reaction     CO$_2$ reaction
(° C)     (mm.)                (mm.)               (mm.)               ratio
800       150                  128                 38                  3.6
750       150                  123                 26                  4.7
700       150                  113                 32                  3.5
750       100                   87                 21                  4.1
750        55                   65                 12                  5.4
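
The arithmetic applied to table 2 can be followed in a short sketch. It rests on an interpretation of the text rather than on a stated formula: the amount adsorbed in the steam reaction is assumed to cover both groups of sites, so that one unit of the ratio belongs to the carbon dioxide sites, and half of the remainder is assumed to be hydrogen requiring two sites per molecule. The ratios are those printed in the last column of the table; the function name is hypothetical.

```python
def steam_site_factor(ratios, hydrogen_fraction=0.5):
    """Return (mean ratio, g.mol. on steam sites per g.mol. on CO2 sites, site factor)."""
    mean_ratio = sum(ratios) / len(ratios)
    steam_per_co2 = mean_ratio - 1.0
    # A hydrogen molecule takes two sites, any other adsorbed species one.
    sites_per_mol = 2.0 * hydrogen_fraction + 1.0 * (1.0 - hydrogen_fraction)
    return mean_ratio, steam_per_co2, steam_per_co2 * sites_per_mol


table2_ratios = [3.6, 4.7, 3.5, 4.1, 5.4]
mean_ratio, steam_per_co2, factor = steam_site_factor(table2_ratios)
print(mean_ratio, steam_per_co2, factor)

# With carbon dioxide sites at about 0.5 % of the surface and a factor of about 4,
# the steam sites come to about 2 % of the surface.
steam_surface_percent = 0.5 * 4
```

The unrounded result is slightly below the 5.0 quoted in the text, which uses the rounded intermediate values 4.3 and 3.3.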



(2) The flow method




The rate of reaction 


Measurements are made at five temperatures in the range 680 to 770° C. The
variation of the rate of reaction with the partial pressure of steam is determined by
passing through the charcoal column a series of steam-nitrogen mixtures, the total
rate of flow being kept constant at 1200 c.c./min. Since carbon dioxide is formed
secondarily from carbon monoxide and steam, the rate of the primary process is 
taken to be the sum of the rates at which the two oxides are produced. To study the 
effect of hydrogen, various small partial pressures of it are added, the partial pressure 
of steam and the total rate of flow being kept constant as before. Neither carbon 
monoxide nor carbon dioxide has any appreciable effect on the rate of this reaction, 
and the partial pressure of the latter gas is insufficient for its reaction with the 
charcoal to be significant. Typical results are illustrated in figure 6, where the rate of 
reaction per gram of charcoal is plotted against the initial pressures of steam and 
hydrogen. 

It can be shown that the expression

$$\text{rate} = \frac{k_1 p_{H_2O}}{1 + k_2 p_{H_2} + k_3 p_{H_2O}} \qquad (1)$$


is closely obeyed over the whole temperature range. If the conditions (i) to (iii)
enumerated in the preceding paper are fulfilled, and if in addition $k_2 p^0_{H_2O} \gg 1$ and
$p^0_{H_2O} > \bar{p}_{H_2}$, equation (1) leads to the relation

$$\frac{m}{x} = \frac{1 + k_3 p^0_{H_2O}}{k} + \frac{k_2 \bar{p}_{H_2}}{k}. \qquad (6)$$

The quantity $x$ is now the fraction of steam reacting directly with the charcoal and
$\bar{p}_{H_2}$ is the average partial pressure of hydrogen, including that formed from carbon
monoxide and steam; $m$ and $k$ retain their previous definitions. In the present experiments
the total rate of flow does not vary by more than 0 %, the rates of diffusion
lie between 0.1 and 20 % of the rates of flow, while $x$, which is rarely greater than
0.10, never exceeds 0.20; the average value of $k_2 p^0_{H_2O}$ is about 15 and $p^0_{H_2O}/\bar{p}_{H_2}$ is
not less than 1.4. The conditions under which equation (6) is applicable are thus
fulfilled to a close approximation.



Figure 6. Apparent order of reaction with respect to steam and retardation by hydrogen.
Initial pressures of steam: 380 mm. and 253 mm.


Figure 7 shows that the straight lines required by equation (6) are obtained when
$m/x$ is plotted against $\bar{p}_{H_2}$. The relatively small variations of the slope at any given
temperature do not depend systematically on $p^0_{H_2O}$, and are probably due to lack
of complete reproducibility from one experiment to another. In figure 8 the intercept
$m/x_0$ at $\bar{p}_{H_2} = 0$ is plotted against $p^0_{H_2O}$; these intercepts are calculated by
subtracting $k_2 \bar{p}_{H_2}/k$ from the values of $m/x$ obtained in the experiments in which
hydrogen is not added. It can be seen that the effect of varying the steam pressure
is adequately represented by the linear relation

$$\frac{m}{x_0} = \frac{1 + k_3 p^0_{H_2O}}{k}, \qquad (7)$$

which follows from equation (6).





Values of the constants and their probable errors are calculated by the method of
least squares in the manner described previously. The logarithms of $k_1$, $k_2$ and $k_3$
are plotted against the reciprocal of the absolute temperature in figure 9, where the
radii of the circles are equal to the probable errors. The variation of $k_1$ approximates
closely to an exponential increase with temperature. Although there is no doubt
that $k_3$ also increases as the temperature is raised, the probable errors are larger and
the points are consequently more scattered; a straight line has, however, been drawn
by the method of least squares. The values of $k_2$ do not show any significant change.
A concise summary of the results may thus be given as follows:

Figure 7. Test of rate expression: variation of hydrogen pressure. Initial steam
pressures: 633, 508, 380, 253 and 127 mm.





$k_1 = k_{01} e^{-E_1/RT}$, where $k_{01} = 10^{10.2 \pm 0.9}$ g.mol. min.$^{-1}$ atm.$^{-1}$ g.$^{-1}$,
and $E_1 = 62.3 \pm 4.1$ kcal. g.mol.$^{-1}$;

$k_2 = 33$ atm.$^{-1}$;

$k_3 = k_{03} e^{-E_3/RT}$, where $k_{03} = 10^{4.6 \pm 1.1}$ atm.$^{-1}$,
and $E_3 = 20.1 \pm 6.2$ kcal. g.mol.$^{-1}$.
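
As an illustration of how this summary may be used, the following sketch evaluates the three constants at a chosen temperature and inserts them in rate expression (1). Only the central values are used and the quoted probable errors are ignored; the function names, the value $R = 1.987 \times 10^{-3}$ kcal. per degree per g.mol. and the example pressures are assumptions made for the illustration.

```python
import math

R_KCAL = 1.987e-3  # gas constant, kcal. per degree per g.mol.


def rate_constants(temp_celsius):
    """Central values of k1, k2 and k3 at the given temperature."""
    rt = R_KCAL * (temp_celsius + 273.15)
    k1 = 10.0 ** 10.2 * math.exp(-62.3 / rt)   # g.mol. min.^-1 atm.^-1 g.^-1
    k2 = 33.0                                   # atm.^-1, no significant change with T
    k3 = 10.0 ** 4.6 * math.exp(-20.1 / rt)    # atm.^-1
    return k1, k2, k3


def reaction_rate(temp_celsius, p_h2o_atm, p_h2_atm):
    """Rate from expression (1), in g.mol. per min. per g. of charcoal."""
    k1, k2, k3 = rate_constants(temp_celsius)
    return k1 * p_h2o_atm / (1.0 + k2 * p_h2_atm + k3 * p_h2o_atm)


# Example: 750 deg. C, 0.5 atm. of steam, with and without 0.05 atm. of hydrogen.
k1_750, k2_750, k3_750 = rate_constants(750.0)
rate_without_h2 = reaction_rate(750.0, 0.5, 0.0)
rate_with_h2 = reaction_rate(750.0, 0.5, 0.05)
print(k1_750, k3_750, rate_without_h2, rate_with_h2)
```

Because the uncertainty in $k_{01}$ is nearly a power of ten, absolute rates obtained in this way are no better than order-of-magnitude estimates; the relative effect of hydrogen, which depends mainly on $k_2$, is more firmly fixed.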



Figure 8. Test of rate expression: variation of steam pressure. 

The rates of the individual stages 

The experiments with the static apparatus suggest that the important elementary 
processes are the following: 


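$$\mathrm{H_2O} \underset{j_1}{\overset{i_1}{\rightleftharpoons}} \mathrm{(OH)(H)} \overset{i_1'}{\longrightarrow} \mathrm{(O)(H_2)}, \qquad (8)$$

$$\mathrm{H_2} \underset{j_2}{\overset{i_2}{\rightleftharpoons}} \mathrm{(H_2)}, \qquad (9)$$

$$\mathrm{C + (O)} \overset{j_3}{\longrightarrow} \mathrm{CO}. \qquad (10)$$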





Figure 9. Variation of $k_1$, $k_2$ and $k_3$ with temperature.

It will be shown below that the retardation by hydrogen can satisfactorily be
accounted for by the extent to which the gas is adsorbed; the reverse of the second
step in equation (8), which represents an additional way in which hydrogen can
reduce the rate of reaction, is thus neglected. The conditions for a steady state may
be expressed in general terms by the relations

$$i_1 p_{H_2O}(1 - \theta_1 - \theta_2 - \theta_3) = (i_1' + j_1)\theta_1, \qquad (11)$$

$$i_2 p_{H_2}(1 - \theta_1 - \theta_2 - \theta_3) + i_1'\theta_1 = j_2\theta_2, \qquad (12)$$

$$i_1'\theta_1 = j_3\theta_3, \qquad (13)$$

where $\theta_1$, $\theta_2$ and $\theta_3$ are respectively the fractions of the active surface occupied by
(OH)(H), (H$_2$) and (O). No allowance has been made for the different numbers of
sites which the various species require, since this cannot be done without detailed
knowledge of the spatial distribution of the active carbon atoms. This approximation
is not, however, a very drastic one. The rate of reaction in the steady state is given by

$$\text{rate} = \frac{\dfrac{i_1 i_1'}{i_1' + j_1}\, p_{H_2O}}{1 + \dfrac{i_2}{j_2}\, p_{H_2} + \dfrac{i_1 i_1'}{i_1' + j_1}\left(\dfrac{1}{i_1'} + \dfrac{1}{j_2} + \dfrac{1}{j_3}\right) p_{H_2O}}, \qquad (14)$$





which is of the same form as the experimentally determined rate expression, equation
(1). It is thus possible to evaluate the rates of some of the individual stages by
comparing the corresponding terms in the two expressions.

The numerators yield the relation

$$\frac{i_1 i_1'}{i_1' + j_1} = k_1. \qquad (15)$$

Now Muller & Cobb (1940) found that when charcoal on which steam has been
adsorbed at 300° C is heated under vacuum to 1050° C, the steam is not recovered
as such but only as hydrogen and oxides of carbon. The results obtained by Strickland-
Constable (1947) also show that above 700° C adsorbed steam reacts much more
quickly than it evaporates. It follows that under the present conditions $j_1$, the rate
constant for evaporation of steam, is small compared with $i_1'$, which refers to an
essential step in the comparatively rapid evolution of hydrogen. The rate of the first
stage of the mechanism may thus be evaluated by writing equation (15) in the form

$$i_1 = k_1 = 10^{10.2 \pm 0.9}\, e^{-(62{,}300 \pm 4{,}100)/RT} \text{ g.mol. min.}^{-1} \text{ atm.}^{-1} \text{ g.}^{-1}. \qquad (16)$$

The magnitude of the non-exponential factor can be explained by the simple assumption
that reaction occurs when a steam molecule collides with an appropriate site,
provided that the total energy available is greater than $E^*$. The theoretical expression

$$k_{01} = \frac{A}{(2\pi mkT)^{1/2}\,(s+1)!}\left[\frac{E_1 + (s + \tfrac{3}{2})RT}{RT}\right]^{s+1} e^{-(s+3/2)}, \qquad (17)$$


which was deduced for the corresponding factor in the carbon dioxide reaction,
is applicable here if $A$ is now the area represented by the sites accessible to steam
and $m$ is the mass of the steam molecule. Since the vibrations of the two active
carbon atoms perpendicular to the surface and the three normal modes of the
steam molecule are likely to contribute to the activation energy, the value of $s$
is assumed to be 5. $E^*$, which is equal to $E_1 + (s + \tfrac{3}{2})RT$, thus becomes 75 kcal.
$A$ is taken to be 6 x 10* cm.$^2$ g.$^{-1}$, since it has been found by the static method to
represent about 2 % of the total surface area. The value of $k_{01}$ calculated on this
basis is 10 M g.mol. min.$^{-1}$ atm.$^{-1}$ g.$^{-1}$, in good agreement with the observed value
of $10^{10.2 \pm 0.9}$ in the same units.
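
The order of magnitude of this estimate can be checked with a rough sketch. It assumes the collision expression in the form $k_{01} = A\,[E^*/RT]^{s+1} e^{-(s+3/2)} / \{(2\pi mkT)^{1/2}(s+1)!\}$ with $E^* = E_1 + (s + \tfrac{3}{2})RT$. The exponent of the area is not legible in the source, so the value $A = 6 \times 10^{4}$ cm.$^2$ g.$^{-1}$ used below is an assumption, as are the temperature of 725° C and the conversion factors to g.mol. min.$^{-1}$ atm.$^{-1}$ g.$^{-1}$.

```python
import math

K_BOLTZMANN = 1.380649e-16   # erg per degree
AVOGADRO = 6.02214076e23     # molecules per g.mol.
ATMOSPHERE = 1.01325e6       # dyne per cm.^2
R_CAL = 1.987                # cal. per degree per g.mol.


def k01_collision(area_cm2_per_g, molar_mass, temp_k, e1_cal, s):
    """Non-exponential factor in g.mol. min.^-1 atm.^-1 g.^-1, and E* in cal."""
    mass = molar_mass / AVOGADRO
    # Molecules striking 1 cm.^2 per second at a pressure of 1 atm.
    collisions = ATMOSPHERE / math.sqrt(2.0 * math.pi * mass * K_BOLTZMANN * temp_k)
    e_star = e1_cal + (s + 1.5) * R_CAL * temp_k
    energy_factor = (
        (e_star / (R_CAL * temp_k)) ** (s + 1)
        / math.factorial(s + 1)
        * math.exp(-(s + 1.5))
    )
    k01 = area_cm2_per_g * collisions * energy_factor * 60.0 / AVOGADRO
    return k01, e_star


# Assumed inputs: A = 6e4 cm.^2 per g. (assumption), steam, 725 deg. C, s = 5.
k01, e_star = k01_collision(6.0e4, 18.015, 725.0 + 273.15, 62_300.0, 5)
log10_k01 = math.log10(k01)
print(log10_k01, e_star / 1000.0)
```

Under these assumptions $E^*$ comes out close to the 75 kcal. quoted in the text, and the calculated factor lies within the probable error of the observed value.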

Another relation between the constants in equations (1) and (14) is

$$\frac{k_1}{k_3} = \frac{1}{\dfrac{1}{i_1'} + \dfrac{1}{j_2} + \dfrac{1}{j_3}}. \qquad (18)$$

The fact that after the adsorption of steam hydrogen is evolved much more rapidly
than carbon monoxide implies that both $i_1'$ and $j_2$ are large compared with $j_3$.
Consideration of equations (8) to (10) shows that whichever of $i_1'$ and $j_2$ is the smaller
should be approximately equal to $j_3$ multiplied by the ratio of the initial rates of
formation of the two gases; reference to figure 1 emphasizes that this ratio is large.
It is thus reasonable to neglect both $1/i_1'$ and $1/j_2$ in comparison with $1/j_3$, and to
write equation (18) in the form

$$j_3 = k_1/k_3 = 10^{5.7 \pm 1.7}\, e^{-(41{,}800 \pm 7{,}400)/RT} \text{ g.mol. min.}^{-1} \text{ g.}^{-1}. \qquad (19)$$

This expression for the rate at which adsorbed oxygen atoms evaporate as carbon
monoxide may be compared with the theoretical relation

$$j_3 = \nu n e^{-E'/RT}, \qquad (20)$$

where $\nu$ is the appropriate vibration frequency, $E'$ the activation energy, and $n$ the
amount of carbon monoxide equivalent to the total number of sites available. Since
the observed activation energy of 42 kcal. is given by $E_1 - E_3$, the correction of
$(s + \tfrac{3}{2})RT$ applied to $E_1$ makes $E'$ equal to 55 kcal. The quantity $n$ has been estimated
to be roughly four times the corresponding figure for the carbon dioxide
reaction, and is consequently taken to be $6 \times 10^{-5}$ g.mol. g.$^{-1}$. If the observed value
of the non-exponential factor $j_{03}$ is now substituted in the relation

$$\nu = j_{03}/n, \qquad (21)$$

the frequency $\nu$ is found to be $10^{11.0 \pm 1.7}$ sec.$^{-1}$. This value is in reasonable agreement
with the figure of $10^{13}$ sec.$^{-1}$ predicted by the simple theory outlined in the preceding
paper. The fact that the values of $E'$ and $\nu$ are not the same as those obtained for
the analogous stage of the carbon dioxide reaction (38 kcal. and $10^{7}$ sec.$^{-1}$) can readily
be understood in terms of the idea that two distinct groups of sites are involved.
Since the observed non-exponential factor is markedly smaller than the theoretical
figure only for the carbon dioxide reaction, it is tentatively suggested that the discrepancy
is due to a non-adiabatic transition. The other possible explanation of a
low-frequency factor mentioned in the discussion of the carbon dioxide results seems
unlikely to make the reaction probability depend so specifically on the type of
site concerned.
Comparison of the hydrogen retardation terms in equations (1) and (14) implies

$$i_2/j_2 = 33\ \text{atm.}^{-1}. \tag{22}$$

As $i_2$ and $j_2$ represent respectively the rates of adsorption and desorption, it follows
from the kinetic derivation of the Langmuir isotherm (see equation (20) in the
preceding paper) that $k_2$ should be equal to the constant $b$ in equation (5). The
observation that the adsorption of hydrogen can be described by an isotherm of this
type is thus consistent with the form of the rate expression. It remains, however,
to compare the numerical values of $k_2$ determined by the flow technique with those
of $b$ calculated from the static adsorption measurements. Table 1 shows that $b$ is
equal to 27 ± 3 atm.$^{-1}$ at 750° C and 42 ± 8 atm.$^{-1}$ at 800° C; the values of $k_2$ observed
between 680 and 770° C all lie in the range 23 to 40 atm.$^{-1}$ and their average is
33 atm.$^{-1}$. Although equilibrium may not be approached to quite the same extent
in the two sets of experiments, the reasonable agreement obtained is thought to
confirm the conclusion that the retardation caused by hydrogen is due to its
adsorption.



The mechanism of the steam-carbon reaction 




Discussion 

It is now possible to summarize the main features of the reactions of charcoal with 
steam and with carbon dioxide, and to see how far they can be interpreted on the 
basis of general principles. 

The mechanisms are fundamentally similar, each overall process taking place in 
two main stages. A molecule of steam or carbon dioxide is first decomposed by the 
transfer of an oxygen atom to the carbon surface; the oxygen atom then evaporates, 
together with an atom of carbon from the solid, to form gaseous carbon monoxide. 
There is a significant difference of detail between the initial stages of the two reactions, 
but this can largely be attributed to the different structures of the molecules con¬ 
cerned. Whereas carbon dioxide is decomposed into an atom of oxygen which is 
adsorbed and a molecule of carbon monoxide which passes into the gas phase, the 
whole of the steam molecule is taken up by the charcoal. However, the dissociation 
of carbon dioxide gives an atom with a considerable affinity for carbon and a stable 
molecule which is itself only slowly adsorbed, while that of steam yields a hydrogen 
atom and a hydroxyl radical, each of which is expected to combine readily with 
the carbon. 

Both steam and carbon dioxide attack only a small fraction of the total surface
of the charcoal; this is a natural consequence of the micro-crystalline structure of 
the material, since a small but by no means negligible proportion of the surface atoms 
will be situated at the edges of the individual lattice planes, and will therefore be 
exceptionally reactive on account of their weaker binding to the rest of the solid. 

The rates of reaction can also be understood in terms of familiar principles. Given 
the experimental activation energy and the number of active sites determined by 
the adsorption measurements, the rate of the first stage of each reaction can be 
calculated to within the nearest order of magnitude from simple kinetic theory.
A similar treatment of the second stage yields theoretical rates which are greater 
than those observed, but the discrepancy can be ascribed to complicating factors 
likely to be significant in these reactions. 

The more general features of the mechanism outlined above follow in a straight¬ 
forward manner the tendency for reactions to take place in a series of simple steps, 
each of which involves the minimum disturbance of the existing molecular structure 
(Hinshelwood 1947). There are, however, some important observations which,
though susceptible of treatment by the same general principles, require a more 
detailed approach. The fact that steam and carbon dioxide react predominantly with 
different parts of the charcoal surface is the most outstanding. Moreover, the 
retarding effects of the products have been shown to be caused by chemical adsorption,
but their variation from one reaction to another clearly depends on the specific
properties of each set of sites. It will be recalled that the action of steam is retarded 
only by hydrogen, while that of carbon dioxide is retarded by both carbon monoxide 
and hydrogen. The proposed mechanism indicates that steam and hydrogen should 
compete for the sites which can take up hydrogen atoms. An analogous explanation 




is not, however, available for the carbon dioxide reaction, because there the initial 
decomposition of the reactant does not involve the adsorption of carbon monoxide 
or, of course, of hydrogen. All these problems raise the question of the essential 
difference between the two types of site. 

The nature of the active sites

Since the active sites are thought to consist of carbon atoms which are not exerting 
their full valency, there is a possibility that the two types might differ in the number 
of additional bonds which they could form. It will be shown that their behaviour 
can, in fact, be explained in that way. 

The mechanism deduced for the steam reaction may be expressed in terms of 
carbon atoms which are attached to the rest of the lattice by only three bonds, and 
thus have one further orbital available for interaction with other molecules: 

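[Reaction scheme: two adjacent three-bonded carbon atoms take up a molecule of steam as an adsorbed hydroxyl group and an adsorbed hydrogen atom; hydrogen is then evolved, leaving an adsorbed oxygen atom, which finally evaporates with a carbon atom of the lattice as carbon monoxide.] (23)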

Such sites are likely to be found at the free edges of either the lamellar or the
cross-linked aromatic ring systems (Riley 1947) which are the basis of the charcoal
structure. Evidence for the existence of the unpaired electrons which should be associated
with atoms of this type comes from the work of Bonhoeffer, Farkas & Rummel
(1933). These authors discovered that even charcoal of the highest purity is a good
catalyst for the ortho-para hydrogen conversion at low temperatures, and so
concluded that the carbon surface must itself be paramagnetic.

A straightforward summation of the energies‡ of the bonds shown in equation
(23) makes the adsorption of steam and hydrogen exothermic to the extent of 76
and 108 kcal. respectively. The value for hydrogen is considerably greater than the

‡ The following bond strengths (heat contents at 25° C) are used throughout this discussion:
H—H 104 kcal. (Herzberg 1939); C—H 106 kcal. for trigonal hybridization (Walsh 1947)
and 104 kcal. for the tetrahedral state (Long & Norrish 1946); C—O 86 kcal., C=O 184 kcal.
(Walsh 1947); C—C 95 kcal. (Long & Norrish 1946), C=C 153 kcal. (Walsh 1946). The
dissociation energy of the carbon monoxide molecule into atoms in $^3P$ states is taken to be
211 kcal. (Herzberg 1939), so the bond strength with respect to $^5S$ carbon, the reference state
for the other carbon bonds, becomes 277 kcal. (Long & Norrish 1946). Other data required
are the heat content changes for the reactions H$_2$O = OH + H and CO$_2$ = CO + O; these are
found from the information collected by Lewis & von Elbe (1935) to be 118 and 127 kcal.
respectively. Although some of these values may be subject to future revision, it is most
unlikely that the general conclusions will be seriously affected.





experimental figure of 50 kcal. (Barrer 1935), but it is only to be expected that,
since the vacant sites are of the nature of free radicals, they will be stabilized in 
some degree by resonance. Although the precise canonical states cannot easily be 
specified, a resonance energy equal to the discrepancy of 29 kcal. per active carbon 
atom is not improbable. The evolution of gaseous hydrogen from adsorbed steam, 
which is thought to involve the intermediate transfer of the second hydrogen atom 
to the carbon surface, has been represented by a single stage in equation (23) merely 
for convenience of illustration. The bond to the oxygen atom then remaining will 
clearly have some tendency to become a double one at the expense of a corresponding 
reduction in the order of a carbon-carbon link. A net increase of 40 kcal. in the
strength of the individual bonds would so be obtained, but against that must be set 
the lack of resonance energy in the resulting quinonoid form, since even a single 
benzene ring is stabilized by 36 kcal. (Kistiakowsky, Ruhoff, Smith & Vaughan
1936). It is thus a fair approximation for energetic purposes to regard this type of
carbon atom as capable of forming only a single bond with any adsorbed atom or 
group. Radicals with a structure similar to that suggested for the adsorbed oxygen 
atom have long been known to exist in solution (for a summary, see Waters 1946).
The driving force for the final stage of the reaction comes from the formation of the 
particularly strong bond in the molecule of carbon monoxide which is evolved. 

The fact that the sites which react with steam do not adsorb carbon monoxide 
can also be accounted for by the present treatment. Even if the vacant site was not 
stabilized in any way, the process 

$${\geq}\mathrm{C} + \mathrm{CO} \rightarrow {\geq}\mathrm{C{-}C{=}O}, \tag{24}$$

which appears to represent the most favourable possibility, would lead to the 
evolution of only 2 kcal. As a gas is appreciably adsorbed at high temperatures only 
if the change is strongly exothermic, it follows that the adsorption of carbon monoxide 
should be negligible compared with that of steam (76 kcal.) or hydrogen (108 kcal.).
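The bond-energy bookkeeping for the single-bond sites can be checked with a few lines of arithmetic. The sketch below is only an illustration: the function and variable names are hypothetical, the bond strengths are those quoted in the footnote, and the C—C value of 95 kcal. is an assumption chosen because it reproduces the 2 kcal. quoted for equation (24).

```python
# Bond strengths in kcal. per g.mol., as quoted in the footnote of the text.
BOND = {
    "H-H": 104,
    "C-H trigonal": 106,
    "C-C": 95,           # assumed value; it reproduces the 2 kcal. of equation (24)
    "C=O": 184,
    "CO molecule": 277,  # bond strength referred to 5S carbon
}


def heat_evolved(bonds_formed, bonds_broken):
    """Heat evolved = strengths of bonds formed minus strengths of bonds broken."""
    formed = sum(BOND[b] for b in bonds_formed)
    broken = sum(BOND[b] for b in bonds_broken)
    return formed - broken


# Hydrogen on two single-bond sites: two C-H bonds formed, H-H broken.
heat_h2 = heat_evolved(["C-H trigonal", "C-H trigonal"], ["H-H"])

# Carbon monoxide on a single-bond site, equation (24):
# C-C and C=O formed, the CO molecule broken.
heat_co = heat_evolved(["C-C", "C=O"], ["CO molecule"])

# Resonance energy per active carbon atom needed to reconcile heat_h2
# with the experimental heat of adsorption of hydrogen (50 kcal.).
resonance_per_atom = (heat_h2 - 50) / 2

print(heat_h2, heat_co, resonance_per_atom)
```

On these figures the hydrogen value, the carbon monoxide value and the resonance energy per atom agree with the 108, 2 and 29 kcal. given in the text.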

If the initial step in the carbon dioxide reaction is formulated in terms of the sites 
discussed above, it becomes 

$${\geq}\mathrm{C} + \mathrm{CO_2} \rightarrow {\geq}\mathrm{C{-}O} + \mathrm{CO}, \tag{25}$$

which is no less than 117 kcal. more endothermic than the corresponding stage of 
the steam reaction. According to the concept of chemical inertia (Evans & Polanyi
1938), a difference of this magnitude should lead to a greatly increased activation
energy, so it is most unlikely that the process represented by equation (25) would 
occur at any appreciable rate. In addition to these sites, however, the charcoal 
surface probably contains a smaller number of atoms which are each able to form 
two extra bonds; such atoms will be a feature of incomplete benzene rings, and might
also exist at special positions on lattice edges which have not yet been attacked.
Although they are clearly capable of decomposing the steam molecule, these divalent
carbon atoms are no better suited to that purpose than are the other active sites,
because neither the hydrogen atom nor the hydroxyl radical can utilize more than



one free valency. No alteration need therefore be made to the treatment of the 
steam reaction, since the single bond sites may be supposed to make the predominant 
contribution by virtue of their greater abundance. The distinctive feature of the 
divalent carbon atoms is the possibility of a double bond, the formation of which 
will undoubtedly facilitate the disruption of the carbon dioxide molecule: 

$${>}\mathrm{C} + \mathrm{CO_2} \rightarrow {>}\mathrm{C{=}O} + \mathrm{CO}. \tag{26}$$

As this process is estimated to be 83 kcal.‡ more exothermic than that of equation
(25), it should require a very much smaller activation energy owing to the increased
driving force available (Evans & Polanyi 1938); it is thus thought to be the one
responsible for the first stage of the carbon dioxide reaction. This view not only 
explains the comparative scarcity of the carbon dioxide sites, but will also account 
for their capacity to take up both carbon monoxide and hydrogen. Carbon monoxide 
can be adsorbed by the formation of a double bond, 

$${>}\mathrm{C} + \mathrm{CO} \rightarrow {>}\mathrm{C{=}C{=}O}, \tag{27}$$

the calculated heat of adsorption of 45 kcal. being in good agreement with the
experimental value of 46 kcal. The alternative possibility of two single bonds
enables hydrogen to be strongly adsorbed with the evolution of 89 kcal.:

$${>}\mathrm{C} + \mathrm{H_2} \rightarrow {>}\mathrm{CH_2}. \tag{28}$$
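The figures quoted for the divalent sites follow from the same kind of summation, with the 15 kcal. promotion correction described in the footnote. The sketch below is an illustrative check under that reading; the variable names are hypothetical, and the figure of 76 kcal. for steam on a single-bond site is simply taken from the text.

```python
# Quantities in kcal. per g.mol., taken from the text and its footnotes.
C_O_SINGLE = 86
C_O_DOUBLE = 184
C_C_DOUBLE = 153
C_H_TETRAHEDRAL = 104
H_H = 104
CO_MOLECULE = 277        # bond strength referred to 5S carbon
CO2_TO_CO_PLUS_O = 127   # heat content change for CO2 = CO + O
PROMOTION = 15           # assumed cost of raising divalent carbon to the tetravalent state
STEAM_SINGLE_SITE = 76   # heat of adsorption of steam quoted in the text

# Heat evolved (positive = exothermic) for each step.
eq25 = C_O_SINGLE - CO2_TO_CO_PLUS_O                       # CO2 on a single-bond site
eq26 = C_O_DOUBLE - CO2_TO_CO_PLUS_O - PROMOTION           # CO2 on a divalent site
eq27 = C_C_DOUBLE + C_O_DOUBLE - CO_MOLECULE - PROMOTION   # CO on a divalent site
eq28 = 2 * C_H_TETRAHEDRAL - H_H - PROMOTION               # H2 on a divalent site

# Differences discussed in the text.
steam_minus_eq25 = STEAM_SINGLE_SITE - eq25   # equation (25) compared with the steam stage
eq26_minus_eq25 = eq26 - eq25                 # equation (26) compared with equation (25)

print(eq25, eq26, eq27, eq28, steam_minus_eq25, eq26_minus_eq25)
```

The differences of 117 and 83 kcal. and the heats of 45 and 89 kcal. given in the text are recovered in this way.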

The present scheme is also consistent with the suggestion, made to account for 
the observed rates of reaction, that the final stage of the mechanism may be non- 
adiabatic. An electronic transition seems to be quite probable, because either type 
of active carbon atom is tetravalent while carrying an adsorbed oxygen atom, but 
may be essentially divalent in the carbon monoxide molecule formed (Long &
Walsh 1947). As the two kinds of site are thought to be attached to the rest of the
lattice by a different number of bonds, it is understandable that the transmission 
coefficient should not be the same for the steam and carbon dioxide reactions. 

It can thus be seen that the above ideas will account for the main features of
the reaction mechanism. In view of present knowledge about the mobility of 
electrons in conjugated systems, the division of the less firmly bound carbon atoms 
into those which can form either one or two extra bonds cannot be regarded as more 
than a convenient approximation for a first approach. The surface probably contains 
a wide variety of different sites capable of forming bonds of an order almost con¬ 
tinuously variable from zero to a value possibly exceeding two. Steam should react 
with all those which can readily provide a bond of order approaching unity, and 

‡ Allowance must be made for the energy required to raise the active carbon atom from
the divalent to the tetravalent state. This quantity is not known, but is probably similar to
the corresponding one for the CH$_2$ molecule which is thought to be about 15 kcal. (Long &
Norrish 1946). The heat content changes involving the divalent carbon atom are thus
calculated from the ordinary bond strengths, and a correction of +15 kcal. made to the results
so obtained.




carbon dioxide only with the ones capable of forming with similar facility a link of 
considerably greater order. Further experimental work with different types of 
carbon is desirable in order to define more precisely the structural requirements for 
a given kind of site; on the theoretical side, the quantitative treatments of free 
valency now being developed by Coulson (1946) and by Daudel & Pullman (1946)
offer a promising line of inquiry. 

Comparison with results obtained at low pressures 

The interaction of charcoal with steam at pressures of less than 1 mm. has recently 
been investigated by Strickland-Constable (1947) who studied the adsorption of
the gases by the static method. Results for the course of reaction at 700° C in good
agreement with those described in the present paper were obtained, but a somewhat 
different interpretation was proposed. Steam was thought to be adsorbed without 
dissociation, and subsequently to react with the carbon to give molecules of hydrogen 
and carbon monoxide also held to the surface only by physical forces; the slowest
of these changes was considered to be the final desorption of carbon monoxide. This 
mechanism was based on the observation that evaporation of the adsorbed steam
is an important factor at 600° C and can be detected even at 700° C. It was argued 
that since oxygen and nitrous oxide react extremely rapidly with charcoal at these 
temperatures, any adsorbed oxygen atoms would immediately be converted into 
gaseous oxides of carbon and therefore could not be recovered as steam. 

The greater part of the evidence relating most directly to the steam reaction itself 
can hardly be reconciled, however, with the point of view adopted by Strickland- 
Constable. It has been shown conclusively that although the interaction of hydrogen 
with charcoal is indeed of a physical nature from −195 to 0° C, it exhibits all the
characteristics of a chemical change between 350 and 950° C (Barrer & Rideal 1935;
Barrer 1935). The activation energy of 75 kcal. now obtained for the adsorption of
steam strongly suggests that the molecule is being dissociated. Direct evidence for
the corresponding decomposition of the carbon dioxide molecule (activation energy
68 kcal.) is provided by the simultaneous liberation of the carbon monoxide so
produced. The slow evolution of carbon monoxide during the steam reaction, a
stage requiring an activation energy of 55 kcal., can then be visualized, not as a
purely physical process in which no chemical bonds are either broken or formed, but
rather as the removal of a carbon atom from the lattice. The value of 46 kcal. found
for the heat of adsorption of carbon monoxide also implies the operation of valency 
forces. Furthermore, the assumption that a significant amount of physical adsorp¬ 
tion is even a possibility under these conditions appears to be inconsistent with 
fundamental principles. Since the adsorption of a gas is accompanied by a marked 
decrease of entropy, the process is, in general, sufficiently exothermic to take place 
to an appreciable extent at high temperatures only if chemical bonds are formed. 
Dispersion and polarization forces acting on small molecules give rise to interaction 
energies of only a few kcal.; they have not, as far as the authors are aware, ever been 
found to cause strong adsorption in the temperature range 700 to 800° C. 
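The entropy argument can be made concrete with a rough Langmuir estimate. The sketch below is not taken from the paper: it assumes a Langmuir constant of the form b = exp(ΔS/R) exp(q/RT), and the entropy change of −30 cal. per degree per g.mol. is an assumed round figure, as are the function and variable names.

```python
import math

R = 1.987e-3  # gas constant, kcal. per g.mol. per degree


def coverage(heat_kcal, temp_k, pressure_atm=1.0, entropy_change=-0.030):
    """Langmuir coverage for a given heat of adsorption.

    entropy_change is in kcal. per g.mol. per degree; -0.030 is an assumed
    round figure for the loss of gas-phase freedom, not a measured value.
    """
    b = math.exp(entropy_change / R) * math.exp(heat_kcal / (R * temp_k))
    return b * pressure_atm / (1.0 + b * pressure_atm)


T = 1023.0  # 750 deg C, within the range discussed in the text

physical = coverage(5.0, T)    # a few kcal.: dispersion and polarization forces
chemical = coverage(46.0, T)   # heat of adsorption of carbon monoxide from the text

print(f"physical: {physical:.1e}   chemical: {chemical:.3f}")
```

Under these assumptions an interaction energy of a few kcal. leaves the surface essentially bare at 750° C, whereas a heat of 46 kcal. gives nearly complete coverage, which is the distinction drawn in the paragraph above.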




It will now be shown that although steam and carbon dioxide are thought to 
give rise to adsorbed oxygen atoms, the very much greater rates at which oxygen 
and nitrous oxide react with carbon can, nevertheless, be explained. Whereas the 
reactions of carbon dioxide and of steam are endothermic, those of oxygen and of 
nitrous oxide are exothermic. The heat content changes for the conversion of 1 g. 
atom of carbon into carbon monoxide (Bichowsky & Rossini 1936) are respectively
+41, +31, −27 and −46 kcal.; if carbon dioxide is the primary product of the
last two reactions, the corresponding values are −94 and −134 kcal. In the
exothermic process, as in the endothermic one, oxygen must first be transferred to the
carbon surface and must then evaporate as a gaseous oxide of carbon. If carbon 
monoxide is produced, the second stage must be the same as that of the endothermic 
mechanism, and even if carbon dioxide is formed, considerable energy will still be 
required to remove the carbon atom from the lattice. It follows, therefore, that the
heat of the exothermic reaction is liberated in the first stage, and consequently that 
each oxygen atom so adsorbed must possess at first an exceptionally large vibrational 
energy. After a small number of oscillations, this energy will have been distributed 
among the many degrees of freedom of the solid; the oxygen atom will then be in 
the same state as one obtained from carbon dioxide or steam, and will react further 
only at the comparatively slow speed characteristic of the endothermic oxidation. 
There is clearly a possibility, however, that before the excess energy is dissipated, 
it might be used to overcome the large activation energy required to disrupt the 
carbon lattice. Complete reaction would then take place in an effectively instan¬ 
taneous process, the speed of which would be limited only by that of the initial 
adsorption. The latter should be very much faster in the exothermic than in the 
endothermic reactions, since the activation energy should be considerably reduced
by the greatly increased driving force available. It is suggested, therefore, that 
a process of this type competes with the formation of the relatively stable oxide 
film, and is responsible for the high speed of the exothermic oxidation. 
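The heat content changes quoted at the start of this argument can be reproduced approximately from heats of formation. The sketch below uses assumed round modern values for the heats of formation, not the Bichowsky & Rossini figures used by the authors, so agreement is expected only to within about 1 kcal.; all names are hypothetical.

```python
# Approximate heats of formation (kcal. per g.mol., gases at 25 deg C).
# Assumed round values, not the Bichowsky & Rossini data cited in the text.
HF = {
    "CO": -26.4, "CO2": -94.05, "H2O": -57.8, "N2O": 19.5,
    "C": 0.0, "O2": 0.0, "H2": 0.0, "N2": 0.0,
}


def heat_content_change(reactants, products):
    """Heat content change: products minus reactants; each side maps species to moles."""
    def total(side):
        return sum(moles * HF[species] for species, moles in side.items())
    return total(products) - total(reactants)


# All reactions are written for 1 g. atom of carbon.
co2_route = heat_content_change({"C": 1, "CO2": 1}, {"CO": 2})
steam_route = heat_content_change({"C": 1, "H2O": 1}, {"CO": 1, "H2": 1})
o2_to_co = heat_content_change({"C": 1, "O2": 0.5}, {"CO": 1})
n2o_to_co = heat_content_change({"C": 1, "N2O": 1}, {"CO": 1, "N2": 1})
o2_to_co2 = heat_content_change({"C": 1, "O2": 1}, {"CO2": 1})
n2o_to_co2 = heat_content_change({"C": 1, "N2O": 2}, {"CO2": 1, "N2": 2})

for name, value in [("CO2", co2_route), ("steam", steam_route),
                    ("O2 to CO", o2_to_co), ("N2O to CO", n2o_to_co),
                    ("O2 to CO2", o2_to_co2), ("N2O to CO2", n2o_to_co2)]:
    print(f"{name:12s} {value:+7.1f} kcal.")
```

The signs bring out the point made in the text: the first stage of the oxygen and nitrous oxide reactions has a large amount of energy to dispose of, while that of the carbon dioxide and steam reactions does not.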

These considerations not only meet the difficulty raised by Strickland-Constable, 
but also predict the type of mechanism which is found experimentally. A single stage 
reaction caused by impact on the clean surface and the simultaneous formation of 
a retarding film have been observed in many investigations of the oxygen-carbon 
system (for a summary, see Strickland-Constable 1940); they are illustrated with
particular clarity in the work on diamond at 300° C carried out by Barrer (1936b).
No explanation of why the kinetic behaviour should be of this form appears to have 
been given hitherto. Nitrous oxide reacts with charcoal at 400° C to form two 
distinct kinds of surface oxide; a stable compound similar to that produced by oxygen, 
and a labile compound which with a further molecule of nitrous oxide rapidly evolves 
carbon dioxide (Strickland-Constable 1938a, b). The stable oxide was identified
with chemically adsorbed oxygen, but no clear picture of the labile oxide could be 
offered. The exceptional properties of the latter compound can, however, readily be 
accounted for if it is regarded as an adsorbed oxygen atom which has not yet lost 
the excess energy liberated during its formation. 




The authors wish to thank Professor C. N. Hinshelwood, F.R.S., for his interest 
in this investigation and for many helpful discussions. This work has been carried
out in close relation with the programme of the Fuel Research Board of the Depart¬ 
ment of Scientific and Industrial Research. One of the authors (K.W.S.) holds an 
Imperial Chemical Industries Research Fellowship, and the other (F.J.L.) is indebted
to the Company for the opportunity to participate in the work. 


References 

Barrer, R. M. 1935 Proc. Roy. Soc. A, 149, 253.
Barrer, R. M. 1936a J. Chem. Soc. p. 1256.
Barrer, R. M. 1936b J. Chem. Soc. p. 1261.
Barrer, R. M. & Rideal, E. K. 1935 Proc. Roy. Soc. A, 149, 231.
Bichowsky, F. R. & Rossini, F. D. 1936 The thermochemistry of chemical substances. New York: Reinhold.
Bonhoeffer, K. F., Farkas, A. & Rummel, K. W. 1933 Z. phys. Chem. B, 21, 225.
Coulson, C. A. 1946 Trans. Faraday Soc. 42, 265.
Daudel, R. & Pullman, A. 1946 C.R. Acad. Sci., Paris, 222, 663.
Evans, M. G. & Polanyi, M. 1938 Trans. Faraday Soc. 34, 11.
Gadsby, J., Hinshelwood, C. N. & Sykes, K. W. 1946 Proc. Roy. Soc. A, 187, 129.
Herzberg, G. 1939 Molecular spectra and molecular structure. 1. Diatomic molecules. New York: Prentice Hall.
Hinshelwood, C. N. 1947 J. Chem. Soc. p. 694.
Kistiakowsky, G. B., Ruhoff, J. R., Smith, H. A. & Vaughan, W. E. 1936 J. Amer. Chem. Soc. 58, 146.
Lewis, B. & von Elbe, G. 1935 J. Amer. Chem. Soc. 57, 612.
Long, L. H. & Norrish, R. G. W. 1946 Proc. Roy. Soc. A, 187, 337.
Long, L. H. & Walsh, A. D. 1947 Trans. Faraday Soc. 43, 342.
Muller, S. & Cobb, J. W. 1940 J. Chem. Soc. p. 177.
Riley, H. L. 1947 Quart. Rev. Chem. Soc. 1, 59.
Strickland-Constable, R. F. 1938a Trans. Faraday Soc. 34, 1074.
Strickland-Constable, R. F. 1938b Trans. Faraday Soc. 34, 1374.
Strickland-Constable, R. F. 1940 Fuel, 19, 89.
Strickland-Constable, R. F. 1947 Proc. Roy. Soc. A, 189, 1.
Walsh, A. D. 1946 Trans. Faraday Soc. 42, 779.
Walsh, A. D. 1947 Trans. Faraday Soc. 43, 60.
Waters, W. A. 1946 The chemistry of free radicals, p. 72. Oxford University Press.



The control of the temperature of endothermic 
reactions in flow systems 

By J. Gadsby and K. W. Sykes, Physical Chemistry Laboratory, Oxford 

(Communicated by C. N. Hinshelwood, F.R.S. – Received 27 November 1947 – 29 April 1948)


The type of flow system which is convenient for investigating the kinetics of many chemical 
reactions presents special difficulties of temperature control which become increasingly acute 
the greater the speed of the reaction. If the change is highly endothermic the problem of 
maintaining a distribution of temperature uniform in space as well as in time becomes of 
great importance, and unless special precautions are taken the spatial distribution may vary
considerably with changes in the rate of reaction. An apparatus is described, utilizing two
relay circuits operated by suitably spaced thermocouples, by which the desired automatic 
regulation of temperature can be achieved. 


Introduction 

An account is given below of the special method of temperature control used in the 
kinetic investigations described in the two preceding papers. The reactions of 
carbon with carbon dioxide and with steam are both highly endothermic (41 and 
31kcal.), and when they were studied in a flow system, complications due to the 
absorption of heat became acute as the measurements were extended to higher
temperatures. The type of electric furnace normally used for kinetic work was 
found to develop in the reaction zone an extremely uneven distribution of tem¬ 
perature, which depended markedly on the rate of reaction. However, to make 
measurements sufficiently accurate for the estimation of reliable activation energies 
for the individual stages of the reactions, it was essential to maintain a temperature 
uniform throughout the reaction zone to within ± 1° C under all conditions. This was
an experimental problem on which there were few published data; it is thought that 
the method by which it has been solved for the present purpose may be of more 
general application. 


The experimental system 

Some further details of the experimental arrangement are relevant to the following 
discussion. The silica reaction tube, 3.1 cm. in diameter and 55 cm. in length, fitted
loosely inside a silica liner of a vertical electric furnace 38 cm. in length. The gas 
stream was led in at the bottom of the tube and out at the top. The reaction tube was 
fitted with a central sheath of 4 mm. diameter silica tubing, and the temperature 
distribution inside this was explored with a thermocouple made from 26 s.w.g.
chromel and alumel wires so that it had a small heat capacity and a quick response.
An interval of 1 min. between moving the thermocouple and recording the tem¬ 
perature ensured equilibrium. Errors introduced by slow drifts in the temperature 


as a whole were minimized by taking two sets of readings obtained by moving the
thermocouple first in one direction and then in the other. The furnace was wound 
in the way described by Norrish & Wallace ( 1934 ) on an externally grooved silica 
former of pitch ten turns to the inch, using 22 s.w.g. 'Red Fox 135' resistance wire.
Tappings were made at four points in the winding, so that five sections of lengths 
4.3, 4.3, 14.0, 4.3 and 4.3 cm. were formed. With the reaction tube empty the power
dissipation in the various sections was adjusted until a fairly even temperature 
distribution in the central zone was obtained; this is shown as curve (a) in figure 1.



Figure 1. Temperature distribution in furnace. Tube packed with silica chips;
(a) no gas passing; (b), (c) and (d) air at 1200, 600 and 200 c.c./min.

The temperature of the furnace as a whole was controlled by a second chromel- 
alumel thermocouple, the e.m.f. of which was balanced by a potentiometer with a 
mirror galvanometer as a null instrument. When the temperature fell below the 
desired value, the light reflected from the galvanometer illuminated a photocell 
operating a relay circuit which increased the furnace current. With a Cambridge 
10 Ω d'Arsonval galvanometer, regulation in the empty reaction tube to ± 0.01° C
at 800° C could be obtained. The attainment of this degree of regulation is attributed 
to the absence of any opaque insulating material, such as alundum or asbestos, to 
prevent direct radiation from the furnace winding to the controlling thermocouple 
in the centre of the reaction tube. With the reaction tube empty the time elapsing 
between a small change in the heating current of the furnace and the response of 
the control circuit was of the order of 1 or 2 sec.; when the reaction tube contained 
carbon, or any opaque material, the time lag was about 1 min. and the best regulation
which could be obtained was of the order of ± 0.2° C.





Investigation of the temperature distribution

To study the effect of gas flow, the reaction tube was filled with broken silica, and 
a current of 1200 c.c./min. of air passed through it. After readjustment of the 
furnace, the fairly even distribution shown as curve (b) in figure 1 was obtained.



Figure 2. Effect of heat of reaction on temperature distribution and its correction. Total
rate of flow 1200 c.c./min. (e), (f) and (g) no CO$_2$, 129 mm. CO$_2$ and 380 mm. CO$_2$ at a given
furnace setting. (h), (i) and (j) 380, 129 and 760 mm. CO$_2$ at another furnace setting. (k), (l),
(m) and (n) automatic control at 307, 380 and 253 mm. CO$_2$ at 755° C, and 258 mm. CO$_2$ at
776° C.

The distributions (c) and (d) were then obtained by reducing the air current to
600 and 200 c.c./min. without altering the setting of the furnace. In these experi¬ 
ments the temperature as a whole was not controlled, the furnace merely being 
allowed to attain thermal equilibrium at each of the three rates of flow which were 
used. It may be seen that the change in the rate of flow did not affect the temperature
in the upper part of the furnace, and that in the central zone, although the temperature
as a whole rose as the rate of flow was reduced, the form of the distribution
was not much changed. These results suggested that the gas stream probably
attained the furnace temperature before it reached the central zone, and that small 
changes of flow and specific heat were not likely, therefore, to be a serious problem 
in the establishment of a uniform temperature distribution. 

The effect of the heat of reaction was investigated as follows. 29.7 g. of charcoal
granules were placed between small pads of asbestos fibre in the middle 10 cm. of the 
reaction tube, and the furnace adjusted to give an even distribution at about 810° C 
with a rate of flow of 1200 c.c./min. of nitrogen (curve (e), figure 2). Then, keeping
the total rate of flow constant, the nitrogen was replaced by various mixtures of 
carbon dioxide and nitrogen and the temperature distribution redetermined. The 
distributions (f) and (g), obtained with mixtures containing carbon dioxide at
partial pressures of 129 and 380 mm. respectively, show that the heat of reaction, 
although small compared with the power supplied to the furnace (4 and 6 W respec¬ 
tively compared with 180 W), nevertheless had a very considerable effect on the 
temperature distribution. Knowing the temperature coefficient of the reaction rate 
(approx. 2.5 % per ° C) and assuming the lower end of the reaction zone to have
been maintained at a constant temperature, it was calculated that the observed 
rates of reaction were only about 70 % of the values which would have been obtained 
if a uniform temperature had been maintained. An apparatus in which large errors 
may arise because the temperature of the reaction zone depends on the rate of 
reaction is obviously unsuitable for kinetic work; these experiments showed clearly 
that in a flow system of this kind the control of the temperature at a single point is
not adequate to ensure uniform conditions of temperature, even if the furnace is 
adjusted initially to give a suitable temperature distribution. 
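The size of the error can be illustrated by averaging the rate over a non-uniform zone. The sketch below is an illustration only: the linear temperature shortfall of 0 to 30° C is a hypothetical profile, not read from figure 2, and the function name is likewise an assumption; only the coefficient of 2.5 % per ° C comes from the text.

```python
def relative_rate(deficits_deg, coefficient=0.025):
    """Mean rate over the zone relative to the rate at the uniform temperature.

    deficits_deg: temperature shortfalls (deg C) at equally spaced points in the zone.
    coefficient: fractional change in rate per deg C (2.5 % per deg C in the text).
    """
    factors = [(1.0 + coefficient) ** (-d) for d in deficits_deg]
    return sum(factors) / len(factors)


# Hypothetical profile: the shortfall rises linearly from 0 at the bottom of the
# reaction zone to 30 deg C at the top (illustrative numbers only).
profile = [30.0 * i / 100 for i in range(101)]

print(f"relative rate: {relative_rate(profile):.2f}")
```

A shortfall of this order, which is small on the scale of an 800° C furnace, is enough to bring the mean rate down to roughly the 70 % mentioned above.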

Automatic control of the temperature distribution 

The automatic control of the temperature distribution is made possible by the 
change in the form of the distribution as the net rate of supply of heat to the reaction 
zone is varied. This may be seen from the curves (h), (i) and (j) in figure 2. The first
of these was obtained by adjusting the furnace to give a fairly even distribution with 
a gas stream containing carbon dioxide at a partial pressure of 380 mm.; the
distributions (i) and (j) were obtained with carbon dioxide pressures of 129 and 760 mm.,
giving lower and higher rates of reaction respectively. A comparison of these curves 
shows that in the lower part of the reaction zone the distribution became more 
concave in form as the rate of reaction was increased. If the furnace had been ideally 
adjusted so that the distribution for 380 mm. of carbon dioxide had been uniform, 
then the curve at the lower rate would have been convex in shape and that at the 
higher rate concave. Now it was known that a convex distribution could be straight¬ 
ened by decreasing the current through the centre section of the furnace winding 
and a ooncave one by increasing the current. A differential thermocouple was 
therefore used tor detect the form of the distribution, and to oontrol it by regulating 



404 J. Gadsby and K. W. Sykes 

with an additional relay circuit the power supplied to the centre section of the 
furnace winding. 

Figure 3. Circuit for automatic control of temperature distribution. 

The arrangement finally adopted is illustrated in figure 3. The composite 
thermocouple T consisted of a single 26 s.w.g. chromel wire with alumel wires of the same 
gauge welded to it. Except at the junctions the wires were carried in narrow, 
multiple-bore silica tubing, and the whole thermocouple was placed in the central 
sheath of the reaction tube with its lowest junction at the bottom of the reaction 
zone. This junction, with the potentiometer P, mirror galvanometer $G_2$ and relay 2 
shorting the variable resistance in series with the furnace windings, controlled 
the temperature at the bottom of the reaction zone. The e.m.f. between the two lower 
alumel leads, indicating the nature of the distribution, was applied to the mirror 
galvanometer $G_1$ associated with relay 1; the latter placed an additional resistance 
$R_4$ in parallel with the centre section of the furnace winding when the distribution 
became convex. $R_4$ was chosen so that the operation of the regulator caused a change 
of about 20 % in the power supplied to the centre section of the furnace winding. 
Any tendency for the distribution to fluctuate unsymmetrically could easily be 
seen from the movement of $G_1$ and corrected by a small adjustment of the main 
resistance across the middle section. A form of coupling between the two control 
circuits was caused by the drop in the effective resistance of the furnace when $R_4$ 
was in circuit. This was eliminated by introducing a compensating resistance $R_5$ in 
series with the furnace; $R_4$ and $R_5$ were brought into circuit together by relay 1 


















The control of the temperature of endothermic reactions in flow systems 405 

and $R_5$ was adjusted so that the total resistance remained constant. To minimize 
interaction in the thermocouple itself, the leads, and particularly the alumel one 
which is common to both circuits, should have a low resistance. The 26 s.w.g. wires 
constituting the thermocouple were therefore joined as near the furnace as possible 
to heavier leads of the same materials, and these in turn were soldered to 18 s.w.g. 
copper wire, the copper-chromel and copper-alumel junctions being kept at 0° C. 
The third junction was added to the thermocouple to indicate on the galvanometer 

the difference in temperature between the top and bottom of the reaction zone; 
this difference could be kept small by an occasional adjustment of R x and The 
junctions operating the control circuits were placed at the lower end of the reaction 
zone, since the rates of reaction were greatest in that region on account of the 
retarding action of the products formed as the gas stream passed upwards through 
the charcoal column. 
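
The decoupling condition described above can be made explicit. The following is a minimal sketch under stated assumptions, not the authors' design procedure: the centre section (together with its main shunt) is treated as a single pure resistance `r_centre`, the line current is taken as unchanged because the total resistance is held constant, and all names and numerical values are hypothetical.

```python
def parallel(r_a, r_b):
    """Resistance of r_a and r_b connected in parallel."""
    return r_a * r_b / (r_a + r_b)


def compensating_resistance(r_centre, r_shunt):
    """Series resistance that keeps the total furnace resistance unchanged
    when r_shunt is switched across the centre section r_centre."""
    return r_centre - parallel(r_centre, r_shunt)


def centre_power_ratio(r_centre, r_shunt):
    """Power in the centre section with the shunt in circuit, relative to the
    power without it, for an unchanged total current (assumed)."""
    current_fraction = r_shunt / (r_centre + r_shunt)
    return current_fraction ** 2


# Illustrative values only (ohms); not taken from the paper.
r_c, r_shunt = 10.0, 90.0
r_comp = compensating_resistance(r_c, r_shunt)
ratio = centre_power_ratio(r_c, r_shunt)
print(r_comp, ratio)
```

With these illustrative values the shunt lowers the centre-section power by roughly one-fifth, of the same order as the 20 % change quoted in the text, while the series resistance restores the total seen by the other control circuit.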

The use of the two control circuits in this way was found to be satisfactory; the 
temperature distribution was generally uniform to within 2° C and after careful 
adjustment the variation at any point was of the order of ± 0.5° C. Some typical 
records of the temperature distribution during the measurements of the rate of the 
carbon dioxide reaction at a series of pressures and temperatures are shown in 
figure 2, curves (k), (l), (m) and (n); they may be compared with the distributions 
above obtained without the differential control. For the steam reaction, the 
behaviour of the galvanometers indicated that the thermal conditions were equally 
satisfactory. 



Figure 4. Photocell-operated relay circuit. $V_1$, $V_2$ rectifying valves; $V_3$ caesium gas-filled 
photocell; $V_4$ thyratron; $V_5$ thermal delay switch; X relay; R a resistance comparable with 
that of the photocell when illuminated. 





The photocell-operated relay circuit 

The circuit diagram of the photocell-operated relay circuit is shown in figure 4. 
A thyratron valve $V_4$ fed with an alternating anode voltage is used as a relay, 
controlled by a photocell $V_3$ in the grid circuit. The photocell is incorporated in a 
potentiometer network which is supplied with direct current from an auxiliary rectifying 
system ($V_1$, $V_2$). When the photocell is in the dark, its resistance is large compared 
with that of R and the grid of the thyratron is held sufficiently negative for the valve 
to pass no current. On illuminating the photocell, its resistance falls and the 
consequent decrease of the negative grid bias allows the thyratron to operate the relay X. 
This circuit is in some ways more cumbersome than the phase-shift circuits which 
are commonly used, but it has proved simple in construction and adjustment, and 
reliable in operation. 

The authors are grateful to Professor C. N. Hinshelwood, F.R.S., for his interest 
and encouragement. This work forms part of the programme of the Fuel Research 
Board of the Department of Scientific and Industrial Research, and this paper is 
published by permission of the Department. One of the authors (J.G.) was a member 
of the Department’s staff. The illustrations are Crown copyright and are published 
by permission of the Controller of His Majesty’s Stationery Office. 


Reference 

Norrish, R. G. W. & Wallace, J. 1934 Proc. Roy. Soc. A, 145, 307. 



The use of interlacing nets for the application of relaxation 
methods to problems involving two dependent variables 


By D. C. Gilles 

With a foreword by W. G. Bickley 
(Communicated by R. V. Southwell, F.R.S.—Received 28 November 1947) 


In the solution, by relaxation methods, of partial differential equations involving more than 
one dependent variable, it has been customary to find the values of all such variables at the 
nodes of one net. By considering the relative accuracy of the finite difference approximations 
to the individual terms of the equations, and for other reasons, it is suggested that values of 
some of the variables be determined at the centres of the meshes, which amounts to using two 
interlacing nets. This idea is developed, and the method is applied to a variety of problems. 


Foreword 


Problems involving more than one dependent variable are common in applied 
mathematics, and while in general an attempt is made to eliminate all but one of 
these so as to be able to restrict attention to a single ‘unknown’, this is not always 
possible, nor is it always convenient. An instance occurs in two-dimensional elastic 
problems, where the boundary conditions in terms of the stress function are com¬ 
plicated. In applying relaxation methods, it is simpler to work throughout in terms 
of the components u, v of the displacement, and the so-called ‘u-v technique’ was 
invented (Fox & Southwell 1945) for this purpose. 

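Denoting Cartesian co-ordinates by x, y, and Poisson’s ratio by $\sigma$, one of the 
governing equations is 

$$2(1-\sigma)\frac{\partial^2 u}{\partial x^2}+(1-2\sigma)\frac{\partial^2 u}{\partial y^2}+\frac{\partial^2 v}{\partial x\,\partial y}=0. \tag{1}$$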


The relaxation technique starts from an approximation to this in terms of finite 
differences, involving values of the wanted functions u, v, at nodes of a net, usually 
a square net, say of mesh length a. A critical examination of the method indicated 
that while the effective mesh length used for $\partial^2 u/\partial x^2$ and $\partial^2 u/\partial y^2$ was a, that used 
for $\partial^2 v/\partial x\,\partial y$ was really 2a. 

To see this, note first that, using suffixes as indicated in figure 1 to denote the 
location of the point at which a value of u or v is taken, 

$$\frac{u_1-u_0}{a}\quad\text{is an approximation to}\quad\frac{\partial u}{\partial x}$$

centred at the mid-point of the segment 01, and so 

$$\frac{1}{a}\left(\frac{u_1-u_0}{a}-\frac{u_0-u_3}{a}\right)\quad\text{is an approximation to}\quad\frac{\partial^2 u}{\partial x^2}$$

centred at 0, and using consistently the mesh length a. But, with values of v given 
at the same mesh points, the best we can do for a finite difference approximation to 
$\partial^2 v/\partial x\,\partial y$ is, writing $v(x, y)$ for the value of v at the node displaced by $(x, y)$ from 0, 

$$\frac{1}{2a}\left\{\frac{v(a,a)-v(a,-a)}{2a}-\frac{v(-a,a)-v(-a,-a)}{2a}\right\},$$

where the first term represents the finite difference approximation to $\partial v/\partial y$ centred 
at the point 1, and where the fact that the effective mesh length is 2a is abundantly 
evident. 
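
The difference in accuracy can be checked numerically. The sketch below is illustrative only: it compares the two approximations to the cross-derivative for an arbitrarily chosen test function, $\sin x \sin y$, which does not come from the paper, and the function names are hypothetical.

```python
import math


def v(x, y):
    # Test function (an assumption); its exact cross-derivative is cos(x)*cos(y).
    return math.sin(x) * math.sin(y)


def cross_same_net(f, x, y, a):
    """Cross-derivative from nodes of the same square net (effective mesh 2a)."""
    return (f(x + a, y + a) - f(x + a, y - a)
            - f(x - a, y + a) + f(x - a, y - a)) / (4 * a * a)


def cross_interlaced(f, x, y, a):
    """Cross-derivative from values at the four surrounding mesh centres
    (effective mesh a), as used with interlacing nets."""
    h = a / 2
    return (f(x + h, y + h) - f(x + h, y - h)
            - f(x - h, y + h) + f(x - h, y - h)) / (a * a)


x0, y0, a = 0.3, 0.4, 0.1
exact = math.cos(x0) * math.cos(y0)
err_same = abs(cross_same_net(v, x0, y0, a) - exact)
err_inter = abs(cross_interlaced(v, x0, y0, a) - exact)
ratio = err_same / err_inter
print(err_same, err_inter, ratio)
```

Since both errors are of second order in the effective mesh length, halving that length should reduce the error by a factor close to four, which is what the ratio indicates for this test function.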

If we wish the mesh length associated with v to be a, so that the accuracy of the 
v-term in equation (1) may be comparable with that of the other terms, then we must 
use values of v at the centres of the squares of the net of figure 1, i.e. we must use a 
separate net for v, which interlaces that for u, as indicated in figure 2. We note 
immediately that the u-points are at the centres of the squares of the v-net; the 
relation between the nets is reciprocal, and consequently the finite difference 
approximation to the other governing equation, 

$$(1-2\sigma)\frac{\partial^2 v}{\partial x^2}+2(1-\sigma)\frac{\partial^2 v}{\partial y^2}+\frac{\partial^2 u}{\partial x\,\partial y}=0, \tag{2}$$

can be expressed, consistently as to mesh length a, in terms of values at the nodes 
of the same two interlacing nets. 

Figure 1 

Figure 2. The interlacing nets. 

These ideas were reinforced, coincidentally, by papers by Kron (1944) and Carter 
(1944) in which an electric circuit analogy, and a network analyzer, were applied 
to problems in elasticity. In these, u and v are given by potentials at nodes of 
an electrical network, and it is clearly necessary that u and v be measured at 
different points, i.e. that two networks (appropriately coupled or interconnected, 
of course) be used. 

Once the idea of interlacing nets is accepted, it becomes clear that there are many 
other problems involving two unknowns to which they can appropriately be applied. 
The most obvious disadvantage, that the two unknowns are not determined at the 
same points, is comparatively trivial compared with the two main advantages, the 
first (minor) that the use of interlacing nets enables the ‘coupling’ effects to be more 
accurately taken into account, and the second (major) that one is dealing with 
differential equations of lower order than if one variable is first eliminated between 
the governing equations. For instance, the equations (1) and (2) are each of the 
second order, whereas the corresponding equation for the stress function is of the 
fourth order; the finite difference approximation to this equation is more com¬ 
plicated, and also extends over a region of the x-y plane four times as large. In what 
is probably the simplest instance of the use of interlacing nets, the Laplace equation 
(second order) is replaced by the Cauchy-Riemann equations, and both the conjugate 
functions are concurrently determined. 

The ensuing sections illustrate the use of interlacing nets in the solution of a 
representative variety of problems by relaxation methods. The actual problems are: 

(i) Plane strain with boundary displacements given (§§ 1 to 4). 

(ii) Stresses in a solid of revolution (§§ 5 to 8). 

(iii) The conjugate plane-harmonic functions (§§ 9 to 12). 

(iv) The quasi-plane-harmonic equation (§§ 13 and 14). 

The order in which they are given is that in which they were attacked, rather than 
that of their mathematical content or complexity. W.G.B. 


Plane strain 


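1. If u and v are the displacements in the x and y directions respectively, then 
for the two-dimensional elastic problem, the equations of equilibrium may be 
written as 

$$\left.\begin{aligned}
2(1-\sigma)\frac{\partial^2 u}{\partial x^2}+(1-2\sigma)\frac{\partial^2 u}{\partial y^2}+\frac{\partial^2 v}{\partial x\,\partial y}&=0,\\
(1-2\sigma)\frac{\partial^2 v}{\partial x^2}+2(1-\sigma)\frac{\partial^2 v}{\partial y^2}+\frac{\partial^2 u}{\partial x\,\partial y}&=0,
\end{aligned}\right\} \tag{3}$$

where $\sigma$ is Poisson’s ratio (Love 1927, § 94). 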

Using interlacing nets of mesh length a as indicated in the introduction, a typical 
u-point may be considered, denoted by 0, its four surrounding points 1, 2, 3, 4 and 
the v-points v, vi, vii, viii. Thus finite difference approximations to the terms in 
the first of the equations (3) are given as expressions involving the values of u and 
v at these 9 points by 

$$\frac{\partial^2 u}{\partial x^2}=\frac{1}{a^2}(u_1+u_3-2u_0),\qquad
\frac{\partial^2 u}{\partial y^2}=\frac{1}{a^2}(u_2+u_4-2u_0),\qquad
\frac{\partial^2 v}{\partial x\,\partial y}=\frac{1}{a^2}(v_{\mathrm v}-v_{\mathrm{vi}}+v_{\mathrm{vii}}-v_{\mathrm{viii}}),$$

with an error in each case of order $a^2$. 



Hence the equation may be replaced by its finite difference equivalent, viz. 

$$2(1-\sigma)(u_1+u_3-2u_0)+(1-2\sigma)(u_2+u_4-2u_0)+v_{\mathrm v}-v_{\mathrm{vi}}+v_{\mathrm{vii}}-v_{\mathrm{viii}}=0,$$

or in the ‘residual’ form, 

$$(F_u)_0\equiv 2(1-\sigma)(u_1+u_3)+(1-2\sigma)(u_2+u_4)-2(3-4\sigma)u_0+v_{\mathrm v}-v_{\mathrm{vi}}+v_{\mathrm{vii}}-v_{\mathrm{viii}}=0. \tag{4}$$

Because of the reciprocal relation between the nets, any typical v-point may be 
denoted by 0 and the surrounding points lettered as in the diagram (figure 3) whence 
the second of the equations (3) above may be written as 

$$(F_v)_0\equiv (1-2\sigma)(v_1+v_3)+2(1-\sigma)(v_2+v_4)-2(3-4\sigma)v_0+u_{\mathrm v}-u_{\mathrm{vi}}+u_{\mathrm{vii}}-u_{\mathrm{viii}}=0. \tag{5}$$

The field of work is thus covered with these two interlacing nets, the continuous 
lines for u and the broken lines for v. At all nodes on the u-net there will be values of u 
and the u-residuals, calculated from the equation (4) above. Similarly, at all nodes 
of the v-net, there will be values of v and v-residuals calculated from equation (5). 
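
The residual formulae (4) and (5) translate directly into code. The sketch below is a minimal illustration; the function and argument names are hypothetical, and the roman-numbered arguments are the four values of the other variable at the corners of the surrounding square, taken in the order v, vi, vii, viii.

```python
def residual_u(u0, u1, u2, u3, u4, v_v, v_vi, v_vii, v_viii, sigma):
    """Residual (4) at a u-point 0 with u-neighbours 1..4 and v-points v..viii."""
    return (2 * (1 - sigma) * (u1 + u3)
            + (1 - 2 * sigma) * (u2 + u4)
            - 2 * (3 - 4 * sigma) * u0
            + v_v - v_vi + v_vii - v_viii)


def residual_v(v0, v1, v2, v3, v4, u_v, u_vi, u_vii, u_viii, sigma):
    """Residual (5) at a v-point 0 with v-neighbours 1..4 and u-points v..viii."""
    return ((1 - 2 * sigma) * (v1 + v3)
            + 2 * (1 - sigma) * (v2 + v4)
            - 2 * (3 - 4 * sigma) * v0
            + u_v - u_vi + u_vii - u_viii)


# A uniform displacement field produces no residual, whatever the value of sigma.
uniform = residual_u(5.0, 5.0, 5.0, 5.0, 5.0, 2.0, 2.0, 2.0, 2.0, sigma=0.3)
# A unit value of u at the centre alone gives the central coefficient of the u-pattern.
centre = residual_u(1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, sigma=0.3)
print(uniform, centre)
```

For $\sigma = 0.3$ the central coefficient evaluates to $-2(3-4\sigma) = -3.6$.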




Figure 3 Figure 4 

Figure 5. The relaxation patterns (u-pattern and v-pattern). 


The relaxation process itself is effected with the aid of two ‘relaxation patterns’, 
shown in the diagram (figure 5). These are patterns giving the effect on the residuals 
in the region of the point of adding a unit increment to the value of u or v at that 
point. Similar patterns have been discussed in other papers on the subject, and 
interlacing nets introduce here no other new feature as regards their use. 









2. It is obvious, however, that the boundary conditions must require some 
special consideration. If the boundaries are rectangular, the nodes of both nets can 
be made to lie on the boundaries by taking the mesh lines at 45° to the boundary 
lines (figure 6). 

Then, the values of both u and v on the boundaries being known, the residuals 
may be calculated for both u and v all over the region. 

In all other types of boundary, and for the rectangular case where the boundaries 
are made to coincide with mesh lines, the problem cannot be solved so easily. 



Figure 6 Figure 7 


Consider a typical u-point lying near to such a boundary, with the mesh line 
leading to the point 1 cut by the boundary, so that we have an ‘irregular star’ 
(figure 7). The incomplete line 01 may be completed, and led to the point 1 outside 
the boundary, which is called a ‘fictitious’ point. 

Let $OB = \xi a$, and expand u in the line 301 in powers of x so that 

$$u=u_0+\alpha x+\beta x^2+\gamma x^3+\ldots.$$

Hence, substituting, 

$$u_B=u_0+\alpha\xi a+\beta\xi^2a^2+\gamma\xi^3a^3+\ldots,$$
$$u_1=u_0+\alpha a+\beta a^2+\gamma a^3+\ldots,$$
$$u_3=u_0-\alpha a+\beta a^2-\gamma a^3+\ldots.$$

Thus 

$$u_1+u_3=2u_0+2\beta a^2+O(a^4),$$
$$u_B+\xi u_3=(1+\xi)u_0+\xi(1+\xi)\beta a^2+O(a^3),$$

so, with an error of order $a^3$, 

$$u_1+u_3-2u_0=\frac{2\{u_B+\xi u_3-(1+\xi)u_0\}}{\xi(1+\xi)},$$

i.e. 

$$u_1=\frac{2}{\xi(1+\xi)}u_B+\frac{1-\xi}{1+\xi}u_3-\frac{2(1-\xi)}{\xi}u_0. \tag{6}$$

This enables the value of u at the fictitious point 1 to be calculated when the values 
at the points inside are known. Thus the residuals involving the value of u at this 
fictitious point can be calculated. For a period of the relaxation the value at this 
point can be treated as constant, and then when the residuals are checked, it may be 
adjusted, and then treated as a constant again. 
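
Formula (6) for the fictitious value is easily checked, since it is exact whenever u varies quadratically along the line 301. The sketch below is illustrative only; the function name and the test polynomial are assumptions, not part of the paper.

```python
def fictitious_value(u_b, u_0, u_3, xi):
    """Value at the fictitious point 1 from the boundary value u_b (at distance
    xi*a from 0), the central value u_0 and the opposite value u_3; formula (6)."""
    return (2.0 / (xi * (1.0 + xi)) * u_b
            + (1.0 - xi) / (1.0 + xi) * u_3
            - 2.0 * (1.0 - xi) / xi * u_0)


def u(x):
    # Arbitrary quadratic test function along the line 301 (an assumption).
    return 2.0 + 3.0 * x + 5.0 * x * x


a, xi = 0.5, 0.25
estimate = fictitious_value(u(xi * a), u(0.0), u(-a), xi)
print(estimate, u(a))

# Special case used for a net straddling the boundary: xi = 1/2 and u_B = 0.
half = fictitious_value(0.0, 1.0, 3.0, 0.5)
print(half)
```

With $\xi = \tfrac12$ and $u_B = 0$ the formula reduces to $u_1 = \tfrac13 u_3 - 2u_0$.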




Alternatively this value of u may be introduced into the formulae for the residuals 
at those points involving the value at the point 1, altering the relaxation patterns 
at these points. This is usually the best procedure when the boundaries are straight 
lines and the nodes of one net lie on the boundary. 

Similar results hold when any other arm is also a short arm, and also for the value 
of v at a fictitious point. 

3. As an example, consider the extension of a rectangular steel plate ($\sigma = 0.3$) 
the sides of which are in the ratio 3:1 and for which the boundary displacements are (figure 8) 

AB, BC, CD: $u = 0$, $v = 0$; 

AD: $u = 0$, $v = 1000\{4(x/L)^2 - 9\}$. 

There is symmetry about the y-axis, where clearly $u = 0$, so that only the 
right-hand half of the plate will be considered. 



Figure 8 (side BC of length 3L) 


Two ways of placing the nets for convenience are open to us, with the mesh 
lines parallel to, or at 45° with, the sides of the rectangle. We choose the former. 

Taking the v-net to lie on the boundary, the u-net therefore straddles it. For these 
u-points $\xi = \tfrac12$ and $u = 0$ all round, so that the fictitious value is given by 

$$u_1=\tfrac{8}{3}u_B+\tfrac{1}{3}u_3-2u_0=\tfrac{1}{3}u_3-2u_0\qquad(u_B=0),$$

and similarly when $u_2$, $u_4$ are fictitious. 

The general equations are, since $\sigma = 0.3$, 

$$(F_u)_0\equiv 1.4(u_1+u_3)+0.4(u_2+u_4)-3.6u_0+v_{\mathrm v}-v_{\mathrm{vi}}+v_{\mathrm{vii}}-v_{\mathrm{viii}}=0,$$
$$(F_v)_0\equiv 0.4(v_1+v_3)+1.4(v_2+v_4)-3.6v_0+u_{\mathrm v}-u_{\mathrm{vi}}+u_{\mathrm{vii}}-u_{\mathrm{viii}}=0,$$

and for the incomplete stars, the value of u was substituted in the residual formula 
from the relation above. 

In practice the coefficients were multiplied by 15 to make them all integers. 

A mesh length, a « \L, was first considered, and then by trisecting this, a mesh 
length a » which had the advantage that all the previous nodes were still nodes 
of the new nets. From these results, estimates were made for values at the nodes of 
nets of mesh length a * \L and the solution for this net size obtained to compare 
with that for a net of mesh length a » \L but without interlacing nets (Allen & 
Southwell 1944). These last three results are given (figures 9 to 11). 





Figure 10. Displacements in a plate, mesh length a = JL. 




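If the four surrounding points, at which values of u and v are known, are labelled 
i, ii, iii, iv it may be seen that, centred at this point A, 

$$\left(\frac{\partial u}{\partial x}\right)_A=\frac{1}{a}(u_{\mathrm i}-u_{\mathrm{iii}}),\qquad
\left(\frac{\partial v}{\partial y}\right)_A=\frac{1}{a}(v_{\mathrm{ii}}-v_{\mathrm{iv}}),$$

with effective mesh length a, so that the accuracy is not impaired. Hence at an 
A-point 

$$\left.\begin{aligned}
\widehat{xx}&=\{(\lambda+2\mu)(u_{\mathrm i}-u_{\mathrm{iii}})+\lambda(v_{\mathrm{ii}}-v_{\mathrm{iv}})\}/a,\\
\widehat{yy}&=\{\lambda(u_{\mathrm i}-u_{\mathrm{iii}})+(\lambda+2\mu)(v_{\mathrm{ii}}-v_{\mathrm{iv}})\}/a.
\end{aligned}\right\} \tag{8}$$

Likewise at a B-point, expressions for $(\partial v/\partial x)$, $(\partial u/\partial y)$ may be obtained, and 

$$\widehat{xy}=\mu(v_{\mathrm i}-v_{\mathrm{iii}}+u_{\mathrm{ii}}-u_{\mathrm{iv}})/a. \tag{9}$$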

Thus the stresses may all be evaluated. They have been computed for the plate 
problem, using the results for the mesh length a - \L , and are given in figure 13. 


Stresses in a solid of revolution 

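5. If U and w are the displacements in the r- and z-directions in a stressed solid 
of revolution, then the dilatation $\Delta$ and the rotation $\varpi$ are given by 

$$\Delta=\frac{\partial U}{\partial r}+\frac{U}{r}+\frac{\partial w}{\partial z},\qquad
2\varpi=\frac{\partial U}{\partial z}-\frac{\partial w}{\partial r}.$$

Putting $u = rU$, these may be reduced to 

$$\Delta=\frac{1}{r}\frac{\partial u}{\partial r}+\frac{\partial w}{\partial z},\qquad
2\varpi=\frac{1}{r}\frac{\partial u}{\partial z}-\frac{\partial w}{\partial r},$$

and the equations of equilibrium in terms of u and w can then be shown to be 

$$\left.\begin{aligned}
2(1-\sigma)\,r\frac{\partial}{\partial r}\left(\frac{1}{r}\frac{\partial u}{\partial r}\right)+(1-2\sigma)\frac{\partial^2 u}{\partial z^2}+r\frac{\partial^2 w}{\partial r\,\partial z}&=0,\\
(1-2\sigma)\frac{1}{r}\frac{\partial}{\partial r}\left(r\frac{\partial w}{\partial r}\right)+2(1-\sigma)\frac{\partial^2 w}{\partial z^2}+\frac{1}{r}\frac{\partial^2 u}{\partial r\,\partial z}&=0,
\end{aligned}\right\} \tag{10}$$

where $\sigma$ is Poisson’s ratio (Love 1927, § 99). 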

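Using interlacing nets as before, the continuous lines for u and the broken lines 
for w, a typical u-point can be denoted by 0, its four surrounding u-points by 1, 2, 3, 4 
and the w-points by v, vi, vii, viii. If the mid-points of the arms 01, 02, 03, 04 are 
lettered i, ii, iii, iv as shown in figure 14, then using a mesh length a 

$$a\left(\frac{\partial u}{\partial r}\right)_{\mathrm{ii}}=u_2-u_0+O(a^3),\qquad
a\left(\frac{\partial u}{\partial r}\right)_{\mathrm{iv}}=u_0-u_4+O(a^3),$$

so that 

$$a\left\{\left(\frac{1}{r}\frac{\partial u}{\partial r}\right)_{\mathrm{ii}}-\left(\frac{1}{r}\frac{\partial u}{\partial r}\right)_{\mathrm{iv}}\right\}
=\frac{1}{r_{\mathrm{ii}}}(u_2-u_0)-\frac{1}{r_{\mathrm{iv}}}(u_0-u_4)+O(a^3).$$

Thus with an error of order $a^3$, 

$$a^2\left\{r\frac{\partial}{\partial r}\left(\frac{1}{r}\frac{\partial u}{\partial r}\right)\right\}_0
=\frac{r_0}{r_{\mathrm{ii}}}(u_2-u_0)+\frac{r_0}{r_{\mathrm{iv}}}(u_4-u_0),$$

and, as before, to the same order of error, 

$$a^2\frac{\partial^2 u}{\partial z^2}=u_1+u_3-2u_0,\qquad
a^2\frac{\partial^2 w}{\partial r\,\partial z}=w_{\mathrm v}-w_{\mathrm{vi}}+w_{\mathrm{vii}}-w_{\mathrm{viii}}.$$

Figure 14 

Thus the first of equations (10) may be replaced by the finite difference equivalent, 

$$(F_u)_0\equiv 2(1-\sigma)\left\{\frac{r_0}{r_{\mathrm{ii}}}(u_2-u_0)+\frac{r_0}{r_{\mathrm{iv}}}(u_4-u_0)\right\}
+(1-2\sigma)(u_1+u_3-2u_0)+r_0(w_{\mathrm v}-w_{\mathrm{vi}}+w_{\mathrm{vii}}-w_{\mathrm{viii}})=0. \tag{11}$$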


In a like manner, a typical w-point may be denoted by 0, and the surrounding 
points may be lettered as in figure 14, whence finite difference approximations for 
the terms in the second of equations (10) may be obtained; in particular, that for 
the first term is 

$$a^2\left\{\frac{1}{r}\frac{\partial}{\partial r}\left(r\frac{\partial w}{\partial r}\right)\right\}_0
=\frac{r_{\mathrm{ii}}}{r_0}(w_2-w_0)+\frac{r_{\mathrm{iv}}}{r_0}(w_4-w_0).$$

Hence the finite difference equivalent of this equation is 

$$(F_w)_0\equiv (1-2\sigma)\left\{\frac{r_{\mathrm{ii}}}{r_0}(w_2-w_0)+\frac{r_{\mathrm{iv}}}{r_0}(w_4-w_0)\right\}
+2(1-\sigma)(w_1+w_3-2w_0)+\frac{1}{r_0}(u_{\mathrm v}-u_{\mathrm{vi}}+u_{\mathrm{vii}}-u_{\mathrm{viii}})=0. \tag{12}$$

From these equations, the residuals may be calculated and recorded as in the 
problem of plane strain (§ 1 and figure 4). The relaxation patterns can be obtained 
from the equations (11) and (12) but are more complicated, and since they involve 
values of r, they vary from line to line. 

In practice, it was found convenient to bring the coefficients of $u_1$, $u_3$ and $w_1$, $w_3$ 
to unity by division by the appropriate factor. 
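
The residuals (11) and (12) can be sketched in code as follows. This is an illustration under stated assumptions rather than a record of the author's working: points 2 and 4 are taken at radii $r_0 + a$ and $r_0 - a$, so that $r_{\mathrm{ii}} = r_0 + \tfrac12 a$ and $r_{\mathrm{iv}} = r_0 - \tfrac12 a$, and all function and argument names are hypothetical. The check uses the uniform expansion $U = r$, $w = z$ (that is, $u = r^2$), which satisfies equations (10) exactly.

```python
def residual_u_axi(u0, u1, u2, u3, u4, w_v, w_vi, w_vii, w_viii, r0, a, sigma):
    """Residual (11) at a u-point; points 2 and 4 lie at r0 + a and r0 - a (assumed)."""
    r_ii, r_iv = r0 + a / 2, r0 - a / 2
    return (2 * (1 - sigma) * (r0 / r_ii * (u2 - u0) + r0 / r_iv * (u4 - u0))
            + (1 - 2 * sigma) * (u1 + u3 - 2 * u0)
            + r0 * (w_v - w_vi + w_vii - w_viii))


def residual_w_axi(w0, w1, w2, w3, w4, u_v, u_vi, u_vii, u_viii, r0, a, sigma):
    """Residual (12) at a w-point, with the same layout of points (assumed)."""
    r_ii, r_iv = r0 + a / 2, r0 - a / 2
    return ((1 - 2 * sigma) * (r_ii / r0 * (w2 - w0) + r_iv / r0 * (w4 - w0))
            + 2 * (1 - sigma) * (w1 + w3 - 2 * w0)
            + (u_v - u_vi + u_vii - u_viii) / r0)


# Check with u = r**2, w = z at the point r0 = 2, z0 = 1, mesh length a = 0.5.
r0, z0, a, h = 2.0, 1.0, 0.5, 0.25
u = lambda r, z: r * r
w = lambda r, z: z
f_u = residual_u_axi(
    u(r0, z0), u(r0, z0 + a), u(r0 + a, z0), u(r0, z0 - a), u(r0 - a, z0),
    w(r0 + h, z0 + h), w(r0 + h, z0 - h), w(r0 - h, z0 - h), w(r0 - h, z0 + h),
    r0, a, sigma=0.3)
f_w = residual_w_axi(
    w(r0, z0), w(r0, z0 + a), w(r0 + a, z0), w(r0, z0 - a), w(r0 - a, z0),
    u(r0 + h, z0 + h), u(r0 + h, z0 - h), u(r0 - h, z0 - h), u(r0 - h, z0 + h),
    r0, a, sigma=0.3)
print(f_u, f_w)
```

Both residuals vanish (to rounding error) for this field, which is a useful check on the coefficients before any relaxation is attempted.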



6. In this problem also, owing to the nature of the nets, the boundary conditions 
require special consideration. Even with rectangular boundaries, the nodes of both 
nets cannot in general here be made to lie on the boundaries since the directions of 
the axes of r and z (and hence of the nets) are inherent in the form of the equations 
(10). If any change is made in these directions, the form of the equations is altered, 
and they become even more complicated. 

If the displacements U and w are given on the boundaries, then any ‘irregular 
stars’ may be tackled as in the problem of plane strain (§ 2) using the formula (6) 
there obtained. 

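Clearly, however, another type of condition could be imposed, that of known 
stresses over all or part of the boundaries. It may be shown (Love 1927, § 99) that the 
stresses are expressed in terms of u and w by the equations 

$$\left.\begin{aligned}
\widehat{rr}&=2(\lambda+\mu)\left\{\frac{\sigma}{r}\frac{\partial u}{\partial r}+(1-2\sigma)\frac{\partial}{\partial r}\left(\frac{u}{r}\right)+\sigma\frac{\partial w}{\partial z}\right\},\\
\widehat{zz}&=2(\lambda+\mu)\left\{\frac{\sigma}{r}\frac{\partial u}{\partial r}+(1-\sigma)\frac{\partial w}{\partial z}\right\},\\
\widehat{zr}&=\mu\left(\frac{1}{r}\frac{\partial u}{\partial z}+\frac{\partial w}{\partial r}\right).
\end{aligned}\right\} \tag{13}$$

If the boundary is a line parallel to the r- or z-axis, then two of these stresses will 
be known, and the corresponding equations can be used to give the values of u 
and w along this part of the boundary, and at any fictitious point that may occur. 

If the boundary is not such a line, then the stresses may be written in terms of 
$\widehat{\nu r}$ and $\widehat{\nu z}$, where $\nu$ is the direction of the outward normal. Then, referring to figure 15, 
these may be taken in terms of the stresses in equations (13) by (Love 1927, § 47) 

$$\widehat{\nu r}=\widehat{rr}\cos(r,\nu)+\widehat{rz}\sin(r,\nu),\qquad
\widehat{\nu z}=\widehat{zz}\sin(r,\nu)+\widehat{rz}\cos(r,\nu). \tag{14}$$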

These relations may be used, together with the equations (13) to convert the stress 
boundary conditions to conditions in u and w. 

7. As an example of mixed boundary conditions, the problem taken was that of 
the displacements in a solid steel cylinder ($\sigma = 0.3$) radius R and length 3R, subject 
to the boundary conditions $\widehat{zz} = \widehat{zr} = 0$ over the ends, and $U = 1000R$, $w = 0$ over 
the surface (figure 16). 

Considering a section through the axis it is evident that there is symmetry about 
the r-axis, where $w = 0$ and hence only one-half of the section need be considered. 
The boundary conditions along AB suggest that the u-net should be taken to lie 
on this line so that the values at the fictitious points for w are given by 

$$w_2=\tfrac{8}{3}w_B+\tfrac{1}{3}w_4-2w_0=\tfrac{1}{3}w_4-2w_0,$$

since $\xi = \tfrac12$ and $w_B = 0$. 



On the boundary BC, the stress conditions give, from equations (13), 

$$\widehat{zz}=2(\lambda+\mu)\left\{\frac{\sigma}{r}\frac{\partial u}{\partial r}+(1-\sigma)\frac{\partial w}{\partial z}\right\}=0,\qquad
\widehat{zr}=\mu\left(\frac{1}{r}\frac{\partial u}{\partial z}+\frac{\partial w}{\partial r}\right)=0. \tag{15}$$

From the first of these equations 

$$\sigma\frac{\partial}{\partial r}\left(\frac{1}{r}\frac{\partial u}{\partial r}\right)+(1-\sigma)\frac{\partial^2 w}{\partial r\,\partial z}=0 \tag{16}$$

is obtained. Let the u-net lie on the boundary BC. Then a typical portion of this 
boundary is shown in figure 17 with the ‘fictitious’ u- and w-points introduced. 

Figure 16 

Considering a typical u-point lying on the boundary, and lettering the points in the 
normal manner, the finite difference approximations to the second of equations (15) 
and to equation (16) can be shown to be, centred at this point 0, 

$$u_1-u_3+r_0(w_{\mathrm v}+w_{\mathrm{vi}}-w_{\mathrm{vii}}-w_{\mathrm{viii}})=0,$$
$$\frac{\sigma}{r_{\mathrm{ii}}}(u_2-u_0)-\frac{\sigma}{r_{\mathrm{iv}}}(u_0-u_4)+(1-\sigma)(w_{\mathrm v}-w_{\mathrm{vi}}+w_{\mathrm{vii}}-w_{\mathrm{viii}})=0.$$

Hence, in the equation for the residual at this point, viz. 

$$(F_u)_0\equiv 2(1-\sigma)\left\{\frac{r_0}{r_{\mathrm{ii}}}(u_2-u_0)+\frac{r_0}{r_{\mathrm{iv}}}(u_4-u_0)\right\}
+(1-2\sigma)(u_1+u_3-2u_0)+r_0(w_{\mathrm v}-w_{\mathrm{vi}}+w_{\mathrm{vii}}-w_{\mathrm{viii}})=0, \tag{11}$$

the value of u for the fictitious point 1 can be substituted, and one obtains 

$$(F_u)_0\equiv 2(1-\sigma)\left\{\frac{r_0}{r_{\mathrm{ii}}}(u_2-u_0)+\frac{r_0}{r_{\mathrm{iv}}}(u_4-u_0)\right\}+2(1-2\sigma)(u_3-u_0)
-r_0(1-2\sigma)(w_{\mathrm v}+w_{\mathrm{vi}}-w_{\mathrm{vii}}-w_{\mathrm{viii}})+r_0(w_{\mathrm v}-w_{\mathrm{vi}}+w_{\mathrm{vii}}-w_{\mathrm{viii}})=0.$$

This may be written as 

$$(F_u)_0\equiv 2(1-\sigma)\left\{\frac{r_0}{r_{\mathrm{ii}}}(u_2-u_0)+\frac{r_0}{r_{\mathrm{iv}}}(u_4-u_0)\right\}+2(1-2\sigma)(u_3-u_0)
+2\sigma r_0(w_{\mathrm v}-w_{\mathrm{vi}}+w_{\mathrm{vii}}-w_{\mathrm{viii}})-2r_0(1-2\sigma)(w_{\mathrm{vi}}-w_{\mathrm{vii}})=0,$$

and in virtue of the finite difference equivalent of equation (16), this is 

$$(F_u)_0\equiv\frac{2(1-2\sigma)}{1-\sigma}\left\{\frac{r_0}{r_{\mathrm{ii}}}(u_2-u_0)+\frac{r_0}{r_{\mathrm{iv}}}(u_4-u_0)\right\}
+2(1-2\sigma)(u_3-u_0)-2r_0(1-2\sigma)(w_{\mathrm{vi}}-w_{\mathrm{vii}})=0.$$

This may be divided by $2(1 - 2\sigma)$ to make the coefficient of $u_3$ unity, and, because 
$\sigma = 0.3$ in the example under consideration, it becomes finally 

$$(F_u)_0\equiv\frac{10}{7}\left\{\frac{r_0}{r_{\mathrm{ii}}}(u_2-u_0)+\frac{r_0}{r_{\mathrm{iv}}}(u_4-u_0)\right\}+u_3-u_0-r_0(w_{\mathrm{vi}}-w_{\mathrm{vii}})=0.$$

Hence the residual at this point can be found in terms of the values of u and w at 
points inside the boundary. 







At the point B, where the boundary is cut by a w-line, a finite difference 
approximation to the first of equations (15) may be found, and is, using the lettering 
indicated in the figure, 

$$\frac{\sigma}{r_B}(u_{\mathrm v}-u_{\mathrm{viii}})+(1-\sigma)(w_1-w_0)=0.$$

Since $r_B = r_0$ this may be written as 

$$w_1=w_0-\frac{\sigma}{(1-\sigma)r_0}(u_{\mathrm v}-u_{\mathrm{viii}})=w_0-\frac{3}{7r_0}(u_{\mathrm v}-u_{\mathrm{viii}}),$$

enabling the ‘fictitious’ value to be found in terms of the values of the functions 
at points inside the boundary. This may be substituted in the equation (12) for the 
w-residual involving this fictitious point. 

Naturally the relaxation patterns for these points will be altered, and the new 
patterns can easily be found. It is interesting to note that the patterns for the 
neighbouring points are not affected in any way. 

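The general residual formulae (11) and (12) become 

$$(F_u)_0\equiv u_1+u_3+\frac{7}{2}\left(\frac{r_0}{r_{\mathrm{ii}}}u_2+\frac{r_0}{r_{\mathrm{iv}}}u_4\right)
-\left\{2+\frac{7}{2}\left(\frac{r_0}{r_{\mathrm{ii}}}+\frac{r_0}{r_{\mathrm{iv}}}\right)\right\}u_0
+\frac{5r_0}{2}(w_{\mathrm v}-w_{\mathrm{vi}}+w_{\mathrm{vii}}-w_{\mathrm{viii}})=0,$$

$$(F_w)_0\equiv w_1+w_3+\frac{2}{7}\left(\frac{r_{\mathrm{ii}}}{r_0}w_2+\frac{r_{\mathrm{iv}}}{r_0}w_4\right)
-\left\{2+\frac{2}{7}\left(\frac{r_{\mathrm{ii}}}{r_0}+\frac{r_{\mathrm{iv}}}{r_0}\right)\right\}w_0
+\frac{5}{7r_0}(u_{\mathrm v}-u_{\mathrm{vi}}+u_{\mathrm{vii}}-u_{\mathrm{viii}})=0,$$

and those involving other fictitious points were obtained by substituting the values 
at these points in the general formulae. 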

A mesh length a « £ i? was first taken, and this followed by taking a = $B. The 
relaxation involved no great difficulty, but since the centre increment in the 
relaxation patterns was about $-9$ for an increment to u and $-2.6$ for an increment to w, 
the values of u were taken to one decimal place to complete the relaxation. This 
enabled the values of $U = u/r$ to be computed to the nearest integer to compare in 
accuracy with the w-values. This final solution is given in figure 18. 

8. As in all elastic problems it is the stresses themselves that are frequently of 
greater importance than the displacements, and they may be obtained from the 
recorded values of the displacements in a manner similar to that in the plane strain 
problem (§ 4). 

As in this earlier case, there are in addition to the nodes of the continuous line and 
broken line nets two other kinds of node which have been called A- and B-points, 
and it is at such points that the stresses can be evaluated. 




Figure 18 . Displacements in a cylinder, mesh length a = £/? 


1 




Figure 19. Stresses in the cylinder. 


[Note. These numbers must be multiplied by 10* and the ratio 
the stresses in lb./sq.in.] 


{/-extension on surface 
radius of cylinder, li 












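The stresses are expressed in terms of u and w by the equations (Love 1927, § 99) 

$$\left.\begin{aligned}
\widehat{rr}&=2(\lambda+\mu)\left\{\frac{\sigma}{r}\frac{\partial u}{\partial r}+(1-2\sigma)\frac{\partial}{\partial r}\left(\frac{u}{r}\right)+\sigma\frac{\partial w}{\partial z}\right\},\\
\widehat{\theta\theta}&=2(\lambda+\mu)\left\{\frac{\sigma}{r}\frac{\partial u}{\partial r}+(1-2\sigma)\frac{u}{r^2}+\sigma\frac{\partial w}{\partial z}\right\},\\
\widehat{zz}&=2(\lambda+\mu)\left\{\frac{\sigma}{r}\frac{\partial u}{\partial r}+(1-\sigma)\frac{\partial w}{\partial z}\right\},\\
\widehat{zr}&=\mu\left(\frac{1}{r}\frac{\partial u}{\partial z}+\frac{\partial w}{\partial r}\right).
\end{aligned}\right\} \tag{17}$$

Considering a B-point (§ 4 and figure 12), the values of w will be known at the 
points i and iii, and of u and the displacement $U = u/r$ at points ii and iv. Hence the 
finite difference approximations for the terms in the first three of equations (17) 
may be deduced at such a point, and are, using a mesh length a, 

$$\frac{\partial w}{\partial z}=\frac{1}{a}(w_{\mathrm i}-w_{\mathrm{iii}}),\qquad
\frac{\partial u}{\partial r}=\frac{1}{a}(u_{\mathrm{ii}}-u_{\mathrm{iv}}),\qquad
\frac{\partial}{\partial r}\left(\frac{u}{r}\right)=\frac{\partial U}{\partial r}=\frac{1}{a}(U_{\mathrm{ii}}-U_{\mathrm{iv}}),$$

so that these stresses $\widehat{rr}$, $\widehat{\theta\theta}$, $\widehat{zz}$ may be determined. 

Similarly, at an A-point the stress $\widehat{zr}$ can be determined. 

The stresses were computed for the solid cylinder problem of § 7 and are given in 
figure 19. 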


The conjugate plane-harmonic functions 


9. The next, and perhaps both the simplest and the most striking, application of 
interlacing nets is to the determination of conjugate plane-harmonic functions. 
If $\phi$ and $\psi$ are such functions then they are connected by the Cauchy-Riemann 
equations 

$$\frac{\partial\phi}{\partial x}=\frac{\partial\psi}{\partial y}\quad\text{and}\quad\frac{\partial\phi}{\partial y}=-\frac{\partial\psi}{\partial x}, \tag{18}$$

and it is to these equations that one looks for the wanted functions. 

Using again two interlacing nets, the continuous lines for $\phi$ and the broken for $\psi$, 
consider a portion of the field (figure 20). 

In addition to the nodes of the $\phi$- and $\psi$-nets ($\phi$- and $\psi$-points), there are two other 
kinds of points where a continuous line crosses a broken line. Typical points are again 
indicated by A and B in the diagram, the points being where a $\phi$-line and a $\psi$-line 
is in the x-direction respectively. 

At the point A, labelling the four surrounding net points, which are $\phi$- or $\psi$-points, 
by 1, 2, 3, 4, with an effective mesh length a, and centred at the point A, 

$$\frac{\partial\phi}{\partial x}=\frac{1}{a}(\phi_1-\phi_3),\qquad\frac{\partial\psi}{\partial y}=\frac{1}{a}(\psi_2-\psi_4),$$

so that the first of the Cauchy-Riemann equations (18) becomes 

$$\phi_1-\phi_3=\psi_2-\psi_4,$$

or as is better for relaxation purposes 

$$F_A\equiv\phi_1-\phi_3-\psi_2+\psi_4=0. \tag{19}$$

Considering the point B, in a similar manner it can be seen that the second of 
the equations (18) may be written as 

$$\phi_2-\phi_4=\psi_3-\psi_1\quad\text{or}\quad F_B\equiv\phi_2-\phi_4+\psi_1-\psi_3=0. \tag{20}$$

From these two equations the residuals may be calculated in the usual way for 
the existing values of $\phi$ and $\psi$, but, and here is the new feature, the residuals are not 
at the nodes of the $\phi$- and $\psi$-nets but at the A- and B-points. Once this idea is 
accepted it will be found that there are manifest advantages in this method. 
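
The residuals (19) and (20) can be sketched as follows. This is an illustration only: it assumes that the points 1, 2, 3, 4 around an A- or B-point lie in the $+x$, $+y$, $-x$, $-y$ directions at a distance $\tfrac12 a$, the function names are hypothetical, and the check uses the conjugate pair $\phi = x^2 - y^2$, $\psi = 2xy$, which is not taken from the paper.

```python
def residual_A(phi, psi, x, y, a):
    """Residual (19) at an A-point (x, y): phi-points lie to the right and left,
    psi-points above and below, each at a distance a/2 (assumed layout)."""
    h = a / 2
    return phi(x + h, y) - phi(x - h, y) - psi(x, y + h) + psi(x, y - h)


def residual_B(phi, psi, x, y, a):
    """Residual (20) at a B-point (x, y): phi-points lie above and below,
    psi-points to the right and left (assumed layout)."""
    h = a / 2
    return phi(x, y + h) - phi(x, y - h) + psi(x + h, y) - psi(x - h, y)


phi = lambda x, y: x * x - y * y   # real part of (x + iy)**2
psi = lambda x, y: 2 * x * y       # imaginary part of (x + iy)**2

f_a = residual_A(phi, psi, 0.7, 0.2, 0.1)
f_b = residual_B(phi, psi, 0.7, 0.2, 0.1)
# A pair that is not conjugate leaves a residual: phi = x with psi = 0.
f_bad = residual_A(lambda x, y: x, lambda x, y: 0.0, 0.7, 0.2, 0.1)
print(f_a, f_b, f_bad)
```

For the conjugate pair both residuals vanish to rounding error, while the non-conjugate pair leaves a residual equal to the mesh length.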



Figure 20. The net points. 


10 . The relaxation patterns, showing the effect on the surrounding residuals for 
a unit increment to a value of 0 or 0 are given in figure 21. 

It can be seen that each pattern moves residual in a certain direction—that for 
0 diagonally downwards from right to left, and that for 0 diagonally upwards from 
right to left. This 4 sweep* of the residual may be used in the relaxation to move 
a residual as may be required in order most speedily to ‘liquidate 1 it. Continual 
employment of this point-relaxation will eventually reduce the residuals approxi¬ 
mately to zero. The relaxation is then complete. 

Naturally this process is both laborious and tedious, and clearly it is useful to
consider the effect upon the residuals of adding a unit increment to two or more
adjacent φ- or ψ-values at the same time. Such increments are referred to as blocks,
and the process is called block-relaxation. The pattern for a block may be built
up by applying the separate patterns for φ or ψ at all the points in question.
It will then be seen that the pattern for the block preserves the ‘sweeping’ tendency
of the patterns for the individual points, and alters only the residuals at the points
immediately outside the boundary of the block. An illustration is given in figure 22,
the block consisting of ψ-points. Clearly the application of such a block will move
residuals from one region to another and so eliminate the residuals more quickly
than by point-relaxation.
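
The statement that a block alters only the residuals just outside its boundary can be checked with a short sketch. This is an illustration under assumptions, not the paper's figure: the block is taken to consist of φ-nodes on a unit square net, A-points lie on horizontal links and B-points on vertical links, and the name `residual_changes` is hypothetical. Because the residuals are linear in φ, only the increments are needed.

```python
def residual_changes(increments, n):
    """Change in the A- and B-point residuals caused by increments to phi.

    increments maps a phi-node (i, j) to the amount added there. The layout
    (phi at nodes 0..n, A-points on horizontal links, B-points on vertical
    links) is an assumption made for this sketch.
    """
    def d(i, j):
        return increments.get((i, j), 0)

    changes = {}
    for i in range(n):
        for j in range(n + 1):
            dF = d(i + 1, j) - d(i, j)   # A-point between (i, j) and (i+1, j)
            if dF:
                changes[("A", i, j)] = dF
    for i in range(n + 1):
        for j in range(n):
            dG = d(i, j + 1) - d(i, j)   # B-point between (i, j) and (i, j+1)
            if dG:
                changes[("B", i, j)] = dG
    return changes


n = 8
block = {(i, j): 1 for i in range(3, 6) for j in range(3, 6)}  # 3 x 3 block
changes = residual_changes(block, n)

# Every altered residual lies on a link that crosses the edge of the block.
for (kind, i, j), value in sorted(changes.items()):
    print(kind, i, j, value)
```

Links lying wholly inside the block receive equal and opposite contributions and are unchanged, so the block shifts residual across its edges without creating any net residual.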


Figure 21. The relaxation patterns (φ-pattern and ψ-pattern).


Figure 22. Block-relaxation.


11. In conformal transformation, and indeed in all cases where the conjugate
functions are required, the values of both functions cannot be specified all round
the boundary. Consider as an example the conformal transformation of a circular
sector (figure 23), which will be solved later by the relaxation technique (§ 12). If
the outer radius of the sector is R, the inner radius 1, and the angle of the sector
¼π, then conditions can be imposed on the values of φ or ψ on three sides of the
boundary, but not on the fourth.

With the conditions φ = 0 on BC, ψ = 0 on AB and ψ = 1000 on DC, the analytic
solution can be shown to be, in polar co-ordinates,

$$\phi = \frac{4000}{\pi}\log_e\frac{r}{R} \quad\text{and}\quad \psi = \frac{4000}{\pi}\,\theta,$$

so that the value on the fourth side, the arc AD where r = 1, is

$$\phi = -4000(\log_e R)/\pi,$$

and is hence a constant and completely determined. This applies to all such cases
involving the two conjugate functions: the values of the functions can be specified
on parts of the boundary only, and not all round, i.e. there must be an ‘open’ part
of the boundary where the value of one function is unspecified, but must be constant.
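
As a quick check of the analytic solution against the stated boundary conditions, the values on the four sides can be evaluated directly. This is a sketch only: the outer radius `R = 4.0` is a hypothetical value chosen for illustration, and the sector angle π/4 is the one implied by ψ = 1000 on DC in the formula for ψ.

```python
import math


def phi(r, R):
    """Analytic phi = (4000 / pi) * log_e(r / R)."""
    return 4000.0 / math.pi * math.log(r / R)


def psi(theta):
    """Analytic psi = (4000 / pi) * theta."""
    return 4000.0 / math.pi * theta


R = 4.0               # hypothetical outer radius, for illustration only
angle = math.pi / 4   # sector angle implied by psi = 1000 on DC

print(phi(R, R))                                      # value on BC (r = R)
print(psi(0.0), psi(angle))                           # values on AB and DC
print(phi(1.0, R), -4000.0 * math.log(R) / math.pi)   # value on the arc AD
```

The last line shows that φ on the ‘open’ arc is a single constant fixed by R, which is why it cannot be prescribed independently.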


Figure 23. Transformation of a sector.


The procedure adopted therefore in the relaxation technique is to lay down values 
of the functions all over the field, including the ‘open’ part of the boundary, and 
calculate the residuals by use of the equations (19) and (20). The relaxation is then 
effected by elimination of the residuals one with another where possible, and should 
residuals of one sign predominate, ‘sweeping’ these residuals towards the ‘open’ 
part of the boundary, utilizing the sweeping effect of the relaxation patterns, to which 
reference has already been made. The unknown constant value of the function on 
this ‘open’ part of the boundary is thus altered, and finally attains the correct value. 

As an illustration of the method with straight boundaries, the problem of the
electric field in a homogeneous isotropic medium of dielectric constant κ between
two long conductors of square section has been tackled (figure 24). Because of
symmetry, only one-eighth of the field, the section ABCD, was considered.

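The electric field in a medium is governed by the equations

$$\kappa\frac{\partial V}{\partial x} = \frac{\partial\chi}{\partial y} \quad\text{and}\quad \kappa\frac{\partial V}{\partial y} = -\frac{\partial\chi}{\partial x},$$

where V is the potential and χ a flux function, χ = constant giving the lines of
electric force.

Since κ is a constant, these equations may be written as

$$\frac{\partial V}{\partial x} = \frac{\partial}{\partial y}\left(\frac{\chi}{\kappa}\right) \quad\text{and}\quad \frac{\partial V}{\partial y} = -\frac{\partial}{\partial x}\left(\frac{\chi}{\kappa}\right),$$

which are formally equivalent to the Cauchy-Riemann equations.

Hence the problem resolves itself to the determination of two conjugate functions,
φ = V and ψ = χ/κ, by the method indicated above.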



Figure 24. Electric field between two square conductors. 

The potentials of the inner and outer conductors were taken as 0 and 1000 V
respectively, and clearly AD and BC are lines of electric force, hence on these lines
ψ is a constant. Since values of φ and ψ cannot be specified on all parts of the
boundary, the value ψ = 0 was taken on BC, and the side AD left ‘open’. The
squares were considered to have sides in the ratio 9:17, thus enabling AD to be a
ψ-line and AB and DC to be φ-lines. The problem thus specified presented no
difficulty at all, and the solution, values of φ = V and ψ = χ/κ, is given in figure 25.

The above problem with the sides of the square conductors in the ratio 9:17 is
rather artificial, a more practical aspect being obtained by taking the sides in the
ratio 1:2. An approximate solution to this latter problem has been obtained by
Jeffreys (1946), but the problem can easily be solved using the relaxation technique.
Referring to figure 24, the side AD, on which ψ is a constant, is now a φ-line, so that
the symmetrical property of the voltages about AD must be used, or, alternatively,
the relaxation may be effected on a field twice the size.

The symmetry may be used in the following way. Since the values of φ on the
adjacent φ-lines parallel to AD will be the same, it follows that the differences
∂φ/∂x, and hence ∂ψ/∂y, will be equal in magnitude but opposite in sign. Introducing
a row of fictitious ψ-points, therefore, the differences between these ψ-points must
be kept equal in magnitude but opposite in sign to the corresponding differences
between the ψ-points on the line just inside the field.



The relaxation can then be effected in the usual manner. This row of fictitious
points will form the ‘open’ boundary, and an increment can be put on all the points
to eliminate residuals of one kind or another. Whenever an increment is made to
a value adjacent to the boundary AD, an equal and opposite one must be made
to the corresponding fictitious point. The solution to the problem is given in figure 26.

Figure 25. Electric field between two square conductors, sides in ratio 9:17.

Figure 26. Electric field between two square conductors, sides in ratio 1:2.

It is of interest in these two problems to calculate the charge on the inner conductor
or the capacitance per unit length. This may be done by considering Gauss’s theorem
over any closed curve containing the inner conductor. For obvious reasons the
closed curves are here chosen to be squares concentric with the conductors. The
integral, ∫(−∂φ/∂n) ds, may then be evaluated around the four possible squares
each bounded by ψ-lines, using a formula for numerical integration, and the average
value used to calculate the capacitance. Since the differences are comparatively
small, the trapezium rule may be suitably applied. The values for the integrals taken
over the section considered in the relaxation for the two problems are given below;
that for the integral over the square nearest the inner conductor is given first. Hence
the capacitances per unit length (value of integral × 8 × 1/4π) may be evaluated and
are 912.0 and 832.4 respectively. These compare favourably with the calculated
values of 893.9 and 814.4 obtained by using a formula due to Bowman (1935),
entailing an error of only 2 %.


conductors
in ratio            values of integral             average

9:17        1435     1432     1432     1431        1432.5
1:2         1308     1307.5   1306     1308.5      1307.5
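
The arithmetic behind the quoted capacitances can be reproduced in a few lines. The sketch below uses only the tabulated integrals, the factor 8 × 1/4π stated in the text and the two comparison values attributed to Bowman; the function and variable names are illustrative.

```python
import math

# Integrals over the one-eighth section, nearest the inner conductor first.
integrals = {
    "9:17": [1435.0, 1432.0, 1432.0, 1431.0],
    "1:2": [1308.0, 1307.5, 1306.0, 1308.5],
}
bowman = {"9:17": 893.9, "1:2": 814.4}  # comparison values quoted in the text


def capacitance_per_unit_length(values):
    """Average the integrals and apply the factor 8 x 1/(4 pi)."""
    average = sum(values) / len(values)
    return average, average * 8.0 / (4.0 * math.pi)


results = {}
for ratio, values in integrals.items():
    average, capacitance = capacitance_per_unit_length(values)
    error = 100.0 * (capacitance - bowman[ratio]) / bowman[ratio]
    results[ratio] = (average, capacitance, error)
    print(ratio, average, round(capacitance, 1), round(error, 1))
```

Both relaxation values lie about 2 % above the comparison values, in line with the figure stated in the text.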


12. From the above it can be seen that in this type of problem attention is focused 
only on given values of the wanted functions round the boundary and not upon their 
normal gradients. This naturally simplifies the boundary conditions, and it is with 
curved boundaries that the simplification is greatest and most welcome. 



Consider, as an illustration, part of the boundary shown in the diagram (figure 27)
along which φ is a constant. (An ‘open’ boundary would entail no greater difficulty.)

Thus at the points marked with a ring in the diagram, the residuals cannot be
calculated because one or more of the required φ- or ψ-values is missing.

Whenever a φ-line cuts the boundary, let us produce it to the corresponding
fictitious point; these fictitious points will form an outer boundary in the problem.








As explained in § 2, the value at such a fictitious point may be obtained in terms
of the value at the boundary and the next two φ-points by the relation

2 l-£ 2(1-g) 


where the suffixes refer to the corresponding points in the diagram. 
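
A relation of this kind can be sketched as three-point (quadratic) extrapolation along the net line. The code below is an assumption-laden illustration and is not claimed to be the formula of § 2: the names `phi_boundary`, `phi_1`, `phi_2` and `xi` are hypothetical, with `xi` taken to be the fraction of a mesh length between the last net point inside the field and the boundary.

```python
def fictitious_value(phi_boundary, phi_1, phi_2, xi):
    """Quadratic extrapolation to a fictitious point one mesh beyond phi_1.

    phi_2 and phi_1 are the two net points nearest the boundary (phi_1 the
    nearer), and the boundary cuts the line a fraction xi (0 < xi <= 1) of a
    mesh length beyond phi_1. All names here are illustrative assumptions.
    """
    return (2.0 * phi_boundary / (xi * (1.0 + xi))
            - 2.0 * (1.0 - xi) * phi_1 / xi
            + (1.0 - xi) * phi_2 / (1.0 + xi))


def f(x):
    """A test quadratic; the extrapolation should reproduce it exactly."""
    return 3.0 * x * x - 2.0 * x + 5.0


xi = 0.4
value = fictitious_value(f(xi), f(0.0), f(-1.0), xi)
print(value, f(1.0))
```

When `xi` equals 1 the boundary coincides with the fictitious point and the function simply returns the boundary value.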

Thus the values at the fictitious φ-points may be obtained. The ψ-lines can be
produced also to fictitious points with advantage in some instances, but it can be
seen that this is necessary only when these points lie inside or on the ‘outer’ boundary
formed by joining the φ-points. The values of ψ at such points will be found by the
ordinary relaxation process, and no new features are there involved, although
residual points may then be introduced outside the normal boundary of the figure,
a typical example being indicated by a cross in the diagram (figure 27).

The example is really a test of this technique, and shows how it has been applied
to the conformal transformation of a circular sector of angle ¼π previously referred
to in § 11. The same data were used, and the inner circle, on which there was only one
point, a φ-point, was the ‘open’ boundary. Fictitious points were introduced outside
the curved boundary, and treated as explained above. The result is shown in figure 28,
and the values derived from the analytic solution of this problem, namely

$$\phi = \frac{4000}{\pi}\log_e\frac{r}{R} \quad\text{and}\quad \psi = \frac{4000}{\pi}\,\theta,$$

are given for comparison in the same figure below those obtained by the relaxation
method. It may be seen that they agree remarkably well, considering the coarseness
of the net and the singularity of the solution at the origin, the greatest error being
of the order of 5 %.



Figure 28. Conformal transformation of a sector. 


[Note. The values obtained from the analytic solution are given below those obtained by
the relaxation method.]











The quasi-plane-harmonic equation


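13. This equation, which occurs frequently in problems in applied mathematics,
is of the form

$$\frac{\partial}{\partial x}\left(\chi\frac{\partial\phi}{\partial x}\right) + \frac{\partial}{\partial y}\left(\chi\frac{\partial\phi}{\partial y}\right) = 0, \tag{22}$$

where χ is a known function of x and y, i.e. the values of the function for various
values of x and y may be found, but clearly for this purpose an analytic expression
of the function is not necessary.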

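The equation lends itself to treatment by the technique of interlacing nets in
a similar manner to the plane harmonic equation and the conjugate plane harmonic
functions. A function ψ may be introduced by the relations

$$\chi\frac{\partial\phi}{\partial x} = \frac{\partial\psi}{\partial y}, \qquad \chi\frac{\partial\phi}{\partial y} = -\frac{\partial\psi}{\partial x}, \tag{23}$$

thus satisfying the equation (22) for φ.

The equation satisfied by the function ψ is

$$\frac{\partial}{\partial x}\left(\frac{1}{\chi}\frac{\partial\psi}{\partial x}\right) + \frac{\partial}{\partial y}\left(\frac{1}{\chi}\frac{\partial\psi}{\partial y}\right) = 0, \tag{24}$$

and is also a quasi-plane-harmonic equation. It is from the equations (23) that
the two wanted functions φ and ψ are found.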

Using the two interlacing nets as before, the continuous line for φ and the broken
line for ψ, a portion of the field may be considered with its typical φ, ψ and the A-
and B-points as for the plane harmonic case (§ 9 and figure 21).

At the point A, labelling the four surrounding points 1, 2, 3, 4 with an effective
mesh length a, and centred at the point A,

$$\frac{\partial\phi}{\partial x} = \frac{\phi_1 - \phi_3}{a}, \qquad \frac{\partial\psi}{\partial y} = \frac{\psi_2 - \psi_4}{a}.$$

Hence the first of the equations (23) becomes

$$\chi_A(\phi_1 - \phi_3) = \psi_2 - \psi_4,$$

where χ_A is the value of the function χ at the point A. This equation may be written as

$$F_A \equiv \chi_A(\phi_1 - \phi_3) - \psi_2 + \psi_4 = 0. \tag{25}$$

At the point B in a similar manner, expressions for ∂φ/∂y, ∂ψ/∂x may be obtained,
and the second of equations (23) written as

$$\chi_B(\phi_2 - \phi_4) = -(\psi_1 - \psi_3) \quad\text{or}\quad G_B \equiv \chi_B(\phi_2 - \phi_4) + \psi_1 - \psi_3 = 0. \tag{26}$$

The relaxation patterns to be used are given in figure 29, and it can be seen that
the function χ is involved in only one of the patterns, that for φ, and then in a very
simple manner: the residual increment at any point is numerically the value of
the function χ at that point.
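
The residuals of equations (25) and (26), and the way χ enters the φ-pattern, can be illustrated with a small sketch. It rests on assumptions that are not taken from the paper: φ is stored at the nodes of a unit square net, ψ at the cell centres, A-points sit on horizontal links and B-points on vertical links, and the test case χ = x, φ = y, ψ = −x²/2 is a hypothetical exact solution of equations (23) chosen because central differences reproduce it exactly.

```python
def qph_residuals(phi, psi, chi, n):
    """Residuals of equations (25) and (26) on an assumed interlacing layout.

    phi[i][j] is the value at node (i, j); psi[i][j] is the value at cell
    centre (i + 1/2, j + 1/2); chi(x, y) is the known function.
    """
    F = {}
    for i in range(n):
        for j in range(1, n):
            chi_A = chi(i + 0.5, j)   # A-point at (i + 1/2, j)
            F[(i, j)] = (chi_A * (phi[i + 1][j] - phi[i][j])
                         - psi[i][j] + psi[i][j - 1])
    G = {}
    for i in range(1, n):
        for j in range(n):
            chi_B = chi(i, j + 0.5)   # B-point at (i, j + 1/2)
            G[(i, j)] = (chi_B * (phi[i][j + 1] - phi[i][j])
                         + psi[i][j] - psi[i - 1][j])
    return F, G


def chi(x, y):
    return x


n = 4
phi = [[float(j) for j in range(n + 1)] for i in range(n + 1)]        # phi = y
psi = [[-0.5 * (i + 0.5) ** 2 for j in range(n)] for i in range(n)]   # psi = -x^2/2

F, G = qph_residuals(phi, psi, chi, n)
print(max(abs(v) for v in F.values()), max(abs(v) for v in G.values()))

# A unit increment to phi changes each neighbouring residual by chi there.
phi[2][2] += 1.0
F2, G2 = qph_residuals(phi, psi, chi, n)
print(F2[(2, 2)] - F[(2, 2)], F2[(1, 2)] - F[(1, 2)])
print(G2[(2, 2)] - G[(2, 2)], G2[(2, 1)] - G[(2, 1)])
```

An increment to ψ, by contrast, changes the neighbouring residuals by exactly ±1 in this layout, since χ multiplies only the φ-differences.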





The same ‘sweep’ of residual, first indicated in the patterns for the plane harmonic
functions, is still present, and indeed most, if not all, of the technique involved there
is still applicable here. Thus, for example, block-relaxation can be used as before,
and the only residuals altered when such a block is applied are still those just outside
the applied block.

Figure 29. The relaxation patterns (φ-pattern and ψ-pattern).

Figure 30

As an illustration of the method, the problem of heat conduction in a hemisphere
composed of a central hemisphere of radius […] with a concentric shell of thickness
R, of different conductivities in the ratio 5:1, with its flat surface at 1000° C and its
curved surface at 0° C (figure 30), was considered. The problem has axial symmetry,
and using cylindrical co-ordinates the conduction of heat is governed by the equation

$$\frac{\partial}{\partial r}\left(kr\frac{\partial V}{\partial r}\right) + \frac{\partial}{\partial z}\left(kr\frac{\partial V}{\partial z}\right) = 0,$$

where k is the conductivity and V the temperature.

To make this equation non-dimensional, one may write k = […], where k_0 is the
smaller conductivity considered, and then







An orthogonal function, ψ, the flux function, may be introduced such that

$$\chi\frac{\partial V}{\partial r} = \frac{\partial\psi}{\partial z} \quad\text{and}\quad \chi\frac{\partial V}{\partial z} = -\frac{\partial\psi}{\partial r}.$$

These equations are the typical quasi-plane-harmonic equations (23), with the known
function χ = […]. Taking a mesh length a = […]R, a quadrant ABC was considered
(figure 30), the boundary conditions being V = 1000 on AB, ψ = 0 on BC, V = 0
on CA, and the open boundary a ψ-point at A. Fictitious points were introduced
outside the curved boundary CA and treated as in § 12. The relaxation involved no
great difficulty, and the solution is given in figure 31.


Figure 31. Conduction of heat in a hemisphere.


References

Allen & Southwell 1944 Proc. Roy. Soc. A, 183, 125-134.

Bowman 1935 Proc. Lond. Math. Soc. 2, 39, 205-215.

Carter 1944 J. App. Mech. 11, 162-167.

Fox & Southwell 1945 Phil. Trans. A, 239, 419-460.

Jeffreys 1946 Methods of mathematical physics. Cambridge Univ. Press.

Kron 1944 J. App. Mech. 11, 149-161.

Love 1927 The mathematical theory of elasticity, 4th ed. Cambridge Univ. Press.

















The Wellcome Research Institution 

By C. H. Kellaway, F.R.S., Director-in-Chief
(Lecture delivered 5 February 1948; MS. received 3 March 1948)

[Plates 5 to 8] 

The Wellcome Research Institution is the name which Sir Henry Wellcome used to 
describe collectively the Museums and the various research undertakings of the 
Wellcome Foundation Ltd. Before outlining the development of the individual 
units I must first briefly relate the history of the Wellcome Foundation itself. 

The firm of Burroughs Wellcome and Co. was founded in London in 1880 by two
young American pharmacists, Silas M. Burroughs and Henry S. Wellcome. This
proved a most successful undertaking, and after the death of Burroughs in 1895
Wellcome became the sole owner of a flourishing business with commercial offices
in Snow Hill, chemical works at Dartford and the beginnings of his earliest research
enterprise, the Wellcome Physiological Research Laboratories.

The next 30 years saw the development not only of the business in this country 
but of associated houses in U.S.A. and other parts of the world, and in 1924 
Wellcome consolidated all his interests in a single company, the Wellcome 
Foundation Ltd. This fusion was done with a deliberate purpose later to be revealed 
by his will. Within the framework of the business itself he had already developed 
the laboratories for scientific research and the educational museums that I shall 
presently describe. Beyond these the will provided for scientific work outside the 
business itself, for, on Wellcome’s death in 1936, all the shares in this Company 
were vested in five trustees who were to receive the whole of the distributable 
profits and use them in certain specified ways for the advancement of research in 
medicine and its related subjects anywhere in the world. 


The Wellcome Physiological Research Laboratories 

When diphtheria antitoxin first came into use Mr Wellcome realized its
importance and in 1895 set up laboratories in London for its manufacture. In 1898
these were moved to Brockwell Hall, Herne Hill, which, with 10 acres of ground,
stabling and other buildings, provided suitable accommodation not only for the
manufacture of the new serum but for the more ambitious research projects Mr
Wellcome had in mind. These were foreshadowed by the name he gave the
laboratories in 1895.

The research history of this unit did not commence until the beginning of the 
present century, when, in spite of opposition by the Royal Colleges of Physicians 
and Surgeons, the premises at Brockwell Hall were registered as a place where 



experiments on living animals might be performed. This was the first instance of 
such recognition for laboratories attached to a commercial firm. 

During 52 years this unit has had three periods of active research interrupted by 
two world wars, during each of which nearly all research, except what was essential 
to production, ceased. During the first period from 1900 to 1914 the laboratories 
were directed first by Dr Walter Dowson till 1906 and then by Dr H. H. Dale till 
the outbreak of the first world war. 

Dr Walter Dowson, though not himself an experienced scientific worker, was 
a capable administrator who succeeded in attracting young and well-trained 
scientific personnel and in creating good conditions of work for them. Among the 
early members of the scientific staff were a number of men who subsequently 
became Fellows of this Society—George Barger, H. H. Dale, A. J. Ewins, A. T. 
Glenny, P. P. Laidlaw and John Mellanby, and at a somewhat later period, 
Harold King and J. H. Burn.

John Mellanby was the first pharmacologist appointed to the laboratories and 
spent several years there in studying the globulins and other proteins from horse 
plasma and serum. He described a method for concentrating diphtheria antitoxin 
without denaturation of the protein by treating the serum with alcohol at low 
temperatures—a principle which has recently been brilliantly developed and 
extended by Cohn and his colleagues at Harvard.

A. T. Glenny, who only last year retired from the staff, controlled the production 
of antisera at the laboratories for about 40 years but found time to utilize for 
research the wealth of material which was available to him as well as to carry out 
well-planned experiments on immunity. 

The association of Barger, Dale, Ewins and Laidlaw during these early years was 
a most fruitful one. When Dale joined the staff as pharmacologist in 1904 Barger 
had already commenced the investigation of ergot which not only yielded valuable 
direct results but led to studies of even greater importance. The alkaloid, ergotoxine, 
was discovered through the recognition of its curious action of adrenaline reversal. 
Histamine, which in the intervening years has proved to be of great interest in 
relation to anaphylaxis and various shock-like conditions and in the immediate 
reaction to injury, was isolated from extracts of ergot and its pharmacological 
action was analyzed by Dale and Laidlaw. Other amines also engaged attention, 
and Dale and Barger made an extensive study of the relation of their chemical 
structure to sympathomimetic action. 

The action of ergot in causing strong contraction of the uterus led Dale to the 
study of another powerful oxytocic present in extracts of the posterior lobe of the
pituitary gland. This work formed the basis for the method of standardizing such
extracts described by Dale and Laidlaw which is still in general use.

The observation that a particular ergot extract had peculiar properties led to the 
identification in it of acetylcholine by Ewins. The discovery by Dale of the 
muscarine- and nicotine-like activities of this substance, by which it simulated
effects of different groups of efferent nerves, provided the foundation for studies 





later resumed by Dale and his colleagues at the National Institute; these, with the 
researches of Otto Loewi, played a formative part in the modern conception of the
chemical transmission of nervous stimuli. 

Other important researches were the examination of the anaphylactic response 
of the uterus of the sensitized guinea-pig by Dale and investigations of the hydrogen- 
ion concentration of biological fluids by G. S. Walpole, who first organized the
concentration of antitoxin at the laboratories. 

The scientific staff at this time was small, less than a dozen, but they were a very 
active group, and during this period about 100 papers, many of them of classical 
importance, were published. 

The year immediately preceding the outbreak of the first world war broke up this 
happy company. Laidlaw went to the professorship of pathology at Guy's and Dale 
joined the research team which was being built up by the Medical Research Council, 
taking with him Barger and Ewins. Dale was succeeded by Dr R. A. O’Brien who, 
with a greatly reduced scientific staff, was confronted by the wartime problem of 
a large increase in production. During the war the production demand was fully 
met and large quantities of tetanus and diphtheria antitoxins, antigangrene sera 
and typhoid vaccines were made available for the forces. 

Shortly after the war Mr Wellcome bought a property of rather more than 
100 acres at Langley Court, Beckenham (figure 2, plate 5), and the move there
was made in 1922. During the period between the two world wars the scientific 
staff gradually increased in number, reaching a total of 28 in 1939. Of the staff in this 
interwar period three more were later to become Fellows of this Society—Percival 
Hartley, J. H. Gaddum and J. W. Trevan. 

Between 1921 and 1939 over 300 papers were published. Glenny now had his 
most productive period. He had already discovered the difference in effect between 
primary and secondary stimuli in active immunization, though the publication of 
this work had been delayed, and he now contributed year by year a series of papers 
on active and passive immunization mainly concerned with the response to 
diphtheria toxin. With Pope and Miss Hopkins he was early in the field in the 
conversion of toxin to toxoid by formaldehyde, and he and his colleagues discovered 
the advantageous effect of precipitating toxin with alum. Alum-precipitated toxoid 
(a.p.t.), and indeed other alum-precipitated toxoids, have since been very widely 
used for immunization. 

O'Brien was the first in this country to carry out Schick testing and was a pioneer 
advocate of active immunization against diphtheria. Dalling’s work contributed
much to the control of lamb dysentery, a disease which formerly killed many
thousands of lambs each year in Britain. He identified the causative organism and
showed that the pregnant ewe could be actively immunized by vaccination and
could transmit effective immunity to its offspring. With Glenny he elaborated an
antitoxic serum for the passive protection of newly born lambs.

Hartley during 1920 and 1921 developed a medium for the production of diphtheria 
toxin which has found worldwide use and made possible uniform production of 




potent diphtheria toxin. This work has been the basis of the great improvements in 
toxin production which have been made since that time. He was also a pioneer in
the rapid drying at low temperatures of sera and other protein solutions, developing 
a method which he found most useful during the next 25 years when he was in 
charge of the Standards Laboratory at the National Institute for Medical Research. 

Dalling made generally available the method of immunization against dog 
distemper which has been developed by the researches of Laidlaw and Dunkin at 
the National Institute for Medical Research. 

Pope made an important contribution in developing a method for concentrating 
antitoxic sera which depended upon preliminary digestion and heat denaturation of 
unwanted proteins. The use of this and other methods of serum concentration has 
greatly diminished the incidence of serum sickness after the therapeutic use of
antitoxin. 

During this period much attention was directed to accurate methods of
standardization of therapeutic substances, and Trevan, in 1927, published an important
paper on the statistical basis of determinations of the lethal dose and other 
biological end-point reactions, which has had a wide influence on the design of 
methods of bio-assay. 

During the period immediately before the second world war Buttle and his 
associates were actively engaged in studies on chemotherapy, particularly on 
sulphone derivatives and sulphonamide. 

During the early part of the recent war, under the direction of Dr R. A. O’Brien 
and after 1941 under Dr J. W. Trevan, the laboratories were faced with an even 
larger production demand than in the first world war. Vast quantities were made 
and issued of tetanus and gas gangrene antitoxins, tetanus toxoid, and the major 
part of the diphtheria prophylactics used by the Ministry of Health in its free 
immunization campaign. Additional laboratory accommodation was needed and 
the L.C.C. made available their laboratories at Carshalton and Belmont. Over
1000 horses had to be maintained for the manufacture of sera and stabling
accommodation was acquired in various parts of the country, some even so far afield as
Doncaster. When the Veterinary Research Station was established at Frant, in 
Sussex, in 1943 some further stabling accommodation had to be provided there. 

During the latter part of the war a large mechanical surface culture plant was 
set up at Beckenham for the manufacture of penicillin, but in 1945, when it became 
apparent that manufacture by the bottle process could not compete economically 
with deep culture, its use was discontinued. Before it was closed down some 
streptomycin was produced for experimental purposes in a portion of the plant. 
Work on antibiotics was now confined to search for new ones in a small unit at 
Belmont, which had been used for the manufacture of penicillin, and later in the 
main laboratories at Beckenham. 

At the end of the war diminished production again made possible increased 
research activity and there has been a large increase in the number of papers 
issuing from these laboratories. The most important recent work of which mention 





should be made are further studies on immunity, on the toxins of the Cl. welchii
group, on the pharmacology of ‘Sulphetrone’ and streptomycin, work on the
production of diphtheria toxin by deep culture, the development of a new method of
refinement applicable to antibacterial sera which depends on precipitation with
ammonium sulphate in the presence of tricresol, and studies on ‘Aerosporin’, a new
antibiotic which covers part of the bacterial spectrum not adequately covered by 
either penicillin or streptomycin. Aerosporin appears not readily to allow of the 
production of resistant bacterial strains, and gives promise of real value in the 
treatment of whooping cough and of some other diseases caused by Gram-negative 
organisms. 

The Wellcome Veterinary Research Station 

By the end of 1943 it had become apparent that the property at Beckenham was 
not adequate for the satisfactory breeding of the large number of small animals 
needed for testing and research, and a farm of more than 300 acres was purchased 
at Ely Grange, Frant (figure 7, plate 8). Here small animals could be bred under 
isolated conditions, particularly ferrets for developmental research on distemper 
products. Here too the green food needed for stock animals and those under 
experiment at Beckenham could be grown and a suitable place was provided 
for the investigation of diseases of larger animals, particularly of sheep and cattle. 
The scientific staff, under the direction of Dr R. F. Montgomerie, has ranged from 
three to six in number and the chief problems so far studied there have been 
mastitis in milking herds, bovine infertility, problems connected with small animal 
breeding and the study of immunity against louping ill. At the end of 1946 
a temporary Artificial Insemination Centre was set up there in collaboration with 
the Milk Marketing Board. 

The smooth development of the Frant farm as a veterinary research station
suffered somewhat from the war enterprise given the secret name of the ‘Tyburn’
operation, which was carried out there in 1945. By the end of 1944 it had become
apparent that scrub typhus might be a serious menace to jungle operations in
south-east Asia and elsewhere, and the War Office decided to have made on a large
scale the scrub typhus vaccine which had been devised by Fulton and Joyner at
the National Institute for Medical Research using the lungs of cotton rats infected 
with the causative rickettsiae. The Ministry of Supply with the advice of the
Medical Research Council requested the Wellcome Foundation to undertake this 
task and the Medical Research Council seconded to us Dr M. van den Ende to take 
charge of the scientific side of the operation and direct the developmental and other 
researches which were carried out during its progress. The manufacture of the 
vaccine involved the importation by air from America of thousands of cotton rats
and the simultaneous breeding of large numbers of these animals in this country. 
Ely Grange, Frant was selected for the enterprise as being sufficiently isolated and 
as providing accommodation for the workers engaged in a difficult and dangerous 
project. The scientific personnel, nine in number, were partly civilian and partly 




officers of the R.A.M.C. and the A.T.S. and the sixty technical personnel were 
R.A.M.C. laboratory technicians and A.T.S. who volunteered for the work. 

The laboratories were built in 109 days and the preparation of vaccine was 
started on 1 May 1945. Between that date and 31 October, when manufacture was 
discontinued, 300 1. of vaccine, enough to vaccinate 100,000 men, were made. This 
was a highly dangerous technical operation involving inoculation of cotton rats by 
insufflation with a seed suspension of rickettsiae from the lungs of mice infected in 
the same way. The rats were kept for several days and died or were killed, the lungs 
being harvested and emulsified to make the vaccine which was treated with 
formalin to kill the rickettsiae. The workers engaged in the manufacture were 
vaccinated and the most dangerous operations were carried out in cabinets ventilated
by suction, the air from these being sterilized by special furnaces on the roof
of the laboratories. A good many other precautions were used—the air in the 
laboratories was treated by aerosols of lactic acid and very carefully controlled 
procedures were laid down to minimize risk. Despite these precautions, four 
workers were accidentally infected. All had, of course, been protected by vaccination 
and fortunately made uneventful recoveries. 

The Wellcome Chemical Research Laboratories 

These Laboratories were founded in London in 1896 and remained at 6 King 
Street, Snow Hill for 35 years. In 1923 they were transferred to the new building 
of the Wellcome Research Institution at 183 Euston Road where a portion of the 
staff is still housed as part of the Laboratories of Tropical Medicine (figure 5, plate 7). 

When the Laboratories were founded, Mr Wellcome persuaded his friend, F. B. 
Power, to come over from America and take charge of them. During Power’s 
directorship work was mainly concentrated on increasing systematic knowledge of 
the chemical constituents of plants used in medicine, and a wide range of these 
was investigated. During this period in the development laboratories at Dartford 
Dr H. A. D. Jowett carried out his work on the constitution of pilocarpine which was 
shown to contain a glyoxaline nucleus, Dr F. L. Pyman synthesized two other 
physiologically important glyoxaline derivatives, histidine and histamine, and 
Jowett and Pyman did their work on the tropine esters. 

When Pyman followed Power as Director of the Chemical Research Laboratories 
in 1914 attention was focused on synthetic drugs and alkaloids. Pyman continued
the fruitful studies commenced at Dartford on the ipecacuanha alkaloids. Dr 
Harold King carried out his important work on the resolution of the optically 
active components of hyoscine. The Laboratories during the first world war were 
also faced with the problem of devising means of replacing German synthetic drugs, 
the most important of which was arsphenamine. 

When Dr Henry became Director in 1919 the main interest of the Laboratories 
was turned towards the chemotherapy of tropical disease. Some synthetic 
compounds were prepared for the treatment of hookworm and the investigation of



The Wellcome Research Institution 


441 


chenopodium oil which was being extensively used for this purpose showed that its 
only active component was ascaridole. Work on the antimony compounds for the 
treatment of kala azar resulted in the synthesis of the N-glucoside of p-aminophenyl 
stibonic acid (‘Neostam’). Just before the second world war a new pentavalent 
antimony compound was devised in Germany for the treatment of kala azar, and 
early in the war the composition of this drug was determined and a process designed 
for its manufacture. 

Work on antimalarials included an extensive investigation of the alkaloids of the 
genus Alstonia and a study, carried out on behalf of the Malaria Commission of the 
League of Nations, of the composition and standardization of the quinine alkaloids. 

The 4-4'-diaminodiphenylsulphone derivative, ‘Sulphetrone’, which shows
promise in the treatment of leprosy was also synthesized in these laboratories. 

Meanwhile at the Development Laboratories at Dartford, under Dr Sydney 
Smith’s direction, research on ephedra led to the isolation of two new alkaloids. Here 
too Dr Smith isolated digoxin, the pure crystalline glucoside of Digitalis lanata . 

When Dr Henry retired in 1944 Dr Smith took charge of the main laboratories 
which were to be established at Beckenham when the new building (figure 6, 
plate 7) was completed. In the meanwhile accommodation was found for Dr Smith 
and three of his colleagues in one of the more modern laboratories at Langley Court
where this team worked on penicillin synthesis making an important contribution 
in the isolation of penillic acid. The move into the new building took place in 1946, 
a portion of it being used to house part of the chemotherapeutic testing unit. 

The new laboratories have been working for too short a period for more than the 
most general report of their activities to be made. A number of new phenanthridinium
compounds, active against more than one species of trypanosome, have
been synthesized, interesting studies have been made on the chemistry of the new 
antibiotic, Aerosporin, and work on substitutes for d-tubocurarine and on analgesics 
and antiasthmatics has been in progress. 

The Wellcome Laboratories of Tropical Medicine 

Mr Wellcome’s first venture into tropical research was conducted to some extent 
vicariously. In 1901 he offered to the Gordon Memorial College, Khartoum, 
equipment for chemical and bacteriological laboratories, the College being responsible
for the building and for the selection and support of the scientific staff. The
Wellcome Tropical Research Laboratories so founded continued until 1935 when 
they were closed down by the Sudan Government, the space occupied by them being 
needed for other purposes. The first Director of these Laboratories was Dr Andrew 
Balfour. In 1907 Mr Wellcome provided a floating laboratory to carry research to 
regions which were otherwise not readily accessible and Dr C. M. Wenyon was 
placed in charge of this. 

In 1913 the Wellcome Bureau of Scientific Research was founded with Andrew 
Balfour as Director and as Director-in-Chief of all the various research units which
Mr Wellcome had set up. The Bureau was to be a centre in London for the study of
tropical disease with the important function of supplying information to workers 
abroad, and its staff included C. M. Wenyon as protozoologist, A. C. Stevenson as 
pathologist and M. E. MacGregor as entomologist. 

Shortly after the Bureau had been established the first world war broke out and 
the Bureau was placed at the disposal of the War Office. It played an important 
part in training medical officers in tropical disease and Dr Balfour wrote the well-known
‘Memoranda on Medical Diseases in the Tropical and Sub-Tropical War
Areas’. Balfour and Wenyon spent most of the war years at various fronts mainly
acting as consultants ; and Wenyon did his important work on amoebic dysentery 
in Egypt and on malaria in Macedonia. 

During the war the development of the Bureau was necessarily inhibited, but 
when the war ended its staff was increased and in 1920 it moved from Henrietta 
Street to more commodious premises at Endsleigh Gardens. From this time till 
the present it has had generally 9 to 11 senior scientific workers on its staff. 

In 1923 Balfour resigned, becoming Director of the newly-created London School 
of Hygiene and Tropical Medicine, and Wenyon succeeded him as Director of the 
Bureau of Scientific Research and Director-in-Chief of the Wellcome Research 
Institution. The policy was followed of seconding workers to various laboratories 
abroad; for example, in 1927 C. A. Hoare went to work on trypanosomiasis in the 
Sleeping Sickness Laboratory at Entebbe, and later G. M. Findlay made numerous 
ventures in different parts of the world to study yellow fever and other diseases. 
The Bureau was also available for guest workers, and it is of interest that in 1928 
Dr E. Hindle began there his work on yellow fever for the Yellow Fever Commission
of the Colonial Office, in which he carried out the first preventive vaccinations on 
monkeys with formalinized virus. 

In 1932 the building in the Euston Road (figure 1, plate 5) was erected to house 
the Museums, the Bureau and the Chemical Research Laboratories. In their new 
quarters work was now begun on virus diseases, and on leptospiral infections. The 
manufacture of yellow fever vaccine on a laboratory scale was started by G. M. 
Findlay, who had visited the United States to study the methods of research and 
manufacture at the Rockefeller Institute. Numbers of service personnel were 
inoculated year by year and, when the second world war broke out, free inoculation
was continued on a large scale subsidized to some extent by the Colonial Office 
until the end of the war. 

The problem of infective hepatitis and of serum jaundice had in the intervening 
years engaged the attention of workers at the Bureau and during the war F. O.
MacCallum was seconded to work at Cambridge on a special enquiry concerning 
infective hepatitis, financed by the Medical Research Council. 

In 1944 Dr Wenyon retired and the departmental controls were reorganized. 

I joined the staff as Director-in-Chief and Dr N. H. Fairley was invited to become 
Director of the Laboratories of Tropical Medicine and also professor in the new Chair 
of Tropical Medicine founded by the Wellcome Trustees at the London School of
Hygiene and Tropical Medicine. Fairley, however, found himself so fully occupied
by his professorial duties that he resigned and was succeeded late in 1946 by
Brigadier J. S. K. Boyd who had just retired from the directorate of pathology at
the War Office.

The Bureau of Scientific Research, or to use its present name, the Wellcome 
Laboratories of Tropical Medicine, during its 34 years' existence has had a distinguished
record in research in tropical disease, and 8 books and more than 560
papers have been contributed by members of its staff, the most important being 
C. M. Wenyon's classical Manual of Protozoology published in 1926. 

The main problems on which work is now in progress concern bacteriophages, the 
tissue culture of protozoa, the serology of leptospirosis and the chemotherapy of 
tropical diseases (figure 4, plate 6). 

The policy of sending workers abroad is being continued—Dr J. M. Watson having 
spent most of last year in Egypt studying bilharzia, a member of the Egyptian 
State Health Service having been an exchange guest worker in the laboratories 
here. 

Closely associated with the Bureau are the Entomological Field Laboratories 
which were opened at Wisley in 1920 by Mr M. E. MacGregor. In 1921 the laboratories
were moved to new premises at Esher in Surrey where for the last few years
Mr B. Jobling has been carrying on his beautiful anatomical and histological studies 
on the blood-sucking diptera. At the present time when other large and well- 
equipped laboratories for entomological study are available in this country there 
seems little justification for the continuation of this small unit. 

The Wellcome Museum of Medical Science 

The Museum of Medical Science arose primarily from the interest of Andrew 
Balfour in teaching museums. He had started such a museum at Gordon College 
and as soon as he took charge of the Bureau in 1913 he commenced to develop one 
there. The real history of the Museum began when Dr S. H. Daukes, who had been 
appointed Curator in 1919, became Director of the Museum and in 1923 extended 
its scope to include diseases of temperate as well as tropical climates with the object 
of presenting an up-to-the-moment picture of advances in medicine. 

The Museum grew rapidly and in 1932 three spacious galleries (figure 8, plate 8) 
were provided for it in the new building in Euston Road. Here Dr Daukes set it 
out in admirable fashion and secured a wealth of material illustrating the aetiology, 
symptomatology, pathology, treatment and prevention of disease. There were also 
sections on general and special pathology. The Museum was increasingly used by 
students, doctors, and nurses and others and in the year before the second world 
war began it had over 10,000 visitors. 

After the outbreak of the second world war, when it appeared likely that London 
would be bombed, the Museum was dismantled and the valuable collection stored. 
In 1945, when this threat was removed, it was possible to re-establish it in part
despite considerable damage to screens and windows caused by neighbouring bomb
explosions. A beginning was made with the East wing, in which much of the
tropical material is set up, and Dr Daukes was able to complete this before he 
retired at the end of 1945. He was succeeded by Dr C. J. Hackett, who, before 
continuing the establishment of the Museum, decided to modernize its presentation. 
The system introduced by Daukes could not be bettered, but improvement in 
lighting, with re-arrangement enabling visitors more readily to grasp the essential 
features, will undoubtedly increase the value of this much-used collection. It will 
unfortunately be several years before the Museum is complete and up-to-date in 
every part. 


The Wellcome Historical Medical Museum 

The story of the development of this museum is really a part of the story of 
Henry Wellcome's life. In 1905, having arranged his affairs so as to obtain more
leisure, he started collecting all kinds of material connected with the history of 
medicine, and the idea of creating a great museum of medical history became his 
keenest interest till the end of his life. 

In 1913, he found housing for his collection in Wigmore Street, and displayed 
there much of his best material. The collection, which was used for a number 
of important exhibitions, remained there till 1932, when the new building in 
Euston Road was ready for its reception. From 1924 onwards Sir Henry Wellcome 
devoted increasing time and energy to the collection of material. After the move 
to the new building the Museum was not at once opened, but the work of sorting 
the material was begun. Sir Henry Wellcome, who had just been elected a Fellow of 
this Society, had hoped finally to settle the layout, but he died in 1936 with his 
plan for this still uncompleted. 

In his lifetime Sir Henry Wellcome himself was Director of this Museum and it 
was not till 1943 that Dr Daukes, who till his retirement in 1945 acted as Director 
of both Museums, proposed a plan for its re-arrangement when this could again be 
undertaken. At the end of 1945 this plan was in the course of being put into effect. 
In 1946 Dr E. Ashworth Underwood, a distinguished scholar of medical history, 
was appointed Director in succession to Dr Daukes and made plans to open not 
only the Museum but also the Library of 150,000 volumes, which till then had not 
been available for consultation, the work of cataloguing it being only about two-thirds
completed.

Unhappily last year the Museum had a further and most disappointing setback, 
which will delay for some years its establishment in the building in which Wellcome 
had planned to house it. In 1940 when the central offices of the business at Snow 
Hill were completely destroyed it was necessary that its administrative staff should 
be given immediate accommodation within the building in Euston Road. This was 
not a serious matter during the war, since considerations of safety of the valuable 
material limited the use of the galleries as a Museum. In due course premises were 
found on short lease for housing most of the business and administrative staff, in
the expectation that at the end of the war new accommodation could be built to
take the place of the offices destroyed by enemy action. 

When in 1947 the temporary lease of these premises came to an end, prolonged 
and active search during two years through the whole of London for suitable 
premises having failed and the prospect of building new offices being removed to 
an uncertain future, nothing remained but to house the business activities in the
building in Euston Road and to put most of the Museum material into store, until 
such time as the galleries once more become available for its display. 

Fortunately, however, the Wellcome Trustees have been able to secure accommodation
to house Dr Underwood and the Museum staff, and to afford them
opportunity to continue their researches, as well as to give limited public exhibitions 
of valuable parts of the collection from time to time. The Library, though not 
ideally set up, remains in the Euston Road building and will be available for 
consultation by scholars interested in the history of medicine. 

Having now traced the history of this rather complex research organization 
which Henry Wellcome built up it remains to get its structure into perspective and 
to indicate the causes of its early success. Wellcome was an autocrat and during his 
lifetime rigidly controlled the organization in the directions he desired. He early 
recognized the importance of scientific research for the development of his business 
interests and in the process gained an enthusiasm for the application of science in all 
three of the ways Sir Edward Appleton has recently described, ‘applied research, 
objective fundamental research and free fundamental research’. The scientific 
workers whom he engaged were allowed to work largely on subjects of their own 
choice although he sometimes suggested topics in which he was particularly 
interested. They were also at liberty to publish the results of their researches and 
were thus not detached by rules of secrecy from their academic colleagues. This, 
fifty years ago, must have been an unusual situation to be occupied by men who 
had been attracted to industrial laboratories. The application of this policy was in 
the main responsible for the scientific distinction of the workers in the laboratories 
and for the high reputation of the laboratories themselves. 

Wellcome’s interest in education resulted in the development of the Museum and 
Library of Historical Medicine, the Bureau of Tropical Medicine and the Museum 
of Medical Science. The close collaboration of these last two units with the Tropical 
Diseases Hospital and the London School of Hygiene and Tropical Medicine has 
helped to make London an important centre for the study of tropical disease. 

Though these educational units were within the framework of the business, 
Wellcome made some attempt to set them apart when he designed the great building 
in Euston Road to house them, but the second world war and its aftermath have 
prevented the smooth development of his large conception. 

In the meanwhile the research organization must be directed towards increasing 
the prosperity of the business and the funds available to the Trustees for the broad 
purposes indicated by the Founder’s will. In this country the Foundation has 
extensive and well-equipped laboratories and at the present time has a staff of
about 80 graduate scientific workers, a large number of skilled technicians and
many young men and women who are acquiring university degrees while they are 
engaged as technical assistants. This does not include the staff of the Chemical and 
Pharmaceutical Development Laboratories at the Works at Dartford who are 
responsible, under the direction of Dr Denis Wheeler, for the improvement of the 
processes in operation there as well as for development to the stage of production 
of projects handed over by the research staff. On the biological side there is no 
segregation of development and a good many of the scientific workers at the 
Beckenham laboratories are engaged in production of biological products for the 
improvement of which continuous developmental work is required. 

As the research organization increased in size there was until recent years some 
tendency for units, and even for departments within units, to work independently. 
This has been discouraged and a degree of plasticity has been achieved which
enables a number of workers to be concentrated on any urgent problem. Our
current researches are co-ordinated with those being carried out in the research
unit in our New York company and each part of the organization is being made 
aware of what is going on elsewhere. This important objective is being achieved by 
the circulation of regular reports on the progress of research projects, by visits 
between the units and by annual research meetings, attended by all the workers 
at which the current researches are reviewed and new projects indicated. In 
addition to these more general meetings the workers engaged in each project meet 
for frequent consultation and from time to time their projects are reviewed more 
formally, in the light of the current project reports, by myself and the heads of the 
research units concerned. 

Publication remains free and is encouraged, and, though the bulk of work done 
is directed specifically towards the discovery of new therapeutic agents and therefore
comes into the category of applied research, fundamental studies of the
objective type form an important part of our programme. 

Description of Plates 5 to 8
Plate 5

Figure 1. The building of the Wellcome Research Institution, 183 Euston Road, N.W. 1.

Figure 2. Aerial view of the Wellcome Research Laboratories, Langley Court, Beckenham,
Kent. Wellcome Physiological Research Laboratories 1 to 9 and 11. 1, Penicillin
building; 2, Administration block; 3, Biochemistry laboratories; 4, Pharmacology
laboratories; 5, Immunology laboratories; 6, Veterinary department; 7, Bacteriology
laboratories; 8, Media Production Unit; 9, Chemotherapy section; 11, Veterinary stables;
10, Wellcome Chemical Research Laboratories.

Plate 6

Figure 3. A laboratory for chemotherapy of tuberculosis at the Wellcome Physiological
Research Laboratories.

Figure 4. A laboratory for chemotherapy of tropical diseases in the Wellcome Laboratories
for Tropical Medicine.



[Plates 5 to 8: Kellaway, Proc. Roy. Soc. A, volume 193; figures 1 to 8.]

Plate 7 

Figure 5. A chemical laboratory in the Wellcome Laboratories for Tropical Medicine.
Figure 6. The Wellcome Chemical Research Laboratories, Langley Court, Beckenham.

Plate 8

Figure 7. Aerial view of the Wellcome Veterinary Research Station, Ely Grange, Frant,
Sussex. 1, Farm buildings; 2, Ely Grange; 3, ‘Tyburn’; 4, Small animal breeding units;
5, Stables.

Figure 8. A portion of the Wellcome Museum of Medical Science at 183 Euston Road, N.W. 1.


The electronic structure of conjugated systems. 
Parts III and IV 

III. Bond orders in unsaturated molecules 

By C. A. Coulson, King's College, London and H. C. Longuet-Higgins,

Balliol College, Oxford

(Communicated by R. P. Bell, F.R.S. Received 7 August 1947)


This part is concerned with the changes in bond order that take place in a conjugated system
when the resonance integral of a bond, or the Coulomb term of an atom, is altered. It is shown
that the sign of the mutual polarizability of two bonds in a chain depends upon whether
there is an even or odd number of other bonds between them. Numerical values are given for
mutual bond polarizabilities in some important hydrocarbons, and it is found that in the
polyenes the effect of a perturbation decreases rapidly with distance along the chain. The
effect of hetero-substitution on bond orders is considered, and the results compared with the 
predictions of the qualitative resonance theory. 


Introduction 

The definitions of mobile bond order and total bond order given in part I were
originally introduced by Coulson (1939), who calculated bond orders for a number
of unsaturated hydrocarbons and found a smooth relation between bond order and
bond length. In this paper we shall be primarily interested in changes in bond order,
consequent upon changes in Coulomb terms or resonance integrals. Such changes
occur when, for example, a molecule vibrates, owing to the variation of resonance
integral with bond length, or when one or more atoms in a molecule are altered, as
in chemical substitution. In this investigation we shall use the notation and nomenclature
given in part I of this series; in particular, the concept of mutual bond
polarizability is helpful when we are discussing changes in bond order arising from
small changes in bond length. It is therefore convenient to begin by establishing
a few general results about mutual bond polarizabilities: in parts IV and V these 
results will be applied quantitatively to particular problems. 

Variation of bond orders with resonance integrals 

In the previous part it was shown that in a rather special type of hydrocarbon— 
conveniently described as an alternant hydrocarbon—the mutual polarizability of 
two atoms was positive or negative according as the atoms were separated by an 
even or odd number of unsaturated bonds. A similar result will now be proved 
about the mutual polarizability of two bonds in an alternant hydrocarbon. 

Let $\Delta^{(n)}$ denote the secular determinant for an alternant hydrocarbon with $n$
unsaturated carbon atoms. Then if $n$ is even, $\Delta^{(n)}(iy) = \Delta^{(n)}(-iy)$, by equation (6),
part II. Therefore if $y$ is real, $\Delta^{(n)}(iy)$ is real. But $\Delta^{(n)}$ has no zeroes on the imaginary
axis, by hypothesis; therefore $\Delta^{(n)}(iy)$ is constant in sign. But the first term in
$\Delta^{(n)}(iy)$ is $(-iy)^n$. Therefore if $n = 4p$, $\Delta^{(n)}(iy)$ is real and positive, whereas if
$n = 4p + 2$, $\Delta^{(n)}(iy)$ is real and negative. On the other hand, if $n$ is odd, $\Delta^{(n)}(iy)$ is $iy$
times some real polynomial in $y^2$, since $\Delta^{(n)}(iy) = -\Delta^{(n)}(-iy)$. Now $\Delta^{(n)}(iy)$ is zero
at the origin, but nowhere else on the imaginary axis; therefore the real polynomial
is constant in sign. But the first term of $\Delta^{(n)}(iy)$ is $(-iy)^n$; therefore if $n = 4p + 1$,
$\Delta^{(n)}(iy) = -iyP$, where $P$ is a real positive function, whereas if $n = 4p + 3$,
$\Delta^{(n)}(iy) = iyP$, where $P$ is real and positive. To summarize, $\Delta^{(n)}(iy) = P$, $-iyP$, $-P$
or $iyP$ according as $n \equiv 0$, 1, 2 or 3 (modulo 4), where $P$ is real and positive for all
real values of $y$. (1)
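Result (1) can be checked numerically in the simplest alternant system, an unbranched chain. The sketch below is an illustration only: it assumes zero Coulomb terms, a single resonance integral `beta`, and the three-term recurrence for the chain determinant; the function names are hypothetical and do not come from the paper.

```python
def chain_determinant(n, eps, beta=-1.0):
    """Secular determinant D_n(eps) of an unbranched chain of n atoms.

    Assumption of this sketch: Coulomb terms are zero and every resonance
    integral equals beta, so that
        D_n = (-eps) * D_{n-1} - beta**2 * D_{n-2},  D_0 = 1,  D_1 = -eps.
    """
    previous, current = 1 + 0j, -eps + 0j  # D_0 and D_1
    if n == 0:
        return previous
    for _ in range(2, n + 1):
        previous, current = current, (-eps) * current - beta**2 * previous
    return current


def sign_class(n, y, beta=-1.0):
    """Classify D_n(iy) as 'P', '-iyP', '-P' or 'iyP', with P real and positive."""
    value = chain_determinant(n, 1j * y, beta)
    if n % 2 == 0:
        # Even n: the determinant is real, so only its sign matters.
        return "P" if value.real > 0 else "-P"
    # Odd n: divide out the factor iy and inspect the sign of what is left.
    ratio = value / (1j * y)
    return "iyP" if ratio.real > 0 else "-iyP"


for n in range(1, 9):
    print(n, n % 4, sign_class(n, 1.0))
```

For a chain the classes cycle with period 4 in `n`, in the order stated in (1).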

It follows that $\Delta^{(m)}(iy)/\Delta^{(n)}(iy) = Q$, $iyQ$, $-Q$ or $-iyQ$ according as $n - m \equiv 0$, 1, 2
or 3 (modulo 4), where $Q$ is real and positive for all real values of $y$ except possibly
$y = 0$, where it may become infinite. (2)

Now consider a conjugated system containing two bonds $rs$ and $tu$ such that all
routes from atom $r$ to atom $u$ traverse atoms $s$ and $t$ (see figure 1). Let $l$ be the
number of bonds along any route between atoms $s$ and $t$. Then by I (78)

\ t - (~) r+t 

A ru - (- S( - Pf * s A (n ~ , " 8) , I 

and A*, - , 

the summations being taken over all routes between atoms $s$ and $t$. (We have put
all the $\beta$'s equal only for the sake of clarity.)



Figure 1











But it must be recognized that in molecules where there is a great disparity between 
the various bond lengths, the numerical results obtained below are liable to a certain 
error. This is not because the formulae (e.g. equation (8)) are themselves invalid
(they are not, since, for example, (8) holds even if the resonance integrals are not all
equal) but because in practice we nearly always make our calculations of the coefficients
$c_{rj}$ under the assumption that every $\beta_{rs} = \beta$. For aromatic and condensed
systems generally the errors in such an assumption are likely to be very small; 
but for butadiene (see later) they may be appreciable. 

Now by equation I (19)

$$p_{rs} = 2\sum_j c_{rj}c_{sj}, \tag{7}$$

and by II (10), for an alternant hydrocarbon

$$\pi_{rs,tu} = 4\sum\sum_{j<k} \frac{(c_{rj}c_{sk} - c_{sj}c_{rk})(c_{tj}c_{uk} - c_{uj}c_{tk})}{\epsilon_j + \epsilon_k}. \tag{8}$$


From these equations and a knowledge of the structure of the molecule one can 
therefore calculate mobile bond orders directly, and m.b.p.’s as multiples of $1/\beta$.
The results of such calculations for some important hydrocarbons are given below: 
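Benzene. The bond orders are all $\tfrac{2}{3}$, and the m.b.p.’s are:

$$\pi_{12,12} = \frac{13}{54\beta}, \quad \pi_{12,23} = \frac{-11}{54\beta}, \quad \pi_{12,34} = \frac{7}{54\beta}, \quad \pi_{12,45} = \frac{-5}{54\beta}.$$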
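The benzene figures can be reproduced from (7) and (8) with a few lines of arithmetic. The sketch below makes several assumptions that are not spelled out in the text as it survives: the sums in (7) and (8) run over the doubly occupied orbitals, the denominator in (8) is the sum of two occupied orbital energies (written here as multiples of $\beta$), each bond is entered with its starred atom first (odd-numbered atoms are taken as starred), and the degenerate benzene orbitals are used in their real cosine and sine forms. All function names are illustrative.

```python
import math


def benzene_occupied():
    """Occupied Hückel orbitals of benzene as (x, coefficients), energy = x * beta."""
    a = [1 / math.sqrt(6)] * 6
    b = [math.cos(math.pi * r / 3) / math.sqrt(3) for r in range(1, 7)]
    c = [math.sin(math.pi * r / 3) / math.sqrt(3) for r in range(1, 7)]
    return [(2.0, a), (1.0, b), (1.0, c)]


def bond_order(orbitals, r, s):
    """Equation (7): p_rs = 2 * sum over occupied j of c_rj * c_sj."""
    return 2 * sum(c[r - 1] * c[s - 1] for _, c in orbitals)


def bond_polarizability(orbitals, bond_rs, bond_tu):
    """Equation (8) in units of 1/beta; each bond is (starred atom, unstarred atom)."""
    r, s = bond_rs
    t, u = bond_tu
    total = 0.0
    for j in range(len(orbitals)):
        for k in range(j + 1, len(orbitals)):
            xj, cj = orbitals[j]
            xk, ck = orbitals[k]
            f_rs = cj[r - 1] * ck[s - 1] - cj[s - 1] * ck[r - 1]
            f_tu = cj[t - 1] * ck[u - 1] - cj[u - 1] * ck[t - 1]
            total += f_rs * f_tu / (xj + xk)
    return 4 * total


orbitals = benzene_occupied()
print("p12 =", round(bond_order(orbitals, 1, 2), 4))
# Bonds 1-2, 2-3, 3-4, 4-5 written with the starred (odd) atom first.
for bond in [(1, 2), (3, 2), (3, 4), (5, 4)]:
    print(bond, round(54 * bond_polarizability(orbitals, (1, 2), bond), 4))
```

Multiplying each polarizability by 54 makes the comparison with the numerators 13, -11, 7 and -5 direct.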


Figure 3


Naphthalene (see figure 3). The mobile bond orders are:

$p_{12} = 0.725$, $p_{23} = 0.603$, $p_{19} = 0.555$, $p_{9,10} = 0.518$.

The m.b.p.’s may be found from table 1 by multiplying the figures given by $1/\beta$.
The column headings give one bond and the row headings the other. 


Table 1

          10-9      9-1      1-2      2-3
10-9     0.253   -0.109    0.066   -0.041
9-1     -0.109    0.288   -0.191    0.111
1-2      0.066   -0.191    0.209   -0.209
2-3     -0.042    0.111   -0.209    0.295
3-4      0.066   -0.074    0.124   -0.209
4-10    -0.109    0.096   -0.074    0.111
10-5    -0.109   -0.026    0.025   -0.045
5-6      0.066    0.025   -0.020    0.032
6-7     -0.042   -0.045    0.032   -0.032
7-8      0.066    0.059   -0.032    0.032
8-9     -0.109   -0.195    0.059   -0.045





It is interesting to observe first, that the m.b.p.’s involving a given bond alternate 
in sign as we move away from the bond, and secondly, that the interactions between 
bonds in different rings are not as small as might have been expected by analogy 
with mutual atom polarizabilities.


Figure 4


Styrene (see figure 4). The mobile bond orders in styrene are:

$p_{12} = 0.911$, $p_{23} = 0.406$, $p_{34} = 0.610$, $p_{45} = 0.679$, $p_{56} = 0.659$.

Values of $\beta \times \pi_{rs,tu}$ are given in table 2.


Table 2

           1-2      2-3      3-4      4-5      5-6
1-2      0.073   -0.165    0.044   -0.011    0.009
2-3     -0.166    0.338   -0.099    0.021   -0.014
3-4      0.044   -0.099    0.249   -0.190    0.123
4-5     -0.011    0.021   -0.190    0.247   -0.214
5-6      0.009   -0.014    0.123   -0.214    0.263
6-7      0.009   -0.014   -0.081    0.127   -0.202
7-8     -0.011    0.021    0.112   -0.091    0.127
8-3      0.044   -0.099   -0.157    0.112   -0.081


Here the m.b.p.’s of bonds in the ring are close to those in benzene. Also, it will 
be observed that the self-polarizability of a bond is smaller the nearer the mobile 
order lies to unity. 

The polyenes (general formula CH$_2$(CH)$_{n-2}$CH$_2$). Coulson showed that the
secular equations are satisfied by

$$\epsilon_j = 2\beta\cos\frac{j\pi}{n+1}, \tag{9}$$

$$c_{rj} = \sqrt{\frac{2}{n+1}}\,\sin\frac{rj\pi}{n+1}, \tag{10}$$

and that when $n$ is even the bond orders are given by

$$p_{r,r+1} = \frac{1}{n+1}\left\{\mathrm{cosec}\,\frac{\pi}{2n+2} + (-1)^{r-1}\,\mathrm{cosec}\,\frac{(2r+1)\pi}{2n+2}\right\}. \tag{11}$$

Coulson did not discuss the case of $n$ odd. In such molecules the unpaired electron
makes no contribution to the bond orders, and so

$$p_{r,r+1} = 2\sum_{j=1}^{\frac{1}{2}(n-1)} c_{rj}c_{r+1,j} = \frac{1}{n+1}\left\{\cot\frac{\pi}{2n+2} + (-1)^{r-1}\cot\frac{(2r+1)\pi}{2n+2}\right\}. \tag{12}$$



Both the right-hand side of (11) and that of (12) are unaltered when we write 
n — r + 1 for r ; hence the bond orders are symmetric about the central atom or bond 
in the chain. The mobile bond orders are alternately greater and less than

$$\frac{1}{n+1}\cot\ (\text{or cosec})\ \frac{\pi}{2n+2}$$

in each half of the chain, the differences being smaller near the middle; and the
mobile order of a bond near the middle of a long polyene chain is approximately
$2/\pi$, whether $n$ is even or odd.
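The closed forms for the polyene bond orders can be compared with the direct orbital sum. The sketch below is illustrative only and assumes the standard chain coefficients $c_{rj} = \sqrt{2/(n+1)}\,\sin\{rj\pi/(n+1)\}$, a sum over the doubly occupied orbitals, and the sign factor $(-1)^{r-1}$ in the closed forms; the function names are hypothetical.

```python
import math


def coefficient(n, r, j):
    """Chain coefficient c_rj = sqrt(2/(n+1)) * sin(r*j*pi/(n+1))."""
    return math.sqrt(2 / (n + 1)) * math.sin(r * j * math.pi / (n + 1))


def bond_order_by_sum(n, r):
    """p_{r,r+1} = 2 * sum of c_rj * c_{r+1,j} over the doubly occupied orbitals."""
    doubly_occupied = n // 2  # n/2 for even n, (n-1)/2 for odd n
    return 2 * sum(coefficient(n, r, j) * coefficient(n, r + 1, j)
                   for j in range(1, doubly_occupied + 1))


def bond_order_closed_form(n, r):
    """Closed forms: cosec for even n (equation 11), cot for odd n (equation 12)."""
    a = math.pi / (2 * n + 2)
    b = (2 * r + 1) * math.pi / (2 * n + 2)
    sign = (-1) ** (r - 1)
    if n % 2 == 0:
        return (1 / math.sin(a) + sign / math.sin(b)) / (n + 1)
    return (1 / math.tan(a) + sign / math.tan(b)) / (n + 1)


for n in (3, 4, 6):
    print(n, [round(bond_order_closed_form(n, r), 3) for r in range(1, n)])
print("mid-chain, n = 200:", round(bond_order_closed_form(200, 100), 3),
      " 2/pi =", round(2 / math.pi, 3))
```

With `n = 4` this gives the butadiene bond orders, and for a long chain the central value approaches $2/\pi$ from alternating sides.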

By equation (61) of I, the mutual polarizability of a pair of bonds in a polyene is

$$\pi_{rs,tu} = \frac{1}{\pi}\int_{-\infty}^{\infty} \frac{\Delta_{rt}\Delta_{su} + \Delta_{ru}\Delta_{st}}{\Delta^2}(iy)\,dy. \tag{13}$$

For two bonds in a chain this must be positive or negative according as the bonds
$rs$ and $tu$ are separated by an odd or even number of bonds, as proved in (6) above.

In the notation of II, therefore, the mutual polarizability of two bonds $r, r+1$ and
$t, t+1$ in a polyene chain ($r < t$) is:

$$\frac{1}{\pi}\int_{-\infty}^{\infty} \frac{\beta^{2t-2r}D_{r-1}D_{n-t}D_rD_{n-t-1} + \beta^{2t-2r}D_{r-1}D_{n-t-1}D_rD_{n-t}}{D_n^2}\,dy, \tag{14}$$

where each $D$ is taken as a function of $iy$. By similar reasoning to that used in II
this may be proved equal to

$$\frac{4i}{\beta\pi}\int_0^i \frac{z^{2t-2r}(z^{2r}-1)(z^{2r+2}-1)(z^{2n-2t}-1)(z^{2n-2t+2}-1)}{(z^{2n+2}-1)^2(z^2-1)}\,dz. \tag{15}$$

This expression may be simplified in special cases, e.g.:

(a) For two bonds in the middle of a long polyene chain, $n$, $r$ and $n - t$ are all
large; therefore (15) becomes

$$\frac{4i}{\pi\beta}\int_0^i \frac{z^{2t-2r}\,dz}{z^2-1}. \tag{16}$$

This takes the values $-0.273$, $0.151$, $-0.104$, $0.078$, ..., $\times 1/\beta$ when $t - r$ equals
1, 2, 3, 4, ....
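The long-chain values can be checked by numerical integration. The sketch below is an illustration, not the authors' procedure: it assumes the integral in (16) runs along the imaginary axis from 0 to $i$, and substitutes $z = iy$, which turns the expression into $(-1)^{m}\,(4/\pi\beta)\int_0^1 y^{2m}/(1+y^2)\,dy$ with $m = t - r$. The function names are hypothetical.

```python
import math


def simpson(f, a, b, steps=2000):
    """Composite Simpson rule on [a, b]; steps must be even."""
    h = (b - a) / steps
    total = f(a) + f(b)
    for i in range(1, steps):
        total += (4 if i % 2 else 2) * f(a + i * h)
    return total * h / 3


def long_chain_mutual(m):
    """beta times the mid-chain mutual polarizability for bonds with t - r = m."""
    integral = simpson(lambda y: y ** (2 * m) / (1 + y * y), 0.0, 1.0)
    return (-1) ** m * 4 / math.pi * integral


for m in range(1, 5):
    print(m, round(long_chain_mutual(m), 4))
```

The output alternates in sign and falls off with `m`, in line with the sequence quoted above.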

(b) For two bonds in a long polyene chain, when one bond is at the end of the
chain, the mutual polarizability is

$$\frac{(-1)^{t-1}\,16}{\beta\pi(2t-1)(2t+3)}. \tag{17}$$

This takes the values $-0.26$, $0.12$, $-0.07$, ..., $\times 1/\beta$ when $t$ equals 2, 3, 4, ....

Equation (14) does not hold when $t = r$, that is, for the self-polarizability of a
bond in a polyene. For by equation (13)

$$\pi_{r,r+1;\,r,r+1} = \frac{1}{\pi}\int_{-\infty}^{\infty} \frac{\Delta_{r,r}\Delta_{r+1,r+1} + \Delta_{r,r+1}^2}{\Delta^2}(iy)\,dy \tag{18}$$

$$= \frac{1}{\pi}\int_{-\infty}^{\infty} \frac{D_{r-1}D_{n-r}D_rD_{n-r-1} + \beta^2D_{r-1}^2D_{n-r-1}^2}{D_n^2}(iy)\,dy.$$

This may be proved equal to

$$\frac{2i}{\beta\pi}\int_0^i \frac{(z^{2r}-1)(z^{2r+2}-1)(z^{2n-2r}-1)(z^{2n-2r+2}-1)}{(z^{2n+2}-1)^2(z^2-1)}\,dz + \frac{2i}{\beta\pi}\int_0^i \frac{z^2(z^{2r}-1)^2(z^{2n-2r}-1)^2}{(z^{2n+2}-1)^2(z^2-1)}\,dz. \tag{19}$$

(19) may also be simplified when $n$ is large:

(a) For a bond in the middle of a long polyene chain, when $r$ and $n - r$ are both
large, equation (19) for the self-polarizability becomes

$$\frac{2i}{\beta\pi}\int_0^i \frac{dz}{z^2-1} + \frac{2i}{\beta\pi}\int_0^i \frac{z^2\,dz}{z^2-1} = \frac{1}{\beta}\left(1 - \frac{2}{\pi}\right) = 0.363/\beta.$$

(b) For the terminal bond in a long polyene chain (19) becomes

$$\frac{2i}{\beta\pi}\int_0^i (z^4-1)\,dz + \frac{2i}{\beta\pi}\int_0^i z^2(z^2-1)\,dz = \frac{8}{15\beta\pi} = 0.170/\beta.$$

Disregarding self-polarizabilities, therefore, we see that in all the above special 
cases the mutual polarizability of a pair of bonds in a long polyene decreases with 
distance, as well as alternating in sign. 
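
The two limiting self-polarizabilities quoted above can be checked by direct integration. As a worked sketch, assuming the path of integration runs along the imaginary axis, z = iu with 0 ≤ u ≤ 1 (the variable u is introduced here for illustration):

```latex
\begin{aligned}
\int_{0}^{i}\frac{dz}{z^{2}-1} &= -i\int_{0}^{1}\frac{du}{1+u^{2}} = -\frac{i\pi}{4}, &
\int_{0}^{i}\frac{z^{2}\,dz}{z^{2}-1} &= i\int_{0}^{1}\frac{u^{2}\,du}{1+u^{2}} = i\Bigl(1-\frac{\pi}{4}\Bigr),\\[4pt]
\frac{2i}{\beta\pi}\Bigl[-\frac{i\pi}{4} + i\Bigl(1-\frac{\pi}{4}\Bigr)\Bigr] &= \frac{1}{\beta}\Bigl(1-\frac{2}{\pi}\Bigr) \approx \frac{0.363}{\beta}; \\[4pt]
\int_{0}^{i}(z^{4}-1)\,dz &= \frac{i}{5} - i = -\frac{4i}{5}, &
\int_{0}^{i}z^{2}(z^{2}-1)\,dz &= \frac{i}{5} + \frac{i}{3} = \frac{8i}{15},\\[4pt]
\frac{2i}{\beta\pi}\Bigl[-\frac{4i}{5} + \frac{8i}{15}\Bigr] &= \frac{8}{15\beta\pi} \approx \frac{0.170}{\beta}.
\end{aligned}
```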


Figure 5. Numbering of the carbon atoms in butadiene (1 to 4) and hexatriene (1 to 6).




Butadiene and hexatriene (see figure 5). The bond orders in butadiene are:

$p_{12} = 0.894 = p_{34}$, $p_{23} = 0.447$;

and in hexatriene:

$p_{12} = 0.871 = p_{56}$, $p_{23} = 0.483 = p_{45}$, $p_{34} = 0.784$.

The m.b.p. may be found from tables 3 and 4 (the figure given is $\pi_{rs,tu} \times \beta$).
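
These bond orders and polarizabilities can be reproduced from the closed-form orbitals of a linear chain with equal resonance integrals. The following is a minimal sketch, not the authors' contour-integral method: it assumes the standard chain orbitals (energies 2β cos(kπ/(n+1)), sine coefficients) and evaluates the polarizability by first-order perturbation theory. All function names are illustrative.

```python
import math


def chain_orbitals(n):
    """Orbitals of a linear chain of n atoms with equal resonance integrals.

    x[k] is the energy of orbital k in units of beta (relative to alpha);
    c[k][j] is the coefficient of atom j + 1 in orbital k.
    """
    x = [2.0 * math.cos((k + 1) * math.pi / (n + 1)) for k in range(n)]
    norm = math.sqrt(2.0 / (n + 1))
    c = [[norm * math.sin((j + 1) * (k + 1) * math.pi / (n + 1))
          for j in range(n)] for k in range(n)]
    return x, c


def bond_order(n, r):
    """Mobile order of bond r, r+1 (atoms numbered from 1) in an even chain."""
    _, c = chain_orbitals(n)
    # The first n/2 orbitals (x > 0, i.e. bonding for beta < 0) are doubly occupied.
    return 2.0 * sum(c[k][r - 1] * c[k][r] for k in range(n // 2))


def mutual_polarizability(n, r, t):
    """pi(r,r+1; t,t+1) multiplied by beta, by first-order perturbation theory."""
    x, c = chain_orbitals(n)
    total = 0.0
    for j in range(n // 2):            # occupied orbitals
        for k in range(n // 2, n):     # unoccupied orbitals
            b_r = c[j][r - 1] * c[k][r] + c[j][r] * c[k][r - 1]
            b_t = c[j][t - 1] * c[k][t] + c[j][t] * c[k][t - 1]
            total += 2.0 * b_r * b_t / (x[j] - x[k])
    return total


for n, name in ((4, "butadiene"), (6, "hexatriene")):
    orders = [round(bond_order(n, r), 3) for r in range(1, n)]
    print(name, orders)
```

Calling `mutual_polarizability(4, 1, 2)` gives the butadiene entry for bonds 1-2 and 2-3, and `mutual_polarizability(4, 2, 2)` the self-polarizability of the central bond.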


Table 3. Butadiene

          1-2       2-3       3-4
1-2      0.089    -0.179     0.089
2-3     -0.179     0.358    -0.179


Table 4. Hexatriene

          1-2       2-3       3-4       4-5       5-6
1-2      0.124    -0.211     0.097    -0.038     0.028
2-3     -0.211     0.370    -0.183     0.069    -0.038
3-4      0.097    -0.183     0.167    -0.183     0.097


Here again, therefore, as in styrene, the higher the order of a bond, the lower is
its self-polarizability.

It should be remembered that all the above values of bond orders and m.b.p.'s
have been calculated on the assumption that the resonance integrals of all the
bonds are equal in all the molecules discussed. In part V we shall take into account
variations in β and see how this affects the calculated values of bond lengths and
bond orders.


Variation of bond orders with Coulomb terms 


Having considered changes in bond order arising from changes in resonance 
integrals, which in their turn depend on bond lengths, we may now enquire how 
bond orders change when the Coulomb term of one or more atoms of the system is 
altered, as for instance in chemical substitution. 

Now in an alternant hydrocarbon, by equation (21) of II, all bond-atom polarizabilities
are zero; that is, a small change in the Coulomb term of atom r produces
only a second-order change in the order of bond st. It would therefore be roughly
true to say, for example, that the bond orders in quinoline are equal to those in
naphthalene. But in fact it is possible to make more precise deductions about the
bond orders in a molecule in which the Coulomb term of one atom departs considerably
from zero.

Consider a molecule such as aniline, derived from an odd-numbered alternant
hydrocarbon by replacement of one carbon by a hetero-atom, the total number of
π-electrons being increased by one. Then for one atom (atom r, say) α will be negative,
and by the result of II, p. 30, all the occupied molecular orbitals will have negative
energies. It follows that we can apply I (42), viz.:

<»» 


where $\Delta^{*}$ refers to the hetero-molecule. If $\Delta$ denotes the secular determinant for the
parent hydrocarbon, this may be expanded to give


Therefore 






— A«,t ^r,f+A. A„ iH t 1J* A„ ( A — o^A r , rt A r r 


( 21 ) 


-dy+ 


A*-a*A* r 


-dy. (22) 


A 2 —a®A® r 

It follows from II, table 1, that the numerator in the first integrand in (22) is an odd 
function, and the denominator an even function; therefore the first integral is zero. 




As for the second integral in (22), both numerator and denominator are real, and
by II (5), $\Delta^{2}$ and $-\alpha^{2}\Delta_{r,r}^{2}$ have the same sign. Therefore the integrand lies between

$$\Delta_{s,t}(iy)/\Delta(iy) \quad\text{and}\quad \Delta_{rr,st}(iy)/\Delta_{r,r}(iy).$$

Therefore $p^{*}_{st}$ lies between the corresponding integrals of these two quantities,
or their principal values. Therefore the mobile order of a bond in an alternant molecule
with one hetero-atom has a value intermediate between its values in (a) the hydrocarbon
obtained by replacement of the hetero-atom by a carbon atom, and (b) the hydrocarbon
which remains when the hetero-atom is struck out. (23)
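
The step from the sign condition to the bounds on the integrand is an instance of the mediant inequality. As a sketch, assuming the second integrand of (22) has the form shown on the left below, and writing $b = \Delta^{2}$, $d = -\alpha^{2}\Delta_{r,r}^{2}$ (the symbols b, d and λ are introduced here for illustration only):

```latex
\frac{\Delta\,\Delta_{s,t} - \alpha^{2}\Delta_{r,r}\,\Delta_{rr,st}}{\Delta^{2} - \alpha^{2}\Delta_{r,r}^{2}}
  = \lambda\,\frac{\Delta_{s,t}}{\Delta} + (1-\lambda)\,\frac{\Delta_{rr,st}}{\Delta_{r,r}},
\qquad \lambda = \frac{b}{b+d},
\qquad 0 < \lambda < 1 \ \text{when } b \text{ and } d \text{ have the same sign.}
```

The integrand is thus a weighted mean of the two ratios at every value of y, so its integral lies between their integrals.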

For example, the bond orders in aniline lie between those in benzyl and those in
benzene. This theorem only holds if corresponding resonance integrals are equal in
the hetero-molecule and the related hydrocarbons, but if this restriction is borne
in mind, not only is the result useful in fixing limits for bond orders in hetero-molecules,
but it also provides an alternative picture of the changes in bond order
which are thought to arise from the occurrence of subsidiary resonance states. Take
pyridine for instance. From the resonance standpoint, all the structures in figure 6
are supposed (e.g. Sidgwick 1937) on chemical grounds to contribute to the state of
the molecule, causing, besides displacements of charge, differences in bond order
from benzene: bonds a, and to a lesser extent bonds c, will have lowered orders,
and bonds b will have slightly enhanced orders. By the mobile order theory, according
to (23) the bonds in pyridine must have mobile orders intermediate between those
in benzene, where complete conjugation occurs, and those in the pentadienyl
radical, where there is no atomic orbital corresponding to the nitrogen atom. The
actual values are given in table 5, and are seen both to conform to (23), and to differ
in the direction indicated by the resonance theory.


Figure 6. Resonance in pyridine.


Table 5. Mobile bond orders in pyridine

                bonds a    bonds b    bonds c
benzene          0.667      0.667      0.667
pyridine         0.534      0.703      0.639
pentadienyl      0          0.789      0.577

As a final example let us consider acrolein. The electron affinity of oxygen is
much greater than that of carbon, so that α for the oxygen atom in acrolein is large
and negative. This causes the electron densities to be alternately large and small
along the chain, as proved in II; and if the resonance integrals of all the unsaturated
bonds were equal, the bond orders in acrolein would necessarily have values intermediate
between those in butadiene and those in the allyl radical, by (23). Such
effects would correctly be described by the usual δ-symbols and curved arrows of
the organic chemist (see figure 7 (b)), which are intended to express the difference
between a molecule such as acrolein and its parent hydrocarbon butadiene (Wheland
1944). However, the resonance integral for the C=O bond is certainly much larger
than that of the C=C bond, and this means that the mobile orders of the first and
third bonds in acrolein will tend to be increased at the expense of the second, from
what has been proved about m.b.p.'s in chain molecules. The direction of change of
bond order as we pass from butadiene to acrolein is therefore governed by two conflicting
effects; and numerical calculations (Coulson 1946) show that in fact the
C-C bond in acrolein has a lower order than the C-C bond in butadiene. Thus
although the δ-symbols describe correctly the charge distribution in acrolein, the
curved arrows give a false picture of the differences in bond order between this
molecule and butadiene.


Figure 7. Structural formulae (a), (b) and (c), with mobile bond orders marked (legible values: 0.894, 0.447, 0.894; 0.707, 0.707; 0.819).


The divergence between the present semi-quantitative treatment and the qualitative
resonance picture arises entirely from the fact that the resonance integral of
C=O is large compared to that of C=C; however, in nitrogen compounds we should
expect concordance between the two accounts, since the resonance integrals of
C=C and C=N are nearly equal.


IV. Force constants and interaction constants in 
unsaturated hydrocarbons 

In this part it is shown that the force constant of a bond in a conjugated system depends
upon its 'self-polarizability' as well as upon its mobile order, and that the interaction constant
between a pair of bonds is closely related to their 'mutual polarizability'. The theory is
applied to benzene, where good agreement is found with the scanty evidence available, and
to naphthalene and butadiene, for which at present the force constants are not known.


Introduction 

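In discussing the vibrational frequencies of a molecule in a given electronic level it
is convenient to begin by dividing the energy of the molecule into two parts, namely
$\mathcal{T}$, the kinetic energy of the nuclei, and $\mathcal{V}$, the vibrational potential energy, which
depends solely upon the configuration of the nuclei. If $q_1, q_2, \ldots$ is any set of co-ordinates
specifying completely the internuclear distances and angles, then $\mathcal{V}$ may
be expanded as a Taylor's series about any configuration $\bar q_1, \bar q_2, \ldots$ as follows:

$$\mathcal{V} = \bar{\mathcal{V}} + \sum_r \frac{\partial\bar{\mathcal{V}}}{\partial q_r}(q_r-\bar q_r) + \frac{1}{2}\sum_r\sum_s \frac{\partial^{2}\bar{\mathcal{V}}}{\partial q_r\,\partial q_s}(q_r-\bar q_r)(q_s-\bar q_s) + \frac{1}{6}\sum_r\sum_s\sum_t \frac{\partial^{3}\bar{\mathcal{V}}}{\partial q_r\,\partial q_s\,\partial q_t}(q_r-\bar q_r)(q_s-\bar q_s)(q_t-\bar q_t) + \cdots, \tag{1}$$

where $\partial\bar{\mathcal{V}}/\partial q_r$ denotes the value of $\partial\mathcal{V}/\partial q_r$ in the given configuration.

Now if $\bar q_1, \bar q_2, \ldots$ refer to the equilibrium configuration, then the first sum on the
right vanishes; and if the displacements $q_r - \bar q_r$ are small, the triple sum will be
negligible compared to the double sum. Under these conditions, therefore

$$\mathcal{V} = \bar{\mathcal{V}} + \frac{1}{2}\sum_r\sum_s \frac{\partial^{2}\bar{\mathcal{V}}}{\partial q_r\,\partial q_s}(q_r-\bar q_r)(q_s-\bar q_s). \tag{2}$$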


In general the co-ordinates $q_r$ may be chosen in several different ways. Usually
a valence-force treatment (Bjerrum 1914) is adopted; that is, the potential energy is
expressed in terms of the distances $x_r$ between bonded atoms and the angles between
bonds to an atom, together with any other parameters which are needed to fix the
configuration unambiguously, for instance the relative orientation of the two CH$_3$
groups in ethane. $\partial^{2}\mathcal{V}/\partial x_r^{2}$ is then referred to as the force constant of the rth bond,
and $\partial^{2}\mathcal{V}/\partial x_r\,\partial x_s$ as the interaction constant of bonds r and s.*

In a valence-force treatment it often happens that the frequencies of vibration 
can be accounted for satisfactorily by assuming a potential function in which the 
interaction terms involving the product of two different co-ordinates are zero or 
very small. However, in many molecules it is necessary to introduce such cross 
terms into the potential function in order to secure agreement with observation. 
In a valence force potential function cross terms may obviously be of three types, 
representing interactions between two bonds, a bond and an angle, or two angles. 
A discussion of the different effects responsible for such interactions has been
given by Coulson, Duchesne & Manneback (1947). However, in this paper we
shall be concerned exclusively with resonance interactions between pairs of
bonds in a conjugated system. It will be shown that the mobile electrons in such
molecules would be expected to give rise to interactions between unsaturated bonds,
and that the interaction constants are closely related to the mutual bond polarizabilities
(m.b.p.'s) defined in I.


* To avoid confusion it should be pointed out that in this paper the suffixes r and s in
$x_r$, $\beta_r$, $\pi_{r,r}$ and $\pi_{r,s}$ denote bonds, not atoms. Thus $\pi_{r,s}$ stands for the mutual polarizability
of the bonds r and s, whereas in papers I and II it indicated the mutual polarizability of
atoms r and s, and mutual bond polarizabilities were denoted by $\pi_{rs,tu}$ etc. The only exception
to this scheme of notation is in the relation of the theory to benzene and naphthalene, where
a previously existing notation has been incorporated in our comparison with experiment.





Form of the vibrational potential energy 

In order to calculate force constants and interaction terms it is necessary to make
some assumptions about the dependence of $\mathcal{V}$ upon bond lengths and angles. The
assumptions that we shall use will be the same as those originally introduced by
Lennard-Jones (1937), and summarized by Lennard-Jones & Coulson (1939): we
shall repeat their equations in this section for the sake of clearness.

For any change of configuration in the carbon skeleton of an unsaturated hydrocarbon
one may formally write

$$\mathcal{V} = \mathcal{V}_\sigma + \mathcal{E}, \tag{3}$$

where $\mathcal{V}_\sigma$ is the energy of the σ-bonds and $\mathcal{E}$ the total energy of the π-electrons.

If we ignore changes in energy due to angular distortion, and interactions between
different σ-bonds, we may follow Lennard-Jones, and put

$$\mathcal{V}_\sigma = \tfrac{1}{2}\sum_r \sigma(x_r - s)^{2}, \tag{4}$$

where σ, s are the force constant and length of a pure σ-bond, and $x_r$ is the length of
the rth bond. $\mathcal{E}$, on the other hand, is most conveniently expressed as a function of
the resonance integrals of the unsaturated bonds, the resonance integral $\beta_r$ of the
rth bond depending only on its length $x_r$. For this dependence Lennard-Jones
assumed

$$2\beta_r = \text{const.} + \tfrac{1}{2}\kappa(x_r - d)^{2} - \tfrac{1}{2}\sigma(x_r - s)^{2},$$

so that

$$\frac{d\beta_r}{dx_r} = \tfrac{1}{2}\kappa(x_r - d) - \tfrac{1}{2}\sigma(x_r - s), \tag{5}$$

where κ, d are the force constant and length of a pure double bond. (σ and κ in our
notation stand for $2\kappa_s$ and $2\kappa_d$ in the notation of Lennard-Jones & Coulson.)

Bond lengths in equilibrium 


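The conditions that the bond lengths shall have their equilibrium values $\bar x_r$
are that

$$\frac{\partial\mathcal{V}}{\partial x_r} = 0 \quad\text{for each bond.}$$

But by (3)

$$\frac{\partial\mathcal{V}}{\partial x_r} = \frac{\partial\mathcal{V}_\sigma}{\partial x_r} + \frac{d\beta_r}{dx_r}\frac{\partial\mathcal{E}}{\partial\beta_r},$$

and by I (25),

$$\frac{\partial\mathcal{E}}{\partial\beta_r} = 2p_r, \tag{6}$$

where $p_r$ is the mobile order of bond r. It follows from (4) and (5) that the equilibrium
bond lengths $\bar x_r$ are given by

$$\sigma(\bar x_r - s) + p_r\{\kappa(\bar x_r - d) - \sigma(\bar x_r - s)\} = 0,$$

whence

$$\bar x_r = \frac{d\kappa p_r + s\sigma(1-p_r)}{\kappa p_r + \sigma(1-p_r)}. \tag{7}$$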
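
Equation (7) is easy to evaluate for a given mobile order. The following is a minimal sketch; the default parameter values are those the paper adopts later for benzene (s = 1.54 Å, d = 1.33 Å, σ = 4.96 × 10^5 and κ = 9.80 × 10^5 dyne/cm), and the function name `bond_length` is illustrative.

```python
def bond_length(p, s=1.54, d=1.33, sigma=4.96e5, kappa=9.80e5):
    """Equilibrium bond length in angstroms from mobile order p, equation (7).

    s, d      : lengths of a pure sigma bond and a pure double bond (angstroms)
    sigma, kappa : the corresponding force constants (dyne/cm)
    """
    return (d * kappa * p + s * sigma * (1.0 - p)) / (kappa * p + sigma * (1.0 - p))


for p in (0.0, 0.447, 2.0 / 3.0, 0.894, 1.0):
    print(f"p = {p:.3f}  ->  x = {bond_length(p):.3f} A")
```

The limits p = 0 and p = 1 return s and d respectively, and intermediate orders interpolate between them, with the stiffer double bond weighted more heavily.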





This could be written (see Coulson 1939)

$$\bar x_r = s - \frac{s-d}{1 + \dfrac{\sigma(1-p_r)}{\kappa p_r}},$$

and suggests that in quite general form, the relation between mobile order p and
length x should be

$$x = A + \frac{B}{1 + Cp}, \tag{8}$$

where A, B and C are constants which will have different values for each type of
bond, e.g. C-C, C-O, C-N. In view of the theoretical basis for such a formula
as (8), and its satisfactory agreement with experiment, there does not seem to be
any great advantage to be gained from introducing other entirely empirical relationships
between x and p, as has recently been proposed (Kavanau 1944; Lagermann
1946; Bernstein 1947; Gordy 1947).


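Force constants and interaction terms

By equation (3) the force constant of bond r satisfies the equation

$$\frac{\partial^{2}\mathcal{V}}{\partial x_r^{2}} = \frac{\partial^{2}\mathcal{V}_\sigma}{\partial x_r^{2}} + \frac{\partial^{2}\mathcal{E}}{\partial x_r^{2}}.$$

But $\mathcal{E}$ is a function of the β's, and each β is a function of the corresponding bond
length; therefore the right-hand side becomes

$$\sigma + \frac{\partial^{2}\mathcal{E}}{\partial\beta_r^{2}}\left(\frac{d\beta_r}{dx_r}\right)^{2} + \frac{\partial\mathcal{E}}{\partial\beta_r}\frac{d^{2}\beta_r}{dx_r^{2}} = \sigma + \tfrac{1}{2}\pi_{r,r}\{\kappa(\bar x_r - d) - \sigma(\bar x_r - s)\}^{2} + p_r(\kappa - \sigma),$$

where $\pi_{r,r}$ is the self-polarizability of the rth bond (see I). Substitution for $\bar x_r$ from
(7) gives

$$\frac{\partial^{2}\mathcal{V}}{\partial x_r^{2}} = \sigma(1-p_r) + \kappa p_r + \tfrac{1}{2}\pi_{r,r}\left\{\frac{\kappa\sigma(s-d)}{\sigma(1-p_r) + \kappa p_r}\right\}^{2}. \tag{9}$$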

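Similarly, the interaction constant of bonds r and s is given by

$$\frac{\partial^{2}\mathcal{V}}{\partial x_r\,\partial x_s} = \frac{\partial^{2}\mathcal{E}}{\partial x_r\,\partial x_s} = \frac{d\beta_r}{dx_r}\frac{d\beta_s}{dx_s}\frac{\partial^{2}\mathcal{E}}{\partial\beta_r\,\partial\beta_s} = \tfrac{1}{2}\pi_{r,s}\left\{\frac{\kappa\sigma(s-d)}{\sigma(1-p_r) + \kappa p_r}\right\}\left\{\frac{\kappa\sigma(s-d)}{\sigma(1-p_s) + \kappa p_s}\right\}, \tag{10}$$

where $\pi_{r,s}$ is the mutual polarizability of bonds r and s. Equations (9) and (10) may
be more simply written

$$\frac{\partial^{2}\mathcal{V}}{\partial x_r^{2}} = \Lambda_r + \frac{H\pi_{r,r}}{\Lambda_r^{2}} \tag{11}$$

and

$$\frac{\partial^{2}\mathcal{V}}{\partial x_r\,\partial x_s} = \frac{H\pi_{r,s}}{\Lambda_r\Lambda_s}, \tag{12}$$

where

$$\Lambda_r = \sigma(1-p_r) + \kappa p_r, \qquad \Lambda_s = \sigma(1-p_s) + \kappa p_s,$$

and

$$H = \tfrac{1}{2}\kappa^{2}\sigma^{2}(s-d)^{2}. \tag{13}$$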



The above equations have an interesting physical significance. The force constant
of a bond appears as the sum of two terms, the first of which is linearly related to
the bond order, whereas the second term depends principally on the self-polarizability
of the bond, and has the form of an interaction constant. We see therefore
that (a) the force constant of a bond depends on its self-polarizability as well as on
its mobile order, and (b) the interaction between two bonds is proportional to their
mutual polarizability, but also depends on their mobile orders.

Application to benzene 

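One of the simplest molecules to which equations (11) and (12) may be applied
is benzene, where there is only one kind of CC bond and only three types of interaction
constant between different CC bonds.

Let us number the atoms in benzene consecutively round the ring, 1 to 6. Then
by III the m.b.p.'s are

$$\pi_{1\text{-}2,\,1\text{-}2} = \frac{13}{54\beta}, \quad \pi_{1\text{-}2,\,2\text{-}3} = \frac{-11}{54\beta}, \quad \pi_{1\text{-}2,\,3\text{-}4} = \frac{7}{54\beta}, \quad \pi_{1\text{-}2,\,4\text{-}5} = \frac{-5}{54\beta}, \tag{14}$$

and the mobile order of every bond is 2/3.

In what follows we shall use the notation of Manneback (1935) for the force
constants in benzene. He writes

$$F = \frac{\partial^{2}\mathcal{V}}{\partial x_{1\text{-}2}^{2}}, \quad F_o = \frac{\partial^{2}\mathcal{V}}{\partial x_{1\text{-}2}\,\partial x_{2\text{-}3}}, \quad F_m = \frac{\partial^{2}\mathcal{V}}{\partial x_{1\text{-}2}\,\partial x_{3\text{-}4}}, \quad F_p = \frac{\partial^{2}\mathcal{V}}{\partial x_{1\text{-}2}\,\partial x_{4\text{-}5}}.$$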
In calculating $F$, $F_o$, $F_m$ and $F_p$ we substitute into equations (9) and (10) the following
experimentally determined values:

s = 1.54 Å, σ = 4.96 × 10^5 dyne/cm.,
d = 1.33 Å, κ = 9.80 × 10^5 dyne/cm.,

these figures being those used by Lennard-Jones & Coulson, and

β = -20 kcal./mole = -1.39 × 10^-12 erg,

from measured resonance energies (Wheland 1941). The results are:

$F = 6.84$, $F_o = 1.14$, $F_m = -0.73$, $F_p = 0.52$ × 10^5 dyne/cm.
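
The benzene figures follow directly from (11) to (14). The following is a minimal sketch of that substitution, using the parameter values quoted above converted to c.g.s. units; the constant and function names are illustrative, and the polarizabilities are passed as multiples of 1/β.

```python
S_LEN = 1.54e-8    # length of a pure sigma bond, cm
D_LEN = 1.33e-8    # length of a pure double bond, cm
SIGMA = 4.96e5     # force constant of a pure sigma bond, dyne/cm
KAPPA = 9.80e5     # force constant of a pure double bond, dyne/cm
BETA = -1.39e-12   # resonance integral, erg

# H of equation (13)
H = 0.5 * (KAPPA * SIGMA * (S_LEN - D_LEN)) ** 2


def lam(p):
    """Lambda = sigma(1 - p) + kappa p for a bond of mobile order p."""
    return SIGMA * (1.0 - p) + KAPPA * p


def force_constant(p, self_pol_times_beta):
    """Equation (11); the self-polarizability is given as pi * beta."""
    return lam(p) + H * (self_pol_times_beta / BETA) / lam(p) ** 2


def interaction_constant(p_r, p_s, mutual_pol_times_beta):
    """Equation (12); the mutual polarizability is given as pi * beta."""
    return H * (mutual_pol_times_beta / BETA) / (lam(p_r) * lam(p_s))


p = 2.0 / 3.0
F = force_constant(p, 13.0 / 54.0)
F_o = interaction_constant(p, p, -11.0 / 54.0)
F_m = interaction_constant(p, p, 7.0 / 54.0)
F_p = interaction_constant(p, p, -5.0 / 54.0)
for name, value in (("F", F), ("F_o", F_o), ("F_m", F_m), ("F_p", F_p)):
    print(f"{name} = {value / 1e5:.2f} x 10^5 dyne/cm")
```

The same two functions can be applied to any pair of bonds once their mobile orders and polarizabilities are known, for instance with the butadiene values of table 3.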

These figures may be compared with those given by Duchesne & Penney (1939)
on the basis of the vibrational spectrum of benzene, viz.:

$F - F_o - F_m + F_p = 5.78$ (calc. 6.95) × 10^5 dyne/cm.

$F + 2F_o + 2F_m + F_p = 7.57$ (calc. 8.19) × 10^5 dyne/cm.

and, with certain auxiliary assumptions, including that $F_p = 0$,

$F + F_o - F_m - F_p = 7.50$ (calc. 8.19) × 10^5 dyne/cm.

$F - 2F_o + 2F_m - F_p = 4.33$ (calc. 2.58) × 10^5 dyne/cm.




Although the data are scanty, agreement is fair, particularly as there is considerable
uncertainty as to what values should be taken for s, d, σ, κ and β. Fortunately,
however, we have been able to show, by taking somewhat altered numerical
values for some of these quantities, that even if different values were chosen, the
calculated cross-terms would still be of the right order of magnitude. In the last
force constant the discrepancy may be partly due to the inaccuracy of the secondary
assumptions made by Duchesne & Penney in evaluating the empirical cross-terms;
in particular, it cannot be correct to assume that $F_p = 0$. A better assumption would
be that $F_o : F_m : F_p = -11 : 7 : -5$ as suggested theoretically by (12) and (14). It is
worth noting that whatever values are taken for σ, κ, s and d, the effective force
constants $F + F_o - F_m - F_p$ and $F + 2F_o + 2F_m + F_p$ must be equal since both reduce
to $\kappa p + \sigma(1-p)$ owing to the relative values of the m.b.p.'s in benzene; and this is
confirmed by the empirical results, within experimental error, for the two values
are 7.50 and 7.57 × 10^5 dyne/cm. respectively.
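
The equality of the two effective force constants can be made explicit. As a worked step, writing $\Lambda = \kappa p + \sigma(1-p)$ and inserting the m.b.p.'s of (14) into (11) and (12), the polarizability contributions cancel in both combinations:

```latex
\begin{aligned}
F + F_o - F_m - F_p &= \Lambda + \frac{H}{54\,\beta\,\Lambda^{2}}\,(13 - 11 - 7 + 5) = \Lambda,\\[4pt]
F + 2F_o + 2F_m + F_p &= \Lambda + \frac{H}{54\,\beta\,\Lambda^{2}}\,(13 - 22 + 14 - 5) = \Lambda.
\end{aligned}
```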

While the assumptions of the present theory seem to be more or less valid for 
benzene, it should be realized that benzene is a particularly favourable test case, for 
the following reasons: 

(a) In calculating bond orders and mutual bond polarizabilities, and hence force
constants and interaction terms, it is necessary to assume the resonance integrals
of all the unsaturated bonds equal, in order to avoid unmanageable mathematical
complexities. Almost the only hydrocarbon for which this is true is benzene, since
in other molecules the unsaturated bonds will differ in mobile order and therefore
in length and resonance integral.

(b) In deriving equations (11) to (13) we have neglected changes in bond angle,
and in five of the planar vibrations of benzene the ring angles are unchanged.

(c) Equation (5), which implies a parabolic variation of β with bond length, will
probably not hold over a wide range of x, but this does not matter in benzene, since
even when the molecule is vibrating the differences between the bond lengths are
small.


Application to naphthalene

In view of the structural similarity between benzene and naphthalene, one might
expect the approximations of the present theory to be almost equally valid for both
molecules; and since at present no experimental values of CC force constants or
interaction terms are available for naphthalene, this molecule affords an instance
in which the quantitative predictions of the theory may be verified a posteriori.

It is most convenient to consider separately the potential functions for the
different symmetry classes. As we are concerned with changes in bond length, the
only symmetry classes we need consider are those in which the plane of the molecule
is a plane of symmetry, viz. $A_{1g}$, $B_{1g}$, $B_{2u}$ and $B_{3u}$.* If we use the conventional numbering
for the atoms in naphthalene, and denote the changes in bond length during
a vibration by $\Delta x_{9\text{-}10}$, etc., then the following relations hold in the above symmetry
classes:

$A_{1g}$: $\Delta x_{9\text{-}10} = a$, say,
$\Delta x_{9\text{-}1} = \Delta x_{9\text{-}8} = \Delta x_{10\text{-}4} = \Delta x_{10\text{-}5} = b$, say,
$\Delta x_{1\text{-}2} = \Delta x_{3\text{-}4} = \Delta x_{5\text{-}6} = \Delta x_{7\text{-}8} = c$, say,
$\Delta x_{2\text{-}3} = \Delta x_{6\text{-}7} = d$, say.

$B_{1g}$: $\Delta x_{9\text{-}10} = 0$,
$\Delta x_{9\text{-}1} = -\Delta x_{9\text{-}8} = -\Delta x_{10\text{-}4} = \Delta x_{10\text{-}5} = b$, say,
$\Delta x_{1\text{-}2} = -\Delta x_{3\text{-}4} = \Delta x_{5\text{-}6} = -\Delta x_{7\text{-}8} = c$, say,
$\Delta x_{2\text{-}3} = \Delta x_{6\text{-}7} = 0$.

$B_{2u}$: $\Delta x_{9\text{-}10} = 0$,
$\Delta x_{9\text{-}1} = \Delta x_{9\text{-}8} = -\Delta x_{10\text{-}4} = -\Delta x_{10\text{-}5} = b$, say,
$\Delta x_{1\text{-}2} = -\Delta x_{3\text{-}4} = -\Delta x_{5\text{-}6} = \Delta x_{7\text{-}8} = c$, say,
$\Delta x_{2\text{-}3} = \Delta x_{6\text{-}7} = 0$.

$B_{3u}$: $\Delta x_{9\text{-}10} = 0$,
$\Delta x_{9\text{-}1} = -\Delta x_{9\text{-}8} = \Delta x_{10\text{-}4} = -\Delta x_{10\text{-}5} = b$, say,
$\Delta x_{1\text{-}2} = \Delta x_{3\text{-}4} = -\Delta x_{5\text{-}6} = -\Delta x_{7\text{-}8} = c$, say,
$\Delta x_{2\text{-}3} = -\Delta x_{6\text{-}7} = d$, say.

* The notation is that of Herzberg (1945, p. 108), where the z-axis, called 1, is perpendicular
to the molecular plane, and the y-axis, called 2, lies along the common bond of the two rings.

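If the potential energy $\mathcal{V}$ in (2) is now expanded in terms of a, b, c and d, a similar
calculation to that for benzene gives the following effective force constants and
interaction terms:

                                              A_1g     B_1g     B_2u     B_3u
$\partial^2\mathcal{V}/\partial a^2$            5.8       -        -        -
$\partial^2\mathcal{V}/\partial a\,\partial b$  2.9       -        -        -
$\partial^2\mathcal{V}/\partial a\,\partial c$ -1.8       -        -        -
$\partial^2\mathcal{V}/\partial a\,\partial d$  0.53      -        -        -
$\partial^2\mathcal{V}/\partial b^2$           25        23       29       16
$\partial^2\mathcal{V}/\partial b\,\partial c$  4.0       3.3      1.7      7.9
$\partial^2\mathcal{V}/\partial b\,\partial d$ -1.7       -        -       -3.9
$\partial^2\mathcal{V}/\partial c^2$           28        32       32       26
$\partial^2\mathcal{V}/\partial c\,\partial d$  4.0       -        -        6.4
$\partial^2\mathcal{V}/\partial d^2$           13         -        -       12

Effective force constants and interaction constants in naphthalene (units of 10^5 dyne/cm.)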

Although the parameters used in the calculations were the same as those used for
benzene, the figures above would probably not be greatly affected if the parameters
were assigned slightly different values.

Attention is drawn to the signs of the interaction constants, no less than three of
which are negative; and to their high values in the vibrations of species $B_{3u}$. This
table also shows how dangerous it is to speak of 'the force constant for a bond'.
If, for example, there were no interaction between the bonds 9-1, 9-8, 10-4 and
10-5, all four entries in the row labelled $\partial^2\mathcal{V}/\partial b^2$ would be equal, although the letter
b represents different normal co-ordinates in the different symmetry classes. In fact,
$\partial^2\mathcal{V}/\partial b^2$ varies over almost a factor of two. There is a close parallel here to some
work (Thorndike, Wells & Wilson 1947) on the intensities of infra-red absorption
where, for example, it is shown from a careful analysis of experimental observation
that the dipole moment of C-H in ethylene has effective values that range from
0.37 to 0.77 Debyes according to the type of vibration that is being excited. We
are forced to the conclusion that, particularly in conjugated systems, neither the
dipole moment nor the vibrational potential energy of a molecule is a sum of
contributions from the separate lengths and angles, but that both quantities
involve also significant cross terms between these co-ordinates.

It will also be noticed that the interaction terms are sometimes quite large, even
up to 2/3 of the force constants. Evidently no empirical representation of the force
fields of molecules of this type is likely to be found which does not include a large
number of such cross-terms.


Butadiene 

There are only two kinds of vibration involving changes in the CC bond lengths
in butadiene. In the first

$\Delta x_{1\text{-}2} = \Delta x_{3\text{-}4} = a$, say, and $\Delta x_{2\text{-}3} = b$, say;

and in the second

$\Delta x_{1\text{-}2} = -\Delta x_{3\text{-}4} = a$, say, and $\Delta x_{2\text{-}3} = 0$.

Using the same parameters as in the benzene calculations, and the mutual bond
polarizabilities given in III, one obtains the following constants: in the former type
of vibration

$\partial^2\mathcal{V}/\partial a^2 = 17$, $\partial^2\mathcal{V}/\partial b^2 = 4.4$, $\partial^2\mathcal{V}/\partial a\,\partial b = 2.1$ × 10^5 dyne/cm.

and in the latter type $\partial^2\mathcal{V}/\partial a^2 = 18$ × 10^5 dyne/cm.

Although these figures are qualitatively reasonable, their actual values are probably
not very reliable, for the following reasons. First, the bond orders in butadiene
differ considerably, which means that it is a bad approximation to assume the
resonance integrals equal in calculating bond orders and m.b.p.'s. Secondly, equations
(4) and (5) cannot be expected to hold for differences in bond length as large as
those in butadiene. Finally, in view of the above uncertainties it is difficult to
decide what is the best value to assume for β in evaluating the m.b.p.'s, which appear
as multiples of 1/β. We have therefore used the benzene value for uniformity.

Conclusions 

The conclusions to be drawn from the present investigation seem to be as follows:

(i) The force constant of an unsaturated bond in a conjugated system will depend
on its self-polarizability as well as on its mobile order.

(ii) A pair of unsaturated bonds in a conjugated system will in general interact,
the interaction constant being proportional to the mutual polarizability of the bonds,
but also involving their mobile orders.

(iii) Owing to the nature of the approximations occurring in the theory, one would
expect the equations to hold most exactly for molecules in which there is not much
variation in mobile order between the bonds; indeed, the theory is in moderately
good agreement with the experimental data for benzene.

(iv) The theory takes no account of interactions between σ-bonds. The fact that
in many saturated molecules (e.g. CCl$_4$) the observed frequencies of vibration
imply considerable cross-terms in the potential function (Herzberg 1945) means
that this will not in general be a valid approximation; in fact, preliminary
work (unpublished) on the chloroethylenes indicates that the present theory is not
general enough to account for their potential functions. But this does not seem
to apply to hydrocarbons; for instance, in CH$_4$, a simple valence-force potential
function without cross-terms gives a satisfactory interpretation of the vibrational
spectrum.

To sum up, although the present theory takes no account of angular strain or
σ-bond interactions, its comparative success when applied to benzene suggests that
in unsaturated hydrocarbons the interactions between the unsaturated bonds arise
primarily from the mobile electrons.


References 

Bernstein, H. J. 1947 J. Chem. Phys. 15, 284.

Bjerrum, N. 1914 Verh. dtsch. phys. Ges. 16, 737.

Coulson, C. A. 1939 Proc. Roy. Soc. A, 169, 413.

Coulson, C. A. 1946 Trans. Faraday Soc. 42, 106.

Coulson, C. A., Duchesne, J. & Manneback, C. 1947 Nature, 160, 793; V. Henri Memorial
Volume, Liège, 1948.

Coulson, C. A. & Longuet-Higgins, H. C. 1947a Proc. Roy. Soc. A, 191, 39 (paper I of this
series).

Coulson, C. A. & Longuet-Higgins, H. C. 1947b Proc. Roy. Soc. A, 192, 16 (paper II of this
series).

Duchesne, J. & Penney, W. G. 1939 Bull. Soc. Roy. Sci. Liège, 11, 514.

Gordy, W. 1947 J. Chem. Phys. 15, 305.

Herzberg, G. 1945 Infra-red and Raman spectra, pp. 182-3. New York: Van Nostrand Co.

Kavanau, J. L. 1944 J. Chem. Phys. 12, 467.

Lagermann, R. T. 1946 J. Chem. Phys. 14, 743.

Lennard-Jones, J. E. 1937 Proc. Roy. Soc. A, 158, 280.

Lennard-Jones, J. E. & Coulson, C. A. 1939 Trans. Faraday Soc. 35, 811.

Manneback, C. 1935 Ann. Soc. Sci. Brux. B, 55, 129, 237.

Sidgwick, N. V., Taylor, T. W. J. & Baker, W. 1937 The organic chemistry of nitrogen,
p. 523. Oxford University Press.

Thorndike, A. M., Wells, A. J. & Wilson, E. B. 1947 J. Chem. Phys. 15, 157.

Wheland, G. W. 1941 J. Amer. Chem. Soc. 63, 2025.

Wheland, G. W. 1944 The theory of resonance, p. 248. New York: Wiley and Sons.



The thermal etching of silver 

By B. Chalmers,* R. King and R. Shuttleworth†

Royal Aircraft Establishment, South Farnborough, Hants

(Communicated by E. N. da C. Andrade, F.R.S.—Received 14 August 1947—
Read 12 February 1948)

[Plates 9 to 12]


The mechanism whereby grain boundaries are delineated and striations formed on polished
surfaces of heated metal specimens has been examined. Experiments on electrolytically
polished silver show that grooves form at grain boundaries at temperatures as low as 300° C
and striations at 500° C in air. Striations only appear in the presence of oxygen and may be
removed by heating in nitrogen.

A furnace for high-temperature photomicrography, suitable for specimen temperatures up
to about 950° C, is described.

Previous theories are found inadequate to explain the effects observed in silver, and a theory
which regards the surface etching as an approach to equilibrium by the reduction of surface
free energy is suggested. Thus the equilibrium condition of the boundary is a groove whose
shape is determined by the relative magnitudes of the free energy per unit area of the boundary
and the surface energies of the crystalline faces meeting the boundary in the surface of the
specimen. The striations are caused by the development of those crystalline planes having
the lowest free energy; the relative surface energies of different planes being modified by the
presence of oxygen. It is suggested that the chief means whereby the boundary grooves are
formed is that of surface migration of ions: both surface migration of ions and evaporation
of silver oxide molecules may be expected to play major parts in the formation of the
striations.


1. Introduction 

The nature and the properties of the material constituting the boundaries of the
crystal grains in polycrystalline metals are of profound importance in the study of
metals, yet there exists comparatively little detailed information concerning them.
As part of a general investigation of the grain boundary, experiments were carried
out to examine the nature of the etching effect known to occur frequently on
polished surfaces of metals when they are heated. When conditions are such that
this etching occurs, the grain boundaries are delineated and it was felt that an
understanding of the mechanism of this phenomenon would provide valuable
information concerning their nature. Under some conditions of heating, polished
surfaces show, besides grain-boundary markings, striations which cross the different
crystal grains in different directions. As, in the course of the experiments on the
grain-boundary etching, the opportunity arose to observe this effect as well, it was
examined and some additional experiments designed specifically to provide information
concerning it were performed.

* Now at Atomic Energy Research Establishment, Harwell, Berks.
† Now at H. H. Wills Physical Laboratory, University of Bristol.

The experiments were all carried out on silver and it was found that none of the 
explanations previously put forward for the thermal-etching process was completely 
satisfactory for the results obtained. A tentative explanation consistent with 
published results is suggested. 

2. Previous observations upon thermal etching* 


At the outset it is necessary to distinguish clearly between the phenomena of 
thermal etching and those of 4 thermal fatigue * recently described by Boas & Honey- 
combe (1946, 1947). There are distinct similarities between the appearance of 
specimens which have been subjected to ‘thermal fatigue’ and that of specimens 
thermally etched. In both cases the grain boundaries are delineated upon what 
was initially a polished surface, and striations develop which run across the grains 
themselves. Despite the similarity in appearance of the specimens, however, there 
is little doubt that the phenomena are different. The phenomena of thermal fatigue 
were produced in Boas & Honeycombe’s experiments by subjecting specimens of
the non-cubic metals tin, zinc and cadmium to cycles of heating and cooling, and
were not observed when specimens of lead, which is cubic, were similarly treated.
Boas & Honeycombe explained their results by the fact that the non-cubic metals
expand and contract anisotropically when heated and cooled and thus in a randomly
orientated aggregate of crystal grains stresses are set up with change of temperature.
These stresses are sufficient to cause slip in certain crystals and the striations
observed are in fact slip lines. The stresses also cause changes in level between
different grains and the grain boundaries are delineated. With increasing numbers 
of cycles of heating and cooling the striations become more numerous and if grain 
growth occurs, there are formed, in addition to the old grain-boundary outlines, 
new lines which coincide with the new grain boundaries. In this way a series of 
boundary networks is formed, one corresponding to each cycle. The phenomena 
observed appeared to be independent of the environment in which the cyclic 
heating took place. 

Boas & Honeycombe give much additional evidence in support of their explanation,
and ‘thermal fatigue’ is undoubtedly the explanation of the delineation of
the grain boundaries observed by Carpenter & Elam (1920) when they heated and 
cooled specimens of an antimony-tin alloy. 

As distinct from ‘thermal fatigue’, the phenomena of thermal etching are observed
in many cubic metals; they can be produced by maintaining the metal at an
elevated temperature without any cyclic heating and cooling, and they are affected 
by the environment in which the metal is heated. The grain boundaries are delineated 
not by difference in level between adjacent grains but by grooves formed at the 
intersection of the boundary with the surface. The striations are corrugations or 
a stepped structure of the surface. 

* [Note added in proof.] Previous observations upon thermal etching have recently been 
reviewed: Shuttleworth, R., July 1948, Metallurgia. (In course of publication.)



Thermal etching of silver

During the past thirty years thermal etching phenomena have been observed by 
numerous workers. Networks of grooves were observed on iron heated in a vacuum by 
Rosenhain & Humfrey (1910) and on iron heated in hydrogen by Rawdon & Berglund
(1928), who also observed striations. Grain-boundary networks on steels were
observed by Kroll (1912), Humfrey (1912), Rawdon & Scott (1922) and Hemingway 
& Ensminger (1922). The phenomenon has been used (Day & Austin 1940; Miller 
& Day 1942) for the determination of austenitic grain size. Parker & Smoluchowski
(1945) observed that molten silver spread along the boundary grooves on a 
molybdenum steel more rapidly than across the crystal faces. Where explanations 
of the phenomena were offered they were attributed to selective evaporation. 

Striations were observed on the surfaces of cast specimens by Ewing & Rosenhain 
(1900) and Graf (1942). 

Specific attention has, however, been directed by few workers towards the 
understanding of the phenomena. The formation of the boundary grooves was 
considered by Rosenhain & Ewen (1912), who heated polished specimens of copper, 
silver and zinc in a vacuum and found that grooves developed at grain boundaries 
but not at twin boundaries. They observed that when heated under otherwise 
identical conditions, specimens with small grain size evaporated more rapidly than 
those with large grain size. They interpreted their experiments by supposing that 
metallic grains are joined by a cement of amorphous metal which has a greater 
vapour pressure than the crystalline metal. On heating in vacuum the amorphous 
metal exposed at the boundaries evaporates more rapidly than the grains, thus 
producing the boundary grooves. They attributed the greater loss of weight from 
the small grained specimens to the fact that they exposed more grain boundary 
material than did the specimens of large grain size, and also as the grooves formed 
the effective area was increased. 

They also suggested that the grooves at the boundaries could have been produced 
in part by the rearrangement of the mechanically polished surface layer to conform 
with the crystalline structure beneath. 

Fonda (1923), working on the rate of evaporation of tungsten filaments at 2677° C,
found that grooves developed at the grain boundaries both in inert atmospheres
and in a vacuum. In a vacuum the rate of evaporation from filaments with small
grain size was 1.4 times that from filaments whose grains were 4 to 100 times
longer, and Fonda attributed the phenomena he observed to the greater volatility 
of the grain boundary material. 

Cases of the formation of striations were examined by Johnson (1938) and 
Gwathmey & Benton (1940a, b, 1942), who explained their formation by the migration
of ions or the rearrangement of atoms and by Leroux & Raub (1930) and Elam 
(1936) who ascribed the phenomena they observed to oxidation of the surface. 

Johnson (1938) observed striations on tungsten filaments, the surfaces of which 
he examined in considerable detail, after they had been heated under various 
conditions. He found that the surface structure produced by the heating depended
upon the nature of the atmosphere and the heating current. Thermal gradients in
the filaments modified the surface structure. Stepped structures which occurred 
with D.C. heating, and were also caused by temperature gradients, Johnson supposed 
were produced by the surface migration of positive tungsten ions in the electric
and thermal fields, the rate of drift depending upon the crystallographic direction. 
A subsidiary experiment showed that thorium migrated over the surface of tungsten 
in an ionic form, the direction of migration being influenced by an applied electric 
field. Johnson also found that tantalum, molybdenum, platinum, iron and nickel 
filaments roughened when heated with d.c. Mrowca (1943) observed very similar 
results on tantalum ribbons; striations appeared with d.c. heating and when there 
were thermal gradients. He explained the striations in the same way as Johnson. 

Gwathmey & Benton (1940a, b, 1942), examining the reaction of gases on the
surface of copper, used a spherical single crystal of copper whose surface was
obtained unoxidized in a strain-free condition by electrolytic polishing followed by
annealing in hydrogen. They found that heating in a vacuum of 10⁻⁴ mm., or in
hydrogen at atmospheric pressure, caused no change on the surface of the crystal, 
but if the sphere was heated in air at pressures and temperatures which produced 
only a thin oxide film, an oxide pattern was produced which indicated that different 
crystallographic planes are oxidized at different rates. A sphere heated to 1,000° C
at a pressure of 0.3 mm. showed, besides this differential oxidation, circular striations
spreading out to 3 to 5 mm. diameter around the (110) and (111) poles of the
crystal. The striations were caused by the development of {110} and {111} planes
at these positions. When the sphere was heated to 580° C in hydrogen the oxide
film disappeared and the striations were diminished. 

From these experiments Gwathmey & Benton concluded that at temperatures 
below the melting point rearrangements of the surface atoms can occur to produce 
facets with simple crystallographic indices. 

Leroux & Raub (1930) carried out careful weighing experiments upon specimens 
of silver heated to temperatures of 600 and 750° C in a stream of oxygen and in
a stream of air, and observed that there was a loss of weight which they attributed
to the formation of an oxide, which they suggested was Ag₂O and which was
volatile at the temperatures of their experiments. Striations were formed on the 
surface of the specimens. When the specimens were heated under similar conditions 
in hydrogen there was no measurable loss in weight and no striations were formed.
They concluded, therefore, that the rate of oxidation was different for different
crystallographic planes of the silver and that the striations were due to the
development of these planes. Graf (1942) correlated the striated structure to that observed
when metals solidified from the molten state. 

Elam (1936) heated a mechanically polished specimen of copper containing 
cuprous oxide, in a vacuum of about 0.5 mm., to a temperature between 900 and
950° C, and found that striations appeared; oxygen-free copper heated under
similar conditions developed no striations. If, however, the surface of the
oxygen-free copper was first oxidized and the specimen then heated in a vacuum to 900° C
the oxide disappeared to leave a striated surface. There was a nett loss of weight
indicating evaporation of the oxide in a molecular form. It was found that the
formation of striations was very sensitive to surface preparation, and even prolonged 
heating at 950 ° C did not remove the effects of the mechanical polish. 

Elam found that if an etched face was polished and reheated, the striations 
reappeared in the same directions and that their spacing was the same. Separate 
specimens cut from the same single crystal also showed the same striations on
corresponding faces. Where striations appeared on three faces of a single crystal 
they could not be identified as the intersection of a single plane with those three 
surfaces, although the directions of the lines on single faces were consistent with
their being the intersection of {100} or {110} planes with that surface. 

The mechanism suggested by Elam for the formation of the striations is that 
when the surface of copper is oxidized the rate of oxidation is greatest for the {100} 
and the {110} planes. When the specimen is then heated in a vacuum to a high 
temperature, the oxide evaporates leaving a striated surface. In the case of the 
copper containing cuprous oxide, the oxide decomposes on heating and the oxygen 
diffuses to the surface, which it oxidizes. The striations are then produced as in the 
case of the specimen whose surface was initially oxidized.

3. Experimental procedure 

A number of experiments were performed to examine the thermal etching of 
high purity silver. Preliminary experiments indicated that the phenomena were 
very sensitive to surface preparation, and attention was concentrated upon finding 
a method of preparation which gave consistent, reproducible results. A method 
of electrolytic polishing was found most satisfactory and a series of experiments 
was carried out on specimens prepared in this way to examine the effects of 
temperature and various atmospheres on the etching process. To examine the 
surface of silver specimens when they were at elevated temperatures a furnace was 
designed to enable microscopy to be carried out at high temperatures. 

(a) Selection of material 

Silver was selected as the material for the first experiments on thermal-etching 
phenomena for a number of reasons. Since the grain boundaries are concerned in 
the etching, purity of the metal is of great importance, as quite a small average 
impurity content might, if concentrated at the grain boundary, have a large effect. 
Silver may be obtained in a condition of high purity. The silver used was supplied 
specially by Johnson, Matthey and Co. Ltd., who cast it in a vacuum and then cold 
forged and cold rolled it into 8 s.w.g. sheet. Analysis showed 0.00010 % iron,
0.00014 % manganese and faint traces of magnesium, calcium, silicon and lead.

In oxygen at atmospheric pressure silver oxide (Ag₂O) in bulk is unstable and
decomposes above 200° C. This reaction is reversible, but below 200° C the rate of 
oxidation is small, becoming negligible below 100° C (Lewis 1906; Keyes & Hara 
1932; Benton & Drake 1932, 1934). 


As the crystal structure of silver is cubic its thermal expansion is isotropic and 
no stresses are set up when a polycrystalline specimen is heated or cooled provided 
there are no thermal gradients in the specimen. 

(b) The preparation of specimens 

Initial experiments were carried out using silver which had been ground on 
emery papers and then mechanically polished on velvet with a commercial metal 
polish. Because of the softness of the silver considerable difficulty was experienced; 
for reasonably reproducible results a tedious process of alternately polishing and 
chemically etching was needed and even after this process it was found (in
agreement with the results of Elam (1936) for copper) that the effects of grinding were
not removed by heating to a temperature near the melting point. 

Finally a method of electrolytic polishing was found which gave reproducible
results, and all specimens were prepared by this process. The process is described
elsewhere (Shuttleworth, King & Chalmers 1947). 

(c) The effect of temperature on the etching 

To examine the effect of temperature on the etching of silver, specimens were 
heated in a fused silica tube open to the atmosphere. The tube was placed in an 
electric furnace already set at the required temperature. A thermocouple placed 
near to the specimen was used to indicate the specimen temperature and the furnace 
was controlled to maintain the temperature to within ± 1° C. The temperatures
examined were 300, 500, 700, 850 and 940° C and figure 2, plate 9 (× 250) shows the
structures exhibited on examination when cool after 11 hours at these temperatures.
Figure 3, plate 9 (× 1500) shows the structures for 500, 850 and 940° C at a higher
magnification. 

Grooves have appeared at the grain boundaries and striations have occurred on 
the grains themselves. The striations change their direction at grain and twin 
boundaries. Extensive grain growth has occurred at the higher temperatures and
the proportion of twinned crystals is higher at these temperatures. At 300° C no
striations have appeared and the grain boundaries are incomplete. At 500° C the
boundary grooves are complete and striations have begun to appear. The
proportion of grains striated is seen to increase with temperature and above 850° C
nearly all the grains are striated. At the higher temperatures the striations are
more closely spaced and a second set of striations, inclined to the first, appears on
some grains. The striations which form at the lower temperature are not as straight
and regular as those formed at the higher temperature; a few circular striations are
shown in figure 3a (× 1500). At the lower temperatures the curvature caused by
the boundary grooves appears to extend over an appreciable area of the grain 
surface; if the striations were the traces of specific crystallographic planes, they 
would be curved on a curved surface. Gwathmey & Benton (1940a, b) observed 
circular striations on spherical crystals of copper. 



(d) The effect of atmosphere on the etching 

Specimens were heated at 920° C in air, in oxygen, in pure nitrogen and in
a vacuum of better than 10⁻⁴ mm. of mercury. The specimens were once more
heated in a silica tube. The required atmosphere was established in the tube before 
it was placed in the furnace which was already at the required temperature. The 
specimen reached the required temperature in about 10 min. After the period of 
heating the tube was allowed to cool in air and the specimen cooled to 100° C in 
about 10 min. In all cases the atmosphere was maintained until the specimen 
cooled to room temperature. 

Ordinary commercial oxygen and oxygen-free nitrogen were used. The nitrogen
contained less than 10 vol./million of oxygen, less than 50 vol./million of hydrogen
and less than 0.02 g./cu.m. of water vapour at 120 atm. pressure.

Typical photomicrographs of surfaces after these tests are shown in figure 4,
plate 10 (× 250).

In all cases extensive grain growth has occurred. In air (figure 4 a) grain boundaries 
have formed and strong striations have appeared. Similar results are shown in 
figure 4b for specimens heated in oxygen, but in general the striations are more
closely spaced and frequently a second set of striations appears inclined in direction
to the first. In some cases a third set appears. Figure 5, plate 10 (× 2000) shows
the complicated structure developed in oxygen. In nitrogen, grooves appear at the 
grain boundaries and much less noticeably at twin boundaries; no striations appear. 
In a vacuum the grain boundaries show as grooves. No striations are produced, but 
the surface rapidly becomes rough. Figure 4d (× 250) shows the surface after only
l\ hr. in a vacuum at 920° C, and is therefore not strictly comparable with the other
photomicrographs. It may, however, be compared with figure 8^, plate 12 which 
shows the surface after heating in air for 1 j hr. at 920 ° C. 

Some attempts were made at heating in hydrogen, but the results were
unsatisfactory because the hydrogen was of doubtful purity. The results did, however,
show no evidence of striations. 

To verify that the boundary markings observed were in fact grooves as suggested 
by microscopic examination and not differences in level between adjacent grains,
electron microscope replica photographs were obtained of a sample heated in 
nitrogen. The photographs confirmed that the markings were grooves below the 
level of both grains. 

(e) The reversibility of striation etching 

Because striations only appear in atmospheres containing oxygen, experiments 
were carried out to determine whether striations could be made to disappear by 
heating in nitrogen after heating in air (Shuttleworth, King & Chalmers 1946). 
Specimens were first heated in nitrogen until grain growth was only proceeding 
slowly, then in air for a sufficiently long period for striations to form, then in 
nitrogen for various lengths of time. Figure 6a, plate 11 (× 250) shows the surface
after 11 hr. in nitrogen at 920° C. Figure 6b shows the same surface after heating
for 1 hr. in air at the same temperature; striations have formed. Figures 6c, d and
e show the surface after a further heating of 1, 3 and 11 hr. respectively in nitrogen;
the striations have gradually disappeared. Figure 6f shows the effect of heating
for a further period of 1 hr. in air. The striations have appeared in the same 
directions as before, but are more closely spaced. 

During the preliminary anneal in nitrogen there was extensive grain growth and 
subsequently little boundary migration occurred. However, boundary migration 
restarted after figure 6c was taken and scars were seen to remain in the positions
where the boundaries had halted. One scar appears on figure 6d and is indicated by 
the arrow. The scars gradually disappeared on further heating in nitrogen. 

(f) High-temperature microscopy

In the previous experiments the specimens were all examined after cooling. In 
order to determine whether the boundary grooves and striations appeared whilst 
the silver was at the high temperature or whether they appeared during the 
processes of heating or cooling, specimens were examined continuously while they 
were maintained at high temperatures. The furnace constructed for this purpose 
is described in the appendix. 

Figure 8, plate 12 shows a series of photomicrographs taken while a specimen was 
heated up to 920 ° C and maintained at that temperature. Figure 8a shows that 
after only 6 min., by which time the temperature had reached 575 ° C, the boundaries 
had appeared. Striations may be seen in figure 8c which shows the surface after 
19 min. when the temperature had reached 850 ° C. The photomicrographs clearly 
demonstrate that both grain boundaries and striations are present at the high 
temperature and are not formed by the cooling process. 

The technique provides a convenient means of observing grain growth
continuously. Grain growth is seen to occur by the migration of the boundaries; the
grooves move with the boundaries, leaving in general no scars at previous positions. 
Occasionally a network of scars is left and it is believed that the network coincides 
with a position of the boundaries where for some reason grain growth was temporarily 
halted. 

When silver of commercial purity is heated in air, grain growth is retarded at the 
surface owing to oxidation of impurities at the grain boundaries, but Chaston (1945) 
has shown that in high-purity silver, grain growth proceeds at the same rate at the 
surface as in the centre of the material. Figure 8 should therefore be typical of what 
happens at the centre. 

(g) The direction of the striations 

As the striations appeared to be due to the development of specific
crystallographic planes at the specimen surface, the following experiment was carried out
to provide information concerning the crystallographic indices of those planes. 
A specimen was first heated in nitrogen at 920 ° C until grain growth was proceeding 
only slowly, and then heated in air at 920 ° C for £ hr. to produce striations. 
A selected field on the specimen was then photographed and the specimen heated
in nitrogen at 920 ° C until the striations had completely disappeared. The specimen 
was then compressed in a vice to produce a little plastic deformation and to show
slip lines. The same field of the specimen was then photographed once more. Results
are shown in figure 7, plate 11 (× 250). Figure 7a shows the thermal etching
striations. Figure 7b shows the slip lines. In some of the crystals there was more
than one set of slip lines, but in all cases there was one set parallel to the striations
previously produced by thermal etching: as slip occurs on the {111} planes in silver 
and the slip lines are thus the intersections of {111} planes with the surface, it is 
a reasonable conclusion that the faces of the steps causing the striated appearance
are {111} planes, since their intersection with the surface is in all cases parallel to
a set of slip lines. This disagrees with the results of Graf (1942), who identified the
planes as {100} planes, but it is possible that there may be cases where the
orientation of the surface of the specimen relative to the crystal is such as to favour the
development of {100} planes. 

(h) Summary of results 

Briefly summarized, the results of the experiments may be stated as: 

(i) Grain boundaries appear as grooves when silver is heated in air, oxygen, 
nitrogen, hydrogen and in a vacuum. They were observable in specimens heated 
in air at temperatures as low as 300 ° C. 

(ii) Striations appear when oxygen is present in the atmosphere and may be 
caused to disappear by heating in nitrogen. The lowest temperature at which they 
were observed on heating in air was about 500 ° C. The directions of the striations 
were consistent with their being due to the development of {111} faces. 

(iii) Both striations and grain-boundary grooves are formed at the high temperature 
and are not produced by the cooling process. 

(iv) When grain growth occurs the grooves move forward with the boundaries 
and there are not, in general, scars left at previous positions. 

4. Discussion 

The rate of evaporation of silver in an inert gas at atmospheric pressure is but 
a few hundredths of that in a vacuum (Rosenhain & Ewen 1912). This is because 
most of the evaporating atoms are reflected after collision with the gas molecules 
and recondense on the surface. In effect, under these conditions silver exists in 
equilibrium with its vapour and an explanation of thermal etching will be sought 
in a consideration of the equilibrium configuration of the surface, i.e. that which 
has the lowest Gibbs Free Energy. Under such conditions where the nett loss of 
silver is negligible, any change from a polished to an etched surface occurs by the 
transfer of silver atoms from one part of the surface to another and the final 
equilibrium form will be independent of the mechanism of this transfer. In 
a vacuum, however, evaporation occurs without subsequent condensation and it 
may well be that the surface produced by this process bears no relation to the 
equilibrium surface. 



(a) The formation of boundary grooves 

A detailed picture of the grain boundary is not necessary for the interpretation 
of the grain-boundary grooves. The grain boundary must be a region of comparative 
atomic disorder; because of this the free energy of a polycrystalline metal is greater 
than that of a single crystal with the same dimensions. Formally this excess free 
energy may be regarded as a free energy per unit area of the crystal boundaries; 
this is analogous to the free energy per unit area attributed to the heterogeneous 
regions between different phases, in the thermodynamic description of multi-phase 
systems (see, for example, Guggenheim 1933). 

When a metal is heated and no chemical action takes place, the only changes 
which can occur are rearrangements of the atoms to reduce the total free energy 
of the metal to a minimum. Assuming that the grains are strain free this free energy 
for a given specimen can be divided into three parts: (i) The free energy which the 
mass of the metal would possess if it were at the centre of a strain-free lattice. This 
is obviously constant, (ii) An additional free energy associated with the surface of 
the specimen. The value of the free energy per unit area associated with each 
element of surface may be expected to depend upon the crystallographic indices 
of that element of area, (iii) An additional free energy associated with the grain 
boundaries. The value of the free energy per unit area associated with each element 
of area of boundary may be expected to depend upon which crystallographic planes 
are in contact at this element of area, i.e. upon the relative orientation of the two 
crystal grains meeting at the boundary and the direction of the boundary. It is 
seen therefore that the approach to a minimum of free energy in a strain-free 
specimen must take place by the reduction of the surface and grain-boundary free 
energies. 

Adam (1941) shows that in obtaining equilibrium conditions it is mathematically 
equivalent to regard a surface having a free energy per unit area as having a tensile 
force per unit length of the same numerical value and acting parallel to the surface. 
Equilibrium conditions for the grain boundaries can therefore be obtained by 
regarding them as the seat of surface-tension forces. At the intersection of a grain 
boundary with the surface, therefore, there are three tension forces acting at a line:
the grain-boundary surface tension, and the surface tensions of the two crystalline
faces meeting the boundary. The conditions for equilibrium therefore are given
by figure 1. The plane of the figure is perpendicular to the line of intersection of the
forces. $T_A$ is the surface tension of the surface of grain A, $T_B$ that of B, and
$T_{AB}$ the surface tension of the grain boundary. The condition for equilibrium is then
$T_{AB}/\sin\gamma = T_A/\sin\beta = T_B/\sin\alpha$, and it is clear that a groove should form on the
surface at the intersection of the boundary with it.
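
As a rough numerical illustration of this equilibrium condition, the sketch below computes the groove angle γ (the angle between the two free-surface tensions, opposite the grain-boundary tension) from assumed values of the three tensions. It uses the equivalent cosine form T_AB² = T_A² + T_B² + 2 T_A T_B cos γ, which follows from requiring the three forces to balance. The function name `groove_angle_deg` and the sample tension ratios are hypothetical and chosen only for illustration; they are not measurements from these experiments.

```python
import math


def groove_angle_deg(t_a: float, t_b: float, t_ab: float) -> float:
    """Return the groove angle gamma in degrees.

    gamma is the angle between the two free-surface tensions t_a and t_b,
    i.e. the angle opposite the grain-boundary tension t_ab in the
    sine relation quoted in the text.
    """
    if min(t_a, t_b, t_ab) <= 0.0:
        raise ValueError("all tensions must be positive")
    # Zero resultant force: t_ab^2 = t_a^2 + t_b^2 + 2 t_a t_b cos(gamma).
    cos_gamma = (t_ab**2 - t_a**2 - t_b**2) / (2.0 * t_a * t_b)
    if not -1.0 <= cos_gamma <= 1.0:
        raise ValueError("no equilibrium: the three tensions cannot balance")
    return math.degrees(math.acos(cos_gamma))


if __name__ == "__main__":
    # Hypothetical ratios of boundary tension to surface tension
    # (arbitrary units, equal surface tensions), for illustration only.
    for ratio in (0.1, 0.5, 1.0):
        gamma = groove_angle_deg(1.0, 1.0, ratio)
        print(f"T_AB / T_surface = {ratio:.1f}: gamma = {gamma:.1f} deg")
```

With equal surface tensions, a boundary tension equal to the surface tension gives γ = 120°, and γ approaches 180° (no groove) as the boundary tension becomes small compared with the surface tensions.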

While the conditions so far considered specify the angles in the boundary grooves, 
their depths are controlled by the grain size, the equilibrium state being reached 
when each grain surface is a convex cap intersecting its neighbours at the equilibrium 
angles. The grooves which are observed in polished specimens represent an
intermediate state in the approach to equilibrium.



No measurements have yet been made of the angle γ at the bottom of grooves,
but the photomicrographs indicated that it may be small enough to mean that the 
free energy of the boundary is comparable with those of the crystal surfaces. While 
this in itself is surprising, it is possible that the free energies of the crystal surfaces 
have been reduced from those of clean surfaces by the presence of adsorbed gas.
It is therefore suggested that quantitative measurements would have little 
significance except where made on perfectly clean surfaces. It is possible that 
grooves might not appear on such surfaces. 

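[Figure 1. Surface-tension forces acting at the line where a grain boundary meets the surface; the space above the surface is vapour.]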

The consideration of the boundary as a seat of surface-tension forces should be 
significant in cases where grain growth (as distinct from recrystallization) occurs. 
The changes produced by the grain growth must reduce the free energy of the 
system, and the only manner in which this can be brought about is by reduction 
of the grain-boundary energy. The conditions for equilibrium at the intersection
of the three grain boundaries are similar to those for the intersection of the boundary
with the surface, i.e. the angles between the boundaries are governed by the
relationship $T_{AB}/\sin\gamma = T_{BC}/\sin\alpha = T_{CA}/\sin\beta$, where $T_{AB}$ is the surface tension
of the boundary between grains A and B and $\gamma$ is the angle made by grain C, etc.
If the free energy per unit area of each boundary were a constant for that boundary 
and independent of its direction, the boundary area should for equilibrium be 
a minimum and the boundaries should be surfaces of zero curvature. A metastable 
state would then exist when the boundaries met at the correct angles and were 
surfaces of zero curvature. 

Migration of the boundaries should occur in an endeavour to satisfy these
conditions. However, this cannot be checked from the grain growth observed in these
experiments as the surface of the specimen intersects the boundary network at an
arbitrary angle. Suitable information could be obtained by using specimens in 
which the recrystallized grain size was larger than the thickness of the sheet, so that 
the boundaries were perpendicular to the surface.

(b) Possible mechanisms for the formation of boundary grooves

When the silver is heated in the presence of an inert gas or in a vacuum the 
boundary grooves appear on the surface. Possible mechanisms for the transfer of 
silver from one portion of the surface to another are: slip, evaporation and
condensation of silver in atomic form, surface mobility of silver atoms or ions. We
shall try to estimate the relative importance of these possible mechanisms. 

Slip is the normal process by which metallic crystals are plastically deformed, and 
effectively it is the slipping of whole atomic planes over each other in particular 
crystallographic directions. In view of the large forces necessary it is extremely 
doubtful whether the surface-tension forces are sufficient to cause any appreciable 
slip deformation even at high temperatures. 

As suggested by Rosenhain & Ewen (1912), boundary grooves could be produced 
by the preferential evaporation of grain boundary metal, and the fact that they 
and also Fonda (1923) found that small grained metals showed a greater rate of 
evaporation in vacuum seems, at first sight, strong evidence for this. However, 
a rough calculation shows that this explanation is not adequate. To estimate the 
excess rate of evaporation of boundary material it will be supposed that the 
crystals are separated by a layer of super-cooled liquid. At a temperature T° K 
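near the melting point $T_M$° K, integration of the Clausius-Clapeyron equation gives
for the ratio of the vapour pressure of the super-cooled liquid to that of the solid

$$\frac{p_L}{p_S} = \exp\left[\frac{F}{R}\left(\frac{1}{T} - \frac{1}{T_M}\right)\right],$$

where F is the heat of fusion per mole and R is the gas constant.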

If the condensation coefficients of liquid and solid are the same, the rates of 
evaporation will be almost in the same ratio. Even at temperatures appreciably 
below the melting point this ratio is not much greater than unity; for silver at
900° C, for example, the ratio is 1.06. Crude as this calculation is, it shows that the
excess vapour pressure of the boundary material is not sufficient to explain the 
greater rate of evaporation when a small-grained metal is heated in a vacuum. 
Rosenhain & Ewen found that for a small-grained metal the loss of weight was 
50% greater than for a large-grained metal. For a grain diameter of $10^{-2}$ cm. and
a boundary of thickness $10^{-7}$ cm. the boundary would occupy but one fifty-thousandth
of the surface area, necessitating a vapour-pressure ratio of about
$2 \times 10^{4}$ to explain the results on these lines. The figure $10^{-7}$ cm. for the boundary
thickness, while adequate for Kê's interpretation (1947) of the anelastic properties
of aluminium, is probably too small, but it would need to be considerably increased 
before the increased evaporation of small-grained material could be explained. 
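
The two estimates in this paragraph can be checked with a few lines of arithmetic. The sketch below is illustrative only: it assumes handbook values for silver that are not quoted in the text (a heat of fusion of about 11.3 kJ/mol and a melting point of about 1235 K), and it assumes square grains, so that a boundary of thickness t around grains of diameter d covers a fraction 2t/d of the surface. The function names are hypothetical.

```python
import math

R = 8.314  # gas constant, J/(mol K)


def liquid_to_solid_pressure_ratio(T, T_m, F):
    """Ratio p_liquid/p_solid from the integrated Clausius-Clapeyron equation."""
    return math.exp(F * (T_m - T) / (R * T * T_m))


def boundary_area_fraction(grain_diameter, boundary_thickness):
    """Fraction of the surface occupied by boundaries, assuming square grains."""
    return 2.0 * boundary_thickness / grain_diameter


# Assumed handbook values for silver (not taken from the paper).
T_M_SILVER = 1234.9  # melting point, K
F_SILVER = 11.3e3    # heat of fusion, J/mol

ratio = liquid_to_solid_pressure_ratio(900.0 + 273.15, T_M_SILVER, F_SILVER)
fraction = boundary_area_fraction(1e-2, 1e-7)  # lengths in cm

# If the boundaries alone had to supply a 50 % excess loss of weight,
# their evaporation rate would have to exceed that of the grains by:
required_ratio = 1.0 + 0.5 / fraction

print(f"vapour-pressure ratio at 900 C: {ratio:.3f}")
print(f"boundary area fraction: 1/{1.0 / fraction:.0f}")
print(f"required evaporation ratio: {required_ratio:.1e}")
```

With these assumed inputs the vapour-pressure ratio comes out close to the value of 1.06 quoted for 900° C, whereas the ratio required to account for the observed loss of weight is of the order of $10^{4}$.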

Rosenhain & Ewen did not believe that the excess vapour pressure of the 
amorphous layer accounted for all the enhanced loss of weight, but thought that
some was due to evaporation from the walls of the grooves once they had formed.
However, this cannot be; if we assume that the condensation coefficient of metallic 
atoms is unity, some of the atoms evaporating from one part of the groove condense
on the opposite side. Direct application of Knudsen's Cosine Law shows that the 
nett evaporation from a groove is only equal to that which would occur from the 
plane surface occupying the neck of the groove. Nor does this small difference in 
the vapour pressures seem to be sufficient to explain the rapid formation of grooves 
which reach a depth of about $10^{-3}$ cm. in a few hours.
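
The statement that a groove loses no more material than its neck can be illustrated by a small Monte Carlo experiment. The sketch below is not the authors' calculation. It assumes an infinitely long V-shaped groove treated in cross-section, a condensation coefficient of unity, and cosine-law emission (for which the sine of the emission angle in the cross-sectional plane is uniformly distributed). The function name `escape_fraction` and the groove geometry are illustrative choices.

```python
import math
import random


def escape_fraction(half_angle, n_samples=200_000, seed=0):
    """Fraction of atoms leaving one wall of a V-groove that escape through the neck.

    The groove has its apex at the origin and two walls of unit length, each
    inclined at `half_angle` (radians) to the vertical. Atoms that strike the
    opposite wall are taken to condense there.
    """
    rng = random.Random(seed)
    sin_a, cos_a = math.sin(half_angle), math.cos(half_angle)
    u_left = (-sin_a, cos_a)   # direction of the left wall from the apex
    u_right = (sin_a, cos_a)   # direction of the right wall from the apex
    n_left = (cos_a, sin_a)    # normal to the left wall, pointing into the groove
    escaped = 0
    for _ in range(n_samples):
        s = rng.random()                       # emission point along the left wall
        px, py = s * u_left[0], s * u_left[1]
        sin_t = rng.uniform(-1.0, 1.0)         # cosine law: sin(theta) is uniform
        cos_t = math.sqrt(1.0 - sin_t * sin_t)
        dx = cos_t * n_left[0] + sin_t * u_left[0]
        dy = cos_t * n_left[1] + sin_t * u_left[1]
        # Solve P + lam * d = mu * u_right for the path length lam and the
        # position mu along the right wall.
        denom = dx * u_right[1] - dy * u_right[0]
        hit = False
        if abs(denom) > 1e-12:
            lam = -(px * u_right[1] - py * u_right[0]) / denom
            mu = (px * dy - py * dx) / (-denom)
            hit = lam > 0.0 and 0.0 <= mu <= 1.0
        if not hit:
            escaped += 1
    return escaped / n_samples


if __name__ == "__main__":
    for degrees in (15, 30, 60):
        alpha = math.radians(degrees)
        print(degrees, round(escape_fraction(alpha), 3), round(math.sin(alpha), 3))
```

For a half-angle α the escaping fraction should approach sin α. Two walls of unit length then lose 2 sin α times the loss per unit length of plane surface, and 2 sin α is the width of the neck, which is the result stated above.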

While it is maintained that Rosenhain & Ewen’s and Fonda’s observations of 
the enhanced loss of weight in small-grained material cannot be explained by the 
evaporation of an amorphous grain-boundary cement, the experimental fact cannot 
be gainsaid and further work to explain it is contemplated. 

Surface mobility is probably a more important mechanism in the formation of 
the boundary grooves. Frenkel (1945) has shown that a necessary preliminary to 
evaporation is the presence of mobile atoms on the surface. Because the activation 
energy necessary for an atom to become mobile is less for surface migration than 
for evaporation followed by condensation, it is to be expected that the surface 
migration of atoms or ions will be a more important mechanism for the achievement 
of equilibrium than evaporation. Frenkel (1945) has developed a detailed theory 
for the change of shape caused by surface-tension forces which involves superimposing
drift velocities in particular directions upon the random motion of atoms.

Andrade & Martindale (1935) have shown that surface migration occurs in silver 
on heating to temperatures as low as 280° C. They sputtered silver films 30 to 100
atoms thick on glass and on heating them in a vacuum of $10^{-2}$ mm. found that the
film thickened in some places and flat crystals with {111} faces developed leaving
the silver film thinner but still continuous on the remainder of the surface. In view
of Johnson's discovery (1938) that the surface structure of tungsten filaments
depends upon whether they are heated by A.C. or D.C. and Mrowca’s (1943) similar 
observations on tantalum, it is probable that while surface migration is the most 
important mechanism for the formation of the boundary grooves it is the migration 
of positive ions not neutral atoms as considered by Frenkel (1945). 

The mechanism of surface migration accounts for the fact that when the grain 
growth occurs the boundary grooves always coincide with the instantaneous 
positions of the boundaries and scars seldom remain in the old positions. After the 
grain boundary has moved away from a particular point of intersection with the 
surface the free energy condition for the existence of the groove no longer holds
and the equilibrium condition for that particular portion of the surface is that it 
should be smooth or striated. The ions then migrate over the surface and fill in 
the groove. If an artificial groove is made by scratching the surface, the same 
effect occurs; when the metal is heated, the fine structure of the scratch disappears 
in about 20 min. at 850° C. The occasional appearance of scars corresponding to
previous positions of the grain boundaries when grain growth occurs is explained
by the fact that if for any reason the boundary halts for some time in one position
the groove will get deeper with time and when the boundary moves away from that
position the grooves take longer to fill in. If the grooves were due to preferential
evaporation of grain-boundary material, then during grain growth material would 
be lost more or less evenly over the surface of the growing grain and there would 
be a step rather than a groove at the grain boundary. 

The complete reversibility of the processes by which the striations appear and 
disappear when the silver is alternately heated in oxygen and nitrogen suggests 
that the striations exist in equilibrium with an atmosphere containing oxygen and 
are not merely the result of a particular mechanism of evaporation or oxidation. 
It is supposed that the presence of the oxygen modifies the equilibrium configuration
of the silver surface, so that a stepped structure in which specific crystallographic
planes are developed has less energy than a smooth surface of less area
but at arbitrary orientation. If this is so, the free energy per unit area of the specific 
planes developed must be significantly smaller in an environment of oxygen than 
that of other planes. 

That, for an ionic crystal, the free energy per unit area of a crystal face depends
on its crystallographic direction was shown by Lennard-Jones & Taylor (1925), who 
found by calculation that for rock salt the surface energy of a {110} face is almost 
four times the surface energy of a {100} face, so that in equilibrium the {100} face 
would be expected to develop preferentially. Lukirsky (1945) found that on heating 
a spherical crystal of rock salt it developed crystallographic facets and changed its 
shape to that of a polyhedron. 
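
The energy balance behind this argument can be sketched numerically. The example below is an illustration under stated assumptions, not a calculation from the paper: it considers a two-dimensional section in which unit length of a flat surface is replaced by two facets inclined at angles `theta1` and `theta2` to it, and it takes the round ratio of 4 to stand for the "almost four times" quoted for rock salt. The function names are hypothetical.

```python
import math


def facet_lengths(theta1, theta2):
    """Lengths of the two facets that replace unit length of flat surface.

    The facets and the flat surface form a triangle with base angles
    theta1 and theta2 (radians); the lengths follow from the law of sines.
    """
    s = math.sin(theta1 + theta2)
    return math.sin(theta2) / s, math.sin(theta1) / s


def faceting_lowers_energy(gamma_flat, gamma1, gamma2, theta1, theta2):
    """True if the stepped surface has less energy than the flat one."""
    l1, l2 = facet_lengths(theta1, theta2)
    return gamma1 * l1 + gamma2 * l2 < gamma_flat


# A {110} surface of rock salt replaced by {100} facets, each at 45 degrees.
angle = math.radians(45.0)
l1, l2 = facet_lengths(angle, angle)
favoured = faceting_lowers_energy(4.0, 1.0, 1.0, angle, angle)

print(f"area increase factor: {l1 + l2:.3f}")
print(f"stepped surface favoured: {favoured}")
```

The stepped surface has about 1.41 times the area of the flat one, so it is favoured whenever the flat surface has more than about 1.41 times the specific energy of the facets. For silver in an inert atmosphere the absence of striations suggests that no such margin exists.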

Since in inert atmospheres no striations appear, it may be assumed that any 
difference in the free energies per unit area of different crystallographic planes is too 
small to cause the formation of striations with the consequent increase of area. 
In oxygen, however, the relative values of free energies per unit area must be 
considerably modified. The possible interactions between the silver and the oxygen 
are: solution of oxygen in silver, the formation of an adsorbed monolayer on the 
surface of the silver, and the existence of silver oxide molecules in the vapour phase. 
Because the amount of oxygen dissolved in silver is small, increasing only to about 
2 atomic per cent at the melting point (Steacie & Johnson, 1926), and because the 
partial pressure of silver oxide in the vapour phase will be small, it is probable that 
the most important effect is the preferential adsorption of oxygen on different faces. 
Direct experiments on the heterogeneous equilibrium of silver and oxygen are not 
sufficiently sensitive to show the presence of molecular oxide in the vapour phase; 
it must therefore be small compared with the equilibrium pressure of oxygen, but 
it is probably a number of times greater than the partial pressure of atomic silver 
vapour. Its existence may, however, be inferred from the enhanced evaporation 
of silver in the presence of oxygen, it being supposed that silver evaporates in the 
form of molecular oxide. 

When silver is heated in air or oxygen at atmospheric pressure it will evaporate 
in the forms of atomic silver and molecular oxide. As for evaporation in the presence
of an inert atmosphere, it will be assumed that most of the evaporating atoms and
molecules are reflected and recondense on the surface so that the silver exists in
equilibrium with its surroundings. Under these conditions, by the principle of 
detailed balancing, each process of evaporation will be balanced by the condensation
of an equal amount of the same constituent.

In a consideration of the mechanisms whereby the striated equilibrium-surface
configuration may be approached in an atmosphere of oxygen all three mechanisms 
—slip, atomic evaporation and condensation, and surface migration, which may 
contribute to the formation of boundary grooves, and in addition the evaporation 
and condensation of molecular silver oxide—must be considered. In so far as the 
evaporation of molecular silver oxide tends to expose certain crystallographic
planes it may be interpreted as the preferential attack of those planes by the 
oxygen, but this is not the only mechanism as was suggested by Leroux & Raub. 
There must be redistribution of the silver by surface migration, evaporation and 
subsequent condensation of silver atoms and the condensation of oxide from the 
vapour phase. It is impossible to estimate the relative importance of the contributions
of these mechanisms to the attainment of the final surface configuration,
but that the transfer of material without the formation of the oxide is an important 
mechanism is clearly demonstrated by the fact that the striations disappear without 
appreciable loss of weight when the specimen is heated in nitrogen. 

The authors wish to record their appreciation of the assistance of Professor 
F. Simon, F.R.S., Dr W. M. Jones and Mr A. F. Brown in the electron microscope 
examination of some silver surfaces and of Mr C. G. Earley in much of the microscopy.
This work forms part of a general investigation carried on at the Royal
Aircraft Establishment and the authors are indebted to the Ministry of Supply for 
permission to publish. 

Appendix 

A furnace for high-temperature microscopy 

The requirements set in the design of a furnace for use in the microscopic 
examination of specimens at high temperatures were that it could be used with 
a microscope objective having a useful magnification up to 260; that fairly large
areas of the specimen could be scanned by movement of the specimen relative to 
the objective; and that the specimens could be examined in various atmospheres 
and at temperatures up to the melting point of silver. Furnaces for high-temperature 
photomicrography have been described by Esser & Cornelius (1933) and by Swinden,
Howie & Chesters (1939), but neither meets all these requirements.

The limiting feature in the design of such a furnace is the working distance of the 
objective. For objectives with the necessary numerical aperture for the magnifications
required, the distance between the hot specimen and the objective, which
has to remain cool, is small and this necessitates very efficient insulation of the 
specimen and cooling of the objective. For the highest magnifications it was 
decided to use a 10 mm. achromatic objective with 0.28 numerical aperture and
a working distance of 7 mm.; this has a maximum useful magnification of about
250. In view of the short working distance it was decided to use gas or a vacuum
for thermal insulation rather than a porous refractory, and reduce radiation losses
by silver plating all hot surfaces. The furnace finally constructed is shown in figure 9.

Figure 9. Furnace for high-temperature microscopy.

The base of the furnace, made of mild steel, stands on three adjustable legs on the
specimen support of a Vickers projection microscope. At the centre of the base is
a glass window consisting of a microscope cover glass which is sealed in position 
by means of I.C.I. high-temperature optical cement. The specimen—a 2 cm. square 
piece of 8 s.w.g. silver—is supported above and about 4 mm. away from the window, 
on a stand made of heat-resisting steel. The specimen rests over a circular hole in 
the centre of the stand. The stand, which consists of a square plate with three 
radiating arms, is supported by adjustable screws at the ends of the arms; the screws 
have spherical ends and rest in grooves in the base of the furnace. Above the 
specimen is placed a sheet of silica ^ in. thick and on this rests the heater which 
consists of nichrome tape (0.1 × 0.002 in.) wound on a similar silica sheet. Heater
and specimen are enclosed in a box made of heat-resisting steel. The whole is 
enclosed by a cylindrical cap of mild steel which is provided with a flange and may 
be bolted to the base to form, with a synthetic rubber gasket, a gas-tight enclosure. 
Inlet and outlet tubes are supplied in the cap for the atmosphere required for the 
test. The specimen is drilled so that a thermocouple may be inserted at its centre, 
and the thermocouple leads pass out through the wall of the heat-resisting steel 
hot-box by means of alumina insulators, as do the heater wires. Both thermocouple 
leads and heater leads pass out of the furnace through silvered porcelain insulators 
which are soldered into the base. The thermocouples are waxed into the insulators 
and the heater current is carried in copper wires which are soldered to the insulators.
All the portions of the hot-box are silvered to reduce radiation losses and the 
supporting arms and screws are thin to reduce conduction. The upper surface of 
the mild steel base is silver plated and polished to reflect radiant heat. 

Below the mild-steel base is the cooling collar which rests on the microscope 
specimen support. The legs of the furnace base are adjusted so that there is 
a distance of about 1 mm. between the collar and the base. The microscope objective 
lies in the central space of the cooling collar with its front lens a little below the 
level of its top surface. Compressed air is supplied at an initial pressure of 10 lb./sq.in.
through the two inlet tubes and allowed to expand into the annular space inside 
the collar. After expanding in this space it flows on to the surface of the lens and 
thence radially outwards between the collar and the furnace base, thus cooling both 
objective and glass window. Experiments using a dummy objective carrying 
a thermocouple showed that the temperature of the front lens did not rise more 
than two or three degrees above room temperature. 

It has been found that a specimen temperature of 920° C can be maintained with
a power consumption of 300 W. Because of the small heat capacity of the furnace
it is not possible to regulate the temperature with a commercial controller. The
current is therefore supplied from a constant-voltage transformer and this enables
the temperature to be controlled by hand to within 2° C without too frequent
attention. When setting up the furnace, the height of the base is first adjusted to 
give the required clearance between it and the collar, then with the objective front 
lens just below the top surface of the collar the specimen is levelled and its height 
adjusted by means of the screw supports of the hot-box. The specimen is set a little
below the correct height for focus in order to allow for expansion when the apparatus
heats up. Afterwards, when the cap is on and the furnace running, focusing is
achieved by means of the microscope fine adjustment which moves the objective.
Difficulty was experienced with a form of contamination which has not yet been 
fully explained. It was found that after a period of heating in the furnace striations 
gradually disappeared, the process spreading slowly over the surface. It was found 
by auxiliary tests that this could be caused by contamination of the surface by 
some component of the heat-resisting steel. Although this has been reduced by the 
silver plating on the steel, investigations still have to be limited to 3 hr. The thermocouple
and heater wires also form possible sources of contamination. This effect is
being investigated in detail. 

References 

Adam, N. K. 1941 The Physics and Chemistry of Surfaces, p. 2. Oxford University Press.
Andrade, E. N. da C. & Martindale, J. G. 1935 Phil. Trans. A, 235, 69.
Benton, A. F. & Drake, L. C. 1932 J. Amer. Chem. Soc. 54, 2186.
Benton, A. F. & Drake, L. C. 1934 J. Amer. Chem. Soc. 56, 255.
Boas, W. & Honeycombe, R. W. K. 1946 Proc. Roy. Soc. A, 186, 57.
Boas, W. & Honeycombe, R. W. K. 1947 Proc. Roy. Soc. A, 188, 427.
Carpenter, H. C. H. & Elam, C. F. 1920 J. Inst. Metals, 24, 83.
Chaston, J. C. 1945 J. Inst. Metals, 71, 23.
Day, M. J. & Austin, J. B. 1940 Trans. Amer. Soc. Metals, 28, 354.
Elam, C. F. 1936 Trans. Faraday Soc. 32, 1604.
Esser, H. & Cornelius, H. 1933 Metals and Alloys, 4, 119.
Ewing, J. A. & Rosenhain, W. 1900 Phil. Trans. A, 193, 353.
Fonda, G. R. 1923 Phys. Rev. 21, 343.
Frenkel, J. 1945 J. Phys. U.S.S.R. 9, 392.
Graf, L. 1942 Z. Electrochem. 48, 181.
Guggenheim, E. A. 1933 Modern thermodynamics, p. 160. London: Methuen.
Gwathmey, A. T. & Benton, A. F. 1940a J. Chem. Phys. 8, 431.
Gwathmey, A. T. & Benton, A. F. 1940b J. Chem. Phys. 8, 569.
Gwathmey, A. T. & Benton, A. F. 1942 J. Phys. Chem. 46, 969.
Hemingway, E. H. & Ensminger, G. R. 1922 Trans. Amer. Inst. Min. (met.) Engrs, 67, 392.
Humfrey, J. C. W. 1912 Carnegie Schol. Mem. 4, 80.
Johnson, R. P. 1938 Phys. Rev. 54, 459.
Kê, T. S. 1947 Phys. Rev. 71, 544.
Keyes, F. G. & Hara, R. 1922 J. Amer. Chem. Soc. 44, 479.
Kroll, W. J. 1912 Carnegie Schol. Mem. 4, 8.
Lennard-Jones, J. E. & Taylor, P. A. 1925 Proc. Roy. Soc. A, 109, 476.
Leroux, J. A. A. & Raub, E. 1930 Z. anorg. allg. Chem. 188, 205.
Lewis, G. N. 1906 J. Amer. Chem. Soc. 28, 139.
Lukirsky, P. I. 1945 C.R. (Doklady) Acad. Sci. U.S.S.R. 46, 274.
Miller, O. O. & Day, M. J. 1942 Trans. Amer. Soc. Metals, 30, 541.
Mrowca, B. A. 1943 J. Appl. Phys. 14, 684.
Parker, E. R. & Smoluchowski, R. 1945 Trans. Amer. Soc. Metals, 35, 362.
Rawdon, H. S. & Berglund, T. 1928 Sci. Pap. Bur. Stand. no. 571, 649.
Rawdon, H. S. & Scott, H. 1922 Trans. Amer. Inst. Min. (met.) Engrs, 67, 414.
Rosenhain, W. & Ewen, D. 1912 J. Inst. Metals, 8, 149.
Rosenhain, W. & Humfrey, J. C. W. 1910 Proc. Roy. Soc. A, 83, 200.
Shuttleworth, R., King, R. & Chalmers, B. 1946 Nature, 158, 482.
Shuttleworth, R., King, R. & Chalmers, B. 1947 Metal Treatment, 14, 161.
Steacie, E. W. R. & Johnson, F. M. G. 1926 Proc. Roy. Soc. A, 112, 542.
Swinden, T., Howie, T. W. & Chesters, J. H. 1939 Trans. Brit. Ceram. Soc. 38, 343.





Chalmers et al., Proc. Roy. Soc. A, volume 193, plates 10 to 12.

Description of Plates 9 to 12

Plate 9

Figure 2. Silver heated in air for 11 hr. at different temperatures. (a) 300° C. (b) 500° C.
(c) 700° C. (d) 850° C. (e) 940° C. (× 250.)

Figure 3. Silver heated in air for 11 hr. at different temperatures. (a) 500° C. (b) 850° C.
(c) 840° C. (× 1500.)

Plate 10

Figure 4. Silver heated in different atmospheres at 920° C. (a) 11 hr. in air. (b) 11 hr. in oxygen.
(c) 11 hr. in nitrogen. (d) 1½ hr. in a vacuum. (× 250.)

Figure 5. Some etch patterns obtained on silver heated in oxygen for 11 hr. at 920° C. (× 2000.)

Plate 11

Figure 6. The reversibility of striations. Silver heated at 920° C. (a) 11 hr. in nitrogen. (b) Same
field after 1 hr. in air. (c) After a further 1 hr. in nitrogen. (d) After a further 2 hr. in
nitrogen. (e) After a further 8 hr. in nitrogen. (f) After a further 1 hr. in air. (× 250.)

Figure 7. The determination of the planes causing the striations. (a) Silver showing striations
produced by heating in air. (b) Silver showing slip lines due to deformation after the etching
striations had been removed by heating in nitrogen. (× 250.)

Plate 12

Figure 8. High-temperature microscopy. Photomicrographs of a silver specimen taken while
the specimen temperature was raised to 920° C and maintained at that temperature in air.
Same field throughout. (× 100.) (a) After 6 min. temperature up to 575° C. (b) After 9 min.
temperature up to 670° C. (c) After 19 min. temperature up to 850° C. (d) After 25 min.
temperature up to 900° C. (e) After 40 min. temperature up to 917° C. (f) After 50 min.
temperature up to 927° C. (g) After 60 min. temperature at 920° C. (h) After 70 min.
temperature at 920° C. (i) After 90 min. temperature at 920° C.





The theory of the galvanomagnetic and thermomagnetic effects in metals

By E. H. Sondheimer, Trinity College, University of Cambridge 


(Communicated by A. H. Wilson , F.R.S.—Received 27 August 1947— 
Revised 8 January 1948) 

The methods of a previous paper are used to discuss the effect of a magnetic field on the 
thermoelectric power of a metal containing two overlapping energy bands of normal form. Exact 
solutions of the transport equation are obtained for the three limiting cases of high temperatures,
low temperatures and very strong magnetic fields, and it is shown that the formulae can be 
generalized to give approximate expressions for all temperatures and all fields. The magnetic 
change of the thermoelectric power is found to be small at very low and high temperatures, 
and to pass through a maximum at intermediate temperatures. 

The transverse galvano- and thermomagnetic effects are also considered, and the formulae 
which hold for free electrons are generalized so as to be approximately valid for all temperatures. 
For free electrons, the Hall coefficient remains constant as the temperature decreases, the 
Righi-Leduc coefficient increases, and the Ettingshausen and Ettingshausen-Nernst
coefficients decrease and change sign at very low temperatures. The corresponding formulae 
for a metal containing two bands are also obtained, and are used to show that the theoretical 
predictions for free electrons cannot hold for real metals except in special cases. Finally, the 
two-band model is used to discuss the effect of the magnitude of the magnetic field on the 
coefficients of the transverse effects. 


1. Introduction 

1.1. The present paper is a sequel to a previous one (Sondheimer & Wilson 1947;
subsequently referred to as I*), and continues the investigation by the methods of
the modern theory of metals of the various effects which occur when a metallic
conductor carrying an electrical or thermal current is placed in a magnetic field.
The so-called galvanomagnetic effects arise when the primary current is electrical,
and the thermomagnetic effects when it is thermal. The effects can also be subdivided
into ‘transverse effects in a transverse magnetic field’, where the primary
current and the electric field or temperature gradient of interest are at right angles
to each other and to the magnetic field, ‘longitudinal effects in a transverse magnetic
field’, where they are parallel to each other but at right angles to the magnetic field,
and ‘longitudinal effects in a longitudinal magnetic field’, where they are parallel
to each other and to the magnetic field. The second and third groups of phenomena
include the magneto-resistance effects which have been discussed in I and the
influence of a magnetic field on the thermoelectric power which will be considered
in the present paper. The most important examples in the first group, also to be
discussed below, are the four effects which are often referred to simply as the
‘transverse galvano- and thermomagnetic effects’. These effects are usually described
by certain coefficients which may be defined by considering a metal strip with its
surface in the $xy$-plane, carrying an electric current $J_x$ or a thermal current $w_x$ in
the $x$-direction and subjected to a magnetic field $H$ in the $z$-direction. The following
cases are then of practical importance:

If $J_x \neq 0$ but the transverse current $J_y = 0$, there exists a transverse electric field
$\mathcal{E}_y$. The ratio $\mathcal{E}_y/HJ_x$ is known as the Hall coefficient $A_H$.

If $J_x \neq 0$ and $w_y = 0$, there exists a transverse temperature gradient $\partial T/\partial y$, and
the ratio $(-\partial T/\partial y)/HJ_x$ is known as the Ettingshausen coefficient $A_E$.

If $\partial T/\partial x \neq 0$ and $J_y = 0$, then $\mathcal{E}_y \neq 0$ and the ratio $\mathcal{E}_y/(-H\,\partial T/\partial x)$ is known as
the Ettingshausen-Nernst coefficient $B_{EN}$. The experimental values are also sometimes
expressed in terms of the ratio $\mathcal{E}_y/Hw_x$, which is denoted by $A_{EN}$. Clearly
$A_{EN} = B_{EN}/\kappa$, where $\kappa$ is the thermal conductivity.

If $\partial T/\partial x \neq 0$ and $w_y = 0$, then $\partial T/\partial y \neq 0$ and the ratio $(\partial T/\partial y)/(H\,\partial T/\partial x)$ is known
as the Righi-Leduc coefficient $B_{RL}$. An alternative coefficient is defined by

$$A_{RL} = (-\partial T/\partial y)/Hw_x = B_{RL}/\kappa.$$

In many cases the thermal conditions in the $y$-direction are not specified, and it
is then necessary to distinguish between ‘adiabatic’ conditions where $w_y = 0$ and
‘isothermal’ conditions where $\partial T/\partial y = 0$. The consequences of this distinction are
discussed in more detail in § 2.

* There is an error in equation (70) of paper I. The index of the last factor on the
right-hand side should be −2 instead of −1.
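
The relations between the A and B forms of the Ettingshausen-Nernst and Righi-Leduc coefficients follow in one line. The step below is a sketch which assumes that the primary heat current obeys $w_x = -\kappa\,\partial T/\partial x$, with $\kappa$ the thermal conductivity; this assumption is made here for illustration and the symbols are those of the definitions above.

```latex
\begin{align*}
A_{EN} &= \frac{\mathcal{E}_y}{H w_x}
        = \frac{\mathcal{E}_y}{-H\kappa\,\partial T/\partial x}
        = \frac{B_{EN}}{\kappa}, \\
A_{RL} &= \frac{-\partial T/\partial y}{H w_x}
        = \frac{\partial T/\partial y}{H\kappa\,\partial T/\partial x}
        = \frac{B_{RL}}{\kappa}.
\end{align*}
```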

The calculations can be completed and the results compared with experiment 
only if the metal is represented by a sufficiently simple model. In the simplest 
possible case the electrons are assumed to be quasi-free, the energy being proportional
to the square of the wave-vector. The results for this model have so far only
been worked out for the case of high temperatures (Sommerfeld & Frank 1931; Kohler
1941 a), where it gives the correct order of magnitude (disregarding the sign) of the 
four transverse effects in the limit of vanishingly small magnetic field. In the present 
paper the theory is extended to cover the case of arbitrary temperature. The free 
electron model, however, fails entirely to account for the magneto-resistance effects, 
which are zero for a model in which the surfaces of constant energy form a single
family of spheres (Peierls 1931). It has proved impossible so far to carry out the 
calculations even part of the way with a model which shows all the peculiarities 
that the energy surfaces are likely to have in real metals. It is therefore necessary 
to discuss a model which is more general than the free electron model and which
gives a non-zero result for the magneto-resistance effects, but which is sufficiently
tractable for the calculations to be completed. The only model so far proposed which 
fulfils these requirements postulates the existence of two overlapping energy bands, 
an s-band containing some conduction electrons and a d-band containing some 
unoccupied levels; in each band the energy is assumed to be proportional to the
square of the wave-vector. In I it has been shown that this model gives a satisfactory
account of the magneto-resistance effects which occur in a transverse field, over the
whole range of temperatures and magnetic fields. In the present paper the same
model is used to discuss the effect of a magnetic field on the thermoelectric power,
particularly at low temperatures where the effect is most pronounced but where the
theory is more difficult owing to the non-existence of a time of relaxation. It is not
necessary to consider the remaining thermoelectric effects, since they can all be
expressed in terms of the thermoelectric power (see, for example, Wilson 1936,
§§ 5.41 to 5.43). Also, as in I, the case of a longitudinal magnetic field has to be
excluded, since the two-band model is not sufficiently general to give a non-zero 
result for this case. 

The two-band model is further used to obtain formulae for the transverse effects 
in the limit of zero magnetic field. At best, the free electron model can only be 
expected to apply to the alkali metals, and it is therefore again necessary to investigate
more complicated models, so as to determine whether the theoretical
results obtained for free electrons can be expected to apply to metals in general,
and, if they do not apply, to suggest at least qualitatively how they can be generalized.
Finally, the two-band model is used to derive formulae for the field dependence
of the transverse effects. As in the case of the electrical and thermal conductivities,
the influence of a magnetic field on the coefficients of the transverse effects is determined
by the departure of the energy surfaces from spherical symmetry, and the
two-band model is therefore the simplest which can be expected to give even 
qualitatively correct results. 

1.2. The metals to which the two-band model can be directly applied and in
which the s- and d-bands have immediately obvious physical meanings consist of
the transition elements (excluding the ferromagnetic metals when transport effects 
are being considered). It was, however, pointed out in I, p. 448 that the model could 
be applied to other metals provided that the parameters were suitably defined. 
An expanded form of the argument in which the assumptions are more explicitly 
stated is as follows. 

The energy surfaces in k-space (k being the electron wave-vector) have a very 
complicated structure even for the simplest of metals such as the alkalis (see, for 
example, Sommerfeld & Bethe 1933, p. 400, figure 23). It is, however, possible to
give a formal theory of all the transport phenomena provided that the assumption
is made that a time of relaxation exists (e.g. Wilson 1936, chapter v). The various
physical quantities can then be expressed in terms of the time of relaxation $\tau$, the
energy $E$, the derivatives of $E$ with respect to k, and in some cases the derivatives
of $\tau$ with respect to k. Unfortunately, a time of relaxation does not exist for a general
model, and the only case in which the formal theory applies is when $E$ is a function
of |k| only, and even then only at high temperatures. To make any further progress
it is necessary to make simplifying assumptions, which are usually indicated by the 
structure of the expressions which occur in the formal theory. If one wishes to 
carry out the calculations completely (and this is essential if one wishes to discuss 
low temperature phenomena at all), it is necessary to choose the simplest possible 
model which contains the features which are shown to be essential in the formal
theory. For example, in the formal theory the electrical conductivity only depends
upon $\tau$ and the first derivatives of $E$; and in fact only the squares of the latter occur.
It is therefore unnecessary, if one restricts oneself to order of magnitude calculations,
to consider a model in which the first derivatives change sign. This is the justification
for the very wide use of the free electron model for discussing the electrical conduc¬ 
tivity of metals whether monovalent or not. In some ways it is better to use the 
two-band model for divalent metals since the meaning of the parameters becomes 
clearer, and it is possible to see whether, for example, the average effective mass of 
the electrons is the arithmetic or harmonic mean of the effective masses of the individual
electrons. If, however, the parameters in the more general models cannot
be distinguished by experiment from those for simpler models, it is best to choose 
the simplest model that gives a non-zero result. 
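
The remark that only the squares of the first derivatives of the energy enter the conductivity can be made explicit with the usual relaxation-time expression. The form below is quoted as a standard textbook result for illustration, not as an equation of the present paper; $f_0$ denotes the Fermi distribution function, $e$ the electronic charge and $\hbar$ Planck's constant divided by $2\pi$, all of which are notations assumed here.

```latex
\[
\sigma = \frac{e^{2}}{4\pi^{3}\hbar^{2}}
         \int \tau \left(\frac{\partial E}{\partial k_x}\right)^{2}
         \left(-\frac{\partial f_0}{\partial E}\right) d\mathbf{k}
\]
```

Since the integrand is unchanged when $\partial E/\partial k_x$ changes sign, a model in which the first derivatives change sign adds nothing to an order of magnitude estimate of $\sigma$.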

In general, the magnetic effects involve the second derivatives of the energy 
linearly, and the simple free electron model is inadequate even for the alkalis. The 
real energy surfaces have a complicated structure and consist of portions having
positive curvature and portions having negative curvature (but it must be remembered
that only those energy levels near the highest occupied level play a part).
The two-band model is the simplest model having this property, the s-band having 
positive and the d-band having negative curvature. It does not, however, have all 
the properties required. For example, in a real metal the relevant portions of the 
energy surface which have negative curvature may not form a continuous domain, 
nor will they necessarily possess a maximum energy level. But such factors as these 
do not affect the calculations at all, though they have a strong bearing on the 
physical interpretation of the parameters. In a transition element the d-band has 
an obvious physical significance, and the number of vacancies in it can be defined as 
the number of electrons required to fill the uncompleted inner shell, so that the 
two-band model is easy to interpret. In the case of a monovalent or divalent metal 
there is no inner shell to complete, but one can still consider the d-band of the two- 
band model to be a reasonable representation of those portions of the energy surfaces 
that have negative curvature.* However, it is not possible to determine the number 
of vacancies to be ascribed to the d-band in the same kind of way as for the transition 
elements, and in the present state of the theory one can only treat it as a parameter 
of the same order of magnitude as the number of atoms. If one had to assume that 
it was of a very different order of magnitude, the physical interpretation would 
obviously be at fault. 

It is possible to introduce further refinements into the two-band model to make it 
correspond more closely to reality. For example, one could introduce s-d transitions,
corresponding to transitions in which the initial and final states lie in the regions of
positive and negative curvature respectively. Such refinements, however, seem
unnecessary in the present state of development where the experimental data are
so inadequate.

* In a divalent metal the conduction bands are better labelled lower and upper bands to
avoid the implication that the symbols ‘s’ and ‘d’ have any physical significance. The conduction
bands cannot be strictly labelled s, p, d, ... bands, but are mixed bands. The first
and second bands are both partially filled with electrons, and the energy levels which determine
the conduction properties are primarily of negative curvature in the lower band and of
positive curvature in the upper band.

To sum up, the two-band model assumes that only derivatives of the energy up 
to the second are important. It is then the most general model required for isotropic 
metals, and the calculations can be carried out for all temperatures. The model 
could be generalized to deal with anisotropic metals, but the calculations would 
then have to be restricted at best to high temperatures where a time of relaxation 
exists, since it has not so far been possible to extend the theory to anisotropic metals 
at low temperatures. (It should be noted, however, that no proof has yet been given 
that a time of relaxation exists for anisotropic metals even at high temperatures.) 
The model breaks down (as do all models so far proposed) if derivatives of the energy 
of the third or higher orders are important, and in certain cases if the derivatives 
of τ have to be taken into account. According to this criterion the model must fail
for those galvanomagnetic and thermomagnetic effects which occur in a longitudinal 
magnetic field, but should be sufficient to deal with the order of magnitude of the 
effects which occur in a transverse field. If in the future it is found that there is a 
substantial difference in the behaviour of transition and multivalent metals as 
regards the effects discussed here, the conclusion would be that a much more 
elaborate theory of the magnetic effects is required than the one given here. But such 
evidence as exists does not suggest any such fundamental difference, and it is therefore
reasonable to proceed on the basis of the simplest possible assumptions. It is
very desirable, however, that some model should be investigated for which dτ/dE
behaves anomalously.

1.3. In § 2 the formal expressions for the various effects are obtained according
to the two-band model, and in § 3 expressions are given which, together with formulae 
already obtained in I, represent the complete solutions in the three limiting cases of 
very high and very low temperatures and very strong magnetic fields. 

It was shown in I that, although it is not possible to solve the transport equation 
in the general case, simple approximate ‘interpolation formulae’ can be set up for 
the magneto-resistance effects which reduce to the correct expressions in the three 
limiting cases for which the exact solutions are known. In § 4 the thermoelectric 
power is considered, and it is shown that it is again possible to generalize the exact 
solutions so that they can be used over the whole temperature range. To do this, 
simple interpolation formulae are set up for the thermoelectric power and the 
adiabatic Ettingshausen-Nernst coefficient of a metal containing a single band, 
corresponding to the known expressions for the electrical and thermal conductivities 
(Wilson 1937). These formulae make it possible to determine the generalized forms 
of all the quantities required for discussing the two-band model, satisfying the usual 
criterion of reducing to the exact expressions in the limiting cases of high temperatures,
low temperatures and strong magnetic fields. It is thus possible to
calculate general expressions of approximate validity for all the effects to be considered
in the present paper.



The change of the thermoelectric power in a magnetic field at various temperatures
has been investigated experimentally by Grüneisen & Erfling (1939) for beryllium.
The effect is found to be small at high and very low temperatures and to pass through
a maximum value at an intermediate temperature, the position of the maximum
moving towards higher temperatures as the magnetic field is increased. It is shown
in § 4.21 that the predictions of the two-band model are in general agreement with
this behaviour. 

The free electron model has been used by Sommerfeld & Frank (1931) and Kohler 
(1941 a) to discuss the transverse effects in the special case where it is legitimate to 
assume that a time of relaxation exists. The results for this case, which are only valid 
for high and extremely low temperatures, are summarized in § 5.1. The generalized
formulae which are approximately valid over the whole temperature range are given
in § 5.2. These formulae show that, for free electrons, A_H is negative and independent
of the temperature for all temperatures, B_RL is negative and increases in absolute
value as the temperature decreases, while B_EN and A_E are positive at high temperatures,
decrease as the temperature decreases, and at very low temperatures
reverse in sign, pass through a minimum value and eventually tend to zero
as T approaches zero. Similar results hold for a single d-band, except that A_H and
B_RL are positive instead of negative.

In § 5.3 the extent to which the two-band model modifies these results is investigated,
and it is shown that the theoretical predictions for free electrons cannot
be expected to apply to real metals except in special cases. In particular, the
two-band model allows A_H to vary with the temperature, and B_EN and A_E need no
longer become negative at very low temperatures. Further, the simple relations among
the coefficients which hold for free electrons are, in general, no longer valid for the
two-band model, and are therefore not generally true. However, the two-band model
fails to account for the negative values of B_EN and A_E which are observed in some
metals at high temperatures; the explanation of these anomalous signs would require 
a still more complicated model, with the property that the time of relaxation is a 
decreasing function of the wave number at the surface of the Fermi distribution. 

In § 6 the two-band model is used to derive formulae for the field dependence of
the transverse effects, and these are used to give a qualitative discussion of the
behaviour of the coefficients in a magnetic field of arbitrary magnitude. The Hall
coefficient has already been discussed by Jones (1936) in the case of high temperatures,
where a time of relaxation exists. It is shown in § 6 that a formula of the
same form can be used over the whole temperature range, and a similar formula is
obtained for B_RL. The behaviour of B_EN and A_E is, in general, complicated and only
some simple special cases are discussed; in particular, the form of the coefficients in 
the limit of very strong magnetic fields is considered. 

1.4. The experimental data on the transverse effects with which to compare the
theory are discussed in § 7. They are both inadequate and confusing. Results reported
by different workers differ by factors of up to 80 (compare Borelius 1935; Meissner
1935); it is very seldom that the various effects have been measured on the same
specimen and no attention seems to have been paid to choosing metals for which the 
theoretical interpretation would be relatively easy. The inconsistency of the results 
available is shown in table 1 infra , where data are given for a quantity which should 
by thermodynamic arguments be unity but which differs very considerably from it. 
If we disregard ferromagnetic metals and strongly anisotropic metals such as bismuth,
and discard results which differ by a factor of more than 2 when measured by two
different experimenters, there is very little experimental evidence left. Further,
there are no data at all for those metals in which the parameters for the separate s-
and d-bands can be estimated from other phenomena, so that it is only possible at
present to attempt to verify the theory as regards orders of magnitude, and even
then only by applying the principles laid down in § 1.2 supra and in I, p. 448 for
interpreting the parameters. 

The position is even worse for the temperature variation of the transverse effects 
than for the orders of magnitude of the effects at room temperatures since, except 
for the Hall effect, there are at present no data at all for any metals at low temperatures.
On the other hand, at ordinary temperatures (where some measurements
exist) the results of the present theory are in all essential respects identical with
those of the old treatments, and new effects are to be expected at low temperatures
only (see § 5). Further, at room temperatures and for normal metals the effect of the
magnitude of H on the coefficients is negligible, whereas at sufficiently low temperatures
the fields that are obtainable in practice are adequate to observe the
effects (discussed in § 6) which occur when the magnetic field is very strong. For a 
verification of the new predictions of the present theory as regards the temperature 
and field variation of the coefficients experiments at low temperatures are therefore 
essential, and further discussion must be postponed until such experimental evidence 
has become available. 


2. General formulae 


In the two-band model, which has been described in I, there are two overlapping 
partially filled bands of normal form, the s- and d-bands. The general expressions 
obtained on the basis of this model for the electrical and thermal currents in an 
arbitrary transverse magnetic field have been given in I (equations (13) to (16)),
and it has been shown there that these equations can be used to express all the
physical quantities of interest in terms of certain coefficients i^(n) which depend on
the magnetic field and on the s- and d-electron variables. Only the expressions for
the electrical and thermal conductivities σ and κ were deduced in I, and it is now
necessary to calculate the corresponding expressions for the thermoelectric power 
and the transverse effects. 


When a temperature gradient ∂T/∂x is maintained along a strip of metal but no
electric current is allowed to flow, a longitudinal electric field ℰ_x is set up, and if
this is written in the form g*, gy 



−ε being the electronic charge and ζ the Fermi energy level, then −𝔖/ε is known
as the absolute thermoelectric power per degree. It will be assumed, as in the case
of the thermal conductivity, that 𝔖 is measured under adiabatic conditions, the
thermal conditions in the y-direction being such that w_y = 0. Using this condition
in addition to J_x = J_y = 0, it is found by means of I, equations (13) to (18), that

_ + (Q 8 + Q d ) {(7,+%) (Y 9 W d ) (X 9 +jy} 

where k is Boltzmann’s constant, V, W, X, Y are certain combinations of the i^(n)’s
defined by I, equations (20) and (22), while

£*P - flP - £*4> - #> - £#>, (2) 

and the suffixes s and d refer to the s- and d-band respectively.

The coefficients of the transverse effects are calculated similarly. In the case of
the Hall effect, J_x ≠ 0, while J_y = 0 and the specimen is kept at uniform temperature,
so that ∂T/∂x = ∂T/∂y = 0. It is then found that


A 3 ^ W,-W d * _ 

“ e'H (V s + £)* + (HJ- W d )* ‘ W 

For the Ettingshausen effect J_x ≠ 0, while J_y = ∂T/∂x = w_y = 0, and

4 3^ 4 y (Q , + Q d ) ( V. t Vg) - (P. -P d )(W t - %) ... 

£ ~ e# {(K s +F rf )*~+(W;-lF d )*}(Z 8 +Z d ) • W 

The experimental conditions assumed here are never strictly fulfilled in practice,
since the condition w_y = 0 is at best only approximately valid, but the experimental
values are usually corrected for any heat lost to the surroundings (compare Meissner 
1935, p. 365). A further correction is necessary if there is any temperature difference 
between the ends of the specimen. 

The Ettingshausen-Nernst effect is measured with ∂T/∂x ≠ 0, while J_x = J_y = 0
and w_y = 0 (adiabatic conditions). Denoting the adiabatic coefficient by B^a_EN, it
is found that


(Q. + Qt) {(V, + v d ) (X, + X d ) - (W„ - W d ) (Y s - Y d )} 

- »T __ -(P.-PMr.+mY.-T4) + W-WMX.+X a )} 

* N ~ eH ~ {(K,+F - )* + W-W(X.+X < ) - 

(«) 

The experimental values must, however, differ from this quantity, since they are 
obtained without allowing for the thermoelectric field which is always present in 
the leads used to measure the transverse potential difference. A general discussion 
of this question has been given by Kohler (1941b) who has shown that, if certain
reasonable assumptions are made, the apparent value of the adiabatic coefficient
turns out to be identical with the isothermal coefficient. This result is readily shown
to follow also from the present equations, which are merely particular cases of
Kohler’s; the proof follows exactly the same lines as his and the details are left to
the reader. It will accordingly be assumed that the value of the Ettingshausen-Nernst
coefficient which is actually measured is obtained by using the condition
∂T/∂y = 0 instead of w_y = 0. Denoting this value by B_EN, it is found that

m - k 21 (fr+ Qa) (v*+v*) - P,-p*) W- w*) m 

KN ~ eH • (?.+£)• +(HJ-Hi)» ' ( ) 

Finally, for the Righi-Leduc effect ∂T/∂x ≠ 0, while J_x = J_y = 0, w_y = 0, and

£ i r.-fr m 

"KL- Jj Y Ty * l'/ 

XI ^ T 

The measured values have again to be corrected for any heat which is lost to the
surroundings.

3. Special solutions 


The quantities V, W, etc. which appear in equations (1) to (7) can be evaluated
explicitly in the three limiting cases of high temperature and arbitrary magnetic
field, low temperature and arbitrary magnetic field, and high magnetic field and
arbitrary temperature. The method of calculation has been described in I and the
resulting expressions for V, W, X, Y have been given there (equations (27), (28),
(42) to (48)), and it is now only necessary to supplement these results by giving the
corresponding expressions for P and Q. The calculations follow the same lines as
before, and only the results will be quoted here. For the details of the method and
for the notation the reader is referred to I, § 3.

3.1. High temperatures.

Q = ( 8 ) 

where V and W are given by I, equation ( 27 ). 


3.2. Low temperatures.*


CP = M2 mC* 


/ TO *»+/?*£) 
'(« a + W 


, m*/ f g*(« a -3/?*g) 
V©/ r B («* + /?*£)* 


D 

t 


(D’( ta+w) 


l 


m% 

ft*A’ 


ft? = M 2 ™£)* 


a(3a®+ /?*£) 


( 9 ) 


_1 / JYI * 2a /?g*(3 cfi-jP Q D(TV(tor* \ 2«/?g*(* a - 3/?*g) ) m% 

2\@j r 5 (cV+PC? *^7 (a 2 +W /ft®A 


{a*+m 

3.3. Strong magnetic fields


£P 


_ JJ 27 !? 

~ a* ("3 


(2mQ»/?£‘ + 


Qva 


IT ' 


ft? = 


/ft®A' 
( 10 ) 

( 11 ) 


* The reader is reminded that α = εH/ħc, that β is a constant determining the effect of the
impurities (in the notation of Wilson (1937), β = ( 2 m)* M/4h*), and that D and A are constants
determining the resistance of the pure metal.




4. The thermoelectric power 

4.1. Single-band model. At high temperatures a time of relaxation exists, and the
thermoelectric power in a magnetic field of arbitrary magnitude can readily be
calculated by means of equation (1) together with (8) and I (27), (28). The resulting
expression is complicated, and owing to its restricted validity it will not be considered 
further here. It is of greater interest to consider the possibility of setting up an 
interpolation formula, analogous to the expressions obtained in I, which can be used 
for all temperatures and magnetic fields. By analogy with the procedure in I,
§§ 4 and 6, such a formula would be expected to depend in some way on an interpolation
formula for the thermoelectric power of a metal containing a single band,
corresponding to I, equations (54) and (73) for the electrical and thermal conductivities.
No such formula has so far been given, and it is therefore necessary to begin
by supplying it here. The formula is obtained by writing down the exact expressions
for 𝔖 in the three limiting cases where the solution is known, and guessing a
generalized expression which reduces to the correct form in these limiting cases. 

At high temperatures it is found, using (1) (suppressing all terms referring to the
d-band and omitting the suffix s), together with (8), I (27), (28) and the expression
(Wilson 1936 , pp. 208, 223*) , w (2m)iDT 

1 " (I*) 


■ + 


for the time of relaxation, that 

<3 = 


m 4fc 2 f*A 0 

3 (2m)* DT 
ir*lt*T + 8 ft 3 A >£ 2 9 




1 + 


(2 


(13) 




-M 


(14) 


8/i»A/?e*0 

At low temperatures the corresponding expression is 

nWTj (2m)*D(T\'( f 1 / _ 

3£ [ + « , AyJ£*\e/ \ ^ 2n* S'<*.*+/}%, 

which is a remarkable result since it shows an explicit dependence of 𝔖 on the magnetic
field, due to the non-existence of a time of relaxation at low temperatures. This
question is considered further below. 

For an infinitely large magnetic field, finally, 


§ 


© 


n*k*T 


8 ( 2 m)» /TV 

1 + 4ff% 3 A/?d©/ 

(/s + fl 

(!) 

V*/ 5 -£/,)} 

3(2m)‘ (T\ 
1 + 4^ 3 A/?£\0j 

f] 

:*+?( 

TV 

sv 

■(r*-w)} 


(15) 


Now consider the expression 


© 


3(2m)» / 
nWT + 4*WA0C\ 

T\ 3 i 

&) I 

Ki 

(T\ 2 1 

1 , 0 / 1 

(-.*-*£338)} 

K , . 3(2m)« | 
+ 47r 3 ft 3 A/?£' 

T\*l 

@) i 

k + f< 

T)'i 

(2n * ? i a ^ s -( wtc Po)*\) 
( 3 ^H»+(necp 0 )»J j 


( 16 ) 


* Wilson’s notation differs considerably from the one used here. 




where n = (2mζ)^{3/2}/(3π²ħ³) is the number of electrons per unit volume, and where

1 _ . ZiPbD [T\* „ 

<r 0 = Po = ( 2 m)* e*£ + 2fi*g»A \0/ ( ' 

is the usual interpolation formula for the electrical resistivity of a metal containing 
a single band. It is seen at once that (16) reduces to (15) when H is infinite. At low 
temperatures all terms involving ^ and may be treated as small, and (14) is 
then obtained as the first approximation. At high temperatures, finally, 

(«-!) A-(@/^)’ 1-1 . 

and (13) is obtained on neglecting terms of order (Θ/T)² compared with the leading
term. Thus equation (16) may be adopted as the interpolation formula for the 
thermoelectric power of a metal containing a single band. 
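
The electron density entering these formulae is the ordinary free-electron expression n = (2mζ)^{3/2}/(3π²ħ³). The following sketch merely evaluates it numerically as a check on orders of magnitude. The function name, the use of SI units and the illustrative Fermi energy of 7 eV are assumptions made here and are not taken from the paper.

```python
import math

# SI constants (rounded CODATA values)
HBAR = 1.054571817e-34   # reduced Planck constant, J s
M_E = 9.1093837015e-31   # electron mass, kg
EV = 1.602176634e-19     # one electron-volt in joules


def electron_density(zeta_ev, mass=M_E):
    """Electrons per cubic metre for a free-electron band of Fermi energy zeta_ev (in eV)."""
    zeta = zeta_ev * EV
    return (2.0 * mass * zeta) ** 1.5 / (3.0 * math.pi ** 2 * HBAR ** 3)


if __name__ == "__main__":
    # 7 eV is an illustrative value only, chosen to resemble a typical metal.
    print(f"n = {electron_density(7.0):.3e} per cubic metre")
```

Since n varies as ζ^{3/2}, quadrupling the Fermi energy multiplies the density by eight, which gives a quick test of the implementation.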

The corresponding formula for the adiabatic Ettingshausen-Nernst coefficient 
will also be required. It can be established by similar methods, and it is found that 


1 




[; 


9h*l 


(2 m)t gPT + 47r*£*A 2 A0 


2U m* 


(I)' 


/. + - 


D (Tyt2n* 

W V 




3H 2 + (^cp 0 ) 2 \n 

// a +(wecp 0 ) a /jJ‘ 


(18) 


While the interpolation formulae for σ and κ when H = 0 are identical with the
exact expressions for H = ∞ (Kohler 1940, 1941c), equation (16) shows that this is
no longer true for 𝔖. It is, however, known from Kohler’s results for H = ∞ and
Dube’s results for H = 0 (Dube 1938) that the free electron model does lead to
finite magneto-resistance effects if the calculations are pushed to a higher degree of
approximation than is considered in the present work. In the case of the thermoelectric
power a corresponding effect shows up in the first approximation, and has
therefore to be included in the interpolation formula for 𝔖. However, the magnetic
change predicted by (16) is small and is not in accordance with the experimental facts,
and is of little importance compared with the effects to be discussed below in connexion
with the two-band model.


4.11. Interpolation formulae for P and Q. The simplest way of obtaining the
corresponding generalized expressions for the two-band model is to set up the
interpolation formulae for P and Q. To do this it is only necessary to choose P and Q
in such a way that equations (1) and (5) lead to (16) and (18), considering a single
band and using the interpolation formulae I, (55), (56), (71), (72) for V, W, X, Y.
These conditions are sufficient to determine P and Q. They lead to two linear equations
expressing 𝔖 and B^a_EN in terms of P and Q which, when solved, give


P 


7r*(2m£)*wl 

r ne*p 0 ne*L n T ' 
K ft ft*0 J 


•a*| 

fne*p 0 ne*L n T^ 

l ft ft* 0 t 

}ftci*&v 

SH/Kq 

K7 8 ) 

F 

nh*L\T'\ 

ft**o / 



(19) 




7r*(2 


Q *» « 


3flbe„ 


/ne i p 0 ne*L n T J 

V ft ft^o 'J 

| hcB%y + | 

f ™*Po . ru*L n T 
l # &k 0 1 

1® 

( 

;-r) 

/ nVL%T" 

\ ft*g ; 

) 


where L_n = ⅓(πk/ε)², 𝔖 is given by (16), ρ_0 by (17), B^a_EN by (18), and


9« 4 /? 


27 h 


( 20 ) 


WfWf + (I) V> + f -»/,)} • <»> 


As a check it can be verified by means of elementary calculations, which will not be 
reproduced here, that (19) and (20) reduce to (8) at high temperatures, to (9) and (10) 
at low temperatures and to (11) for very large magnetic fields. 

A complete set of interpolation formulae for V, W, X, Y, P, Q is now available,
and can be used in conjunction with equations (1) to (7) to derive generalized
expressions for all the galvano- and thermomagnetic effects.


4.2. Two-band model. The values of 𝔖 and B^a_EN for the s-band alone but in an
arbitrary magnetic field are given by expressions like (16) and (18) but with the
suffix s inserted where necessary, and will be denoted by 𝔖_s and B^a_{EN,s} respectively;
also, 𝔖_0s and B^a_{EN,0s} are used to denote the values of 𝔖_s and B^a_{EN,s} when H = 0. In
writing down the corresponding expressions for the d-band, it must be borne in mind
that 𝔖_s and 𝔖_d have opposite signs while B^a_{EN,s} and B^a_{EN,d} have the same sign, as
is shown by equations (1) and (6). By means of equation (1) together with (19), (20)
and I, (55), (56), (71), (72) it is found, after much rearrangement and simplification,
that, for the most general case of two overlapping bands,


®{(°'0* + <r 0d) 8 + (“) «*} {*o.+*m+(„»**,. T ) ( n 2 + w |) ***- 


b on i , _ \ _ 2 _2 I \ , / 3//e \ n t rift 

**o*U. °* + M + W 0s 0 ") U. { oh M ' {s'Ic'ct) n\v£ **' 0 ‘ , 


+ ^fc K+ Ml 'W ”^r °* #d )U ( o, od) {nWT/ n\n\ 

/B\ ( 3 He_ \/@ 8 d \ /o-q, o-QdW^ <c M \ 

W [nWcTf K J \ n, n d )\n a n d J 


I (T 0» Cr 0d K 0» ,<: 0d 


i fe + ^)fe ( °’“ +<r " )+ (s) 


3e® 

+ n*k*T 

. 1 m „ $%N t d 

+ c Hn ‘-^t 


L \n t n d )\n d 


/ 3 He \*M t —w d 


W(j (<<>, +*w) (^Bcr) n*nl * 8 »*o«jJ <r o. 0 o < t 


+ ^ffe + ^)fe (tr °* +£rw) “(I) (22) 



σ_0s, σ_0d, κ_0s, κ_0d being given by equations like (17) and (21) but with the suffix s or d
inserted where necessary. In the absence of a magnetic field this reduces to


\[ \mathfrak{S}_0 = \frac{\sigma_{0s}\,\mathfrak{S}_{0s} + \sigma_{0d}\,\mathfrak{S}_{0d}}{\sigma_{0s} + \sigma_{0d}}. \tag{23} \]
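
The zero-field result may be read as a conductivity-weighted mean of the single-band values, 𝔖_0 = (σ_0s 𝔖_0s + σ_0d 𝔖_0d)/(σ_0s + σ_0d). The sketch below is a minimal numerical illustration on that reading of equation (23). The function and argument names are introduced here for illustration only, and the numbers in the usage lines are arbitrary.

```python
def two_band_thermopower(sigma_s, S_s, sigma_d, S_d):
    """Zero-field two-band value: conductivity-weighted mean of the band values.

    sigma_s, sigma_d : zero-field conductivities of the s- and d-bands
    S_s, S_d         : zero-field single-band thermoelectric quantities
    """
    return (sigma_s * S_s + sigma_d * S_d) / (sigma_s + sigma_d)


if __name__ == "__main__":
    # Arbitrary illustrative numbers: equal conductivities, opposite signs.
    lam = 0.4                      # hypothetical ratio, S_d = -lam * S_s
    S_s = 1.0
    S0 = two_band_thermopower(0.5, S_s, 0.5, -lam * S_s)
    print(S0, 0.5 * (1.0 - lam) * S_s)
```

With equal band conductivities and band values of opposite sign the two contributions partly cancel, which is why the resultant can be much smaller than either single-band value.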


4.21. Equations (22) and (23) make it possible to work out the variation of the
thermoelectric power with the magnetic field and the temperature in any particular
case. To get a general idea of the results, various assumptions are introduced which
help to simplify (22) considerably. In the first place, the variation with H of 𝔖_s, 𝔖_d,
B^a_{EN,s} and B^a_{EN,d} is neglected, since it is unimportant compared with the effect due
to the two-band model, and these quantities are therefore replaced by their values
for H = 0. It is also assumed that σ_0s = σ_0d = ½σ_0, κ_0s = κ_0d = ½κ_0, B^a_{EN,0s} = B^a_{EN,0d},
and 𝔖_0s = −𝔖_0d/λ. There are two cases to consider according as n_s is or is not equal
to n_d, but, since there does not appear to be any essential difference between the two
cases, the discussion will here be confined to the simpler case where n_s = n_d = ½n.
In this case 𝔖_0 = ½(1 − λ)𝔖_0s, and


®-©o 


This may be written in the form 

© = © 


3 H* 

1 n i k % n i c i T <ToKo 
+ \nVncTj 0 


1-1 

H \ 

| 2 A> 

1 ! 

[necp 0 ) 

' L n 

1 + l 

1 H \ 


K necpJ 

T? 


(24) 


(25) 




where L_n = ⅓(πk/ε)² and L_0 = κ_0/σ_0T, as in I, § 6.2.
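
The quantity L_n = ⅓(πk/ε)² is the normal Lorenz number, and L_0 = κ_0/σ_0T is the corresponding ratio for the actual metal. As a rough numerical guide, the sketch below evaluates L_n in SI units and defines the ratio L_0. The SI units and the function names are choices made here and do not come from the paper.

```python
import math

K_B = 1.380649e-23        # Boltzmann's constant, J/K
E_CHARGE = 1.602176634e-19  # magnitude of the electronic charge, C


def normal_lorenz_number():
    """L_n = (1/3) (pi k / e)^2, in W ohm K^-2."""
    return (math.pi * K_B / E_CHARGE) ** 2 / 3.0


def lorenz_ratio(kappa_0, sigma_0, T):
    """L_0 = kappa_0 / (sigma_0 T) for measured zero-field conductivities."""
    return kappa_0 / (sigma_0 * T)


if __name__ == "__main__":
    print(f"L_n = {normal_lorenz_number():.4e} W ohm / K^2")
```

A measured ratio L_0 may be compared directly with L_n; the two coincide when the Wiedemann-Franz law holds.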

Thus, if 𝔖_0 is positive, 𝔖 decreases steadily as H increases, changes sign for
a certain value of H, and in very strong fields it approaches the saturation value


oo 



(26) 


which is, in general, greater than 𝔖_0 in absolute value* (Makinson 1938). If 𝔖_0 is
negative, however, 𝔖 increases as H increases and eventually becomes positive.
The change in the thermoelectric power is −Δ𝔖/ε, where


* 


A© * © - © 0 



(27) 


At very low temperatures L_0 = L_n, ρ_0 = ρ_r (the residual resistance) and 𝔖_0 is proportional
to T. Hence Δ𝔖 = 0 when T = 0, and for small values of T it is proportional
to T. At high temperatures L_0 = L_n, and 𝔖_0 and ρ_0 are both proportional to T, so
that Δ𝔖 tends to zero like T⁻¹ as T tends to infinity. In fact, with the magnetic fields
that are obtainable in practice, nεcρ_0 ≫ H at room temperature for normal metals,
so that Δ𝔖 ≪ 𝔖_0 according to (27), which means that the change in thermoelectric
power will, in general, be too small to be detected at room temperature.

* Equation (22) shows that there is saturation in strong fields also when n_s ≠ n_d, but the
details of the field variation will be more complicated.

Thus Δ𝔖 is very small at very low and high temperatures, and must therefore
pass through a maximum (in absolute value) at some intermediate value of T. For
a more detailed discussion it is sufficient to make the further assumptions that
L_0 = L_n and that 𝔖_0 is proportional to T for all temperatures.* Equation (27) then
leads to the following condition for the maximum of |Δ𝔖|:


H = 



(28) 


Now ρ_0 can never be less than ρ_r; hence, when H = 0, (28) is satisfied for that temperature
for which (2T/ρ_0)(dρ_0/dT) = 1, that is, for which ρ_0 is proportional to √T over a
small temperature range. This temperature lies in the region where the residual and
ideal resistances are of the same order of magnitude (for very pure metals this means
temperatures of the order of 10° K). As H increases, this condition continues to hold
approximately until H is of the order of nεcρ_r; as H increases further, the condition
for the maximum is that H and nεcρ_0 must be of the same order of magnitude
[since (2T/ρ_0)(dρ_0/dT) − 1 is always of order unity], and hence the temperature of the
maximum increases with H.
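
The small-field condition (2T/ρ_0)(dρ_0/dT) = 1 can be made concrete with a simple model of the resistivity. The sketch below assumes, purely for illustration, that ρ_0 = ρ_r + aT⁵ (a residual part plus an ideal part following a fifth-power law); this form, the coefficient a and the function names are assumptions introduced here and are not taken from the paper. Under this assumption the condition reduces to 9aT⁵ = ρ_r, so that the ideal resistance at the maximum is one-ninth of the residual resistance.

```python
def resistivity(T, rho_r, a):
    """Total resistivity: residual part plus an assumed T**5 ideal part."""
    return rho_r + a * T ** 5


def maximum_condition(T, rho_r, a):
    """Return (2T/rho_0) d(rho_0)/dT - 1, which vanishes at the small-field maximum."""
    drho_dT = 5.0 * a * T ** 4
    return 2.0 * T * drho_dT / resistivity(T, rho_r, a) - 1.0


def temperature_of_maximum(rho_r, a, lo=1e-6, hi=1e4, steps=200):
    """Solve maximum_condition(T) = 0 by bisection on the interval [lo, hi]."""
    if maximum_condition(lo, rho_r, a) > 0 or maximum_condition(hi, rho_r, a) < 0:
        raise ValueError("root is not bracketed by [lo, hi]")
    for _ in range(steps):
        mid = 0.5 * (lo + hi)
        if maximum_condition(mid, rho_r, a) < 0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


if __name__ == "__main__":
    rho_r, a = 1.0, 1e-5            # arbitrary units, illustrative only
    T_max = temperature_of_maximum(rho_r, a)
    print(T_max, (rho_r / (9.0 * a)) ** 0.2)
```

The numerical root agrees with the closed form T = (ρ_r/9a)^{1/5}, and lies where the ideal and residual resistances are comparable, as stated in the text.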

The qualitative behaviour of |Δ𝔖| as a function of temperature is therefore as
follows. |Δ𝔖| is very small at sufficiently high temperatures, rises to a maximum
as the temperature decreases and then tends to zero as T approaches zero (linearly
for sufficiently small T). For small values of the magnetic field, the maximum
occurs at a temperature of the order of 10° K. As H increases, |Δ𝔖| increases, but
the temperature at which the maximum occurs remains approximately constant
until H is of the order of nεcρ_r; for larger values of H, the temperature of the maximum
increases with H (linearly for sufficiently large H).


4.3. Beryllium appears to be the only metal (apart from the metals of the abnormal
bismuth group) for which there are any experimental results (Grüneisen & Erfling
1939). The change in thermoelectric power is too small to be detected at temperatures
above about 150° K; at lower temperatures it rises to a maximum and
then decreases towards zero as T tends to zero. The position of the maximum moves
towards higher temperatures as the magnetic field increases. These results are in
qualitative agreement with the predictions of the two-band model, but it must still
be verified that there is also order-of-magnitude quantitative agreement. For one
of the specimens investigated, the maximum was found to occur at 60° K in a field
of 12,200 gauss. At this temperature the resistivity of the specimen is about 3 × 10⁻²⁰
gaussian units, and (28) therefore indicates that n is of the order of 3 × 10²², which
agrees in order of magnitude with the value derived from the magneto-resistance
effects in I, § 6.2. The maximum change in the thermoelectric power is observed to
be of the same order of magnitude as the thermoelectric power in the absence of
a magnetic field, which agrees with (27) and (28).

* In general, L_0 is less than L_n (Makinson 1938); also 𝔖_0 is given by (16) (with H = 0),
and is a linear function of T only at very high and very low temperatures. Taking into account
the exact temperature variation of L_0 and 𝔖_0 changes the quantitative but not the qualitative
nature of the results.
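
The order-of-magnitude estimate for beryllium can be retraced on the assumption, stated in § 4.21, that at the maximum H and nεcρ_0 are of the same order, so that n is roughly H/(εcρ_0). The sketch below carries out this arithmetic in gaussian units. The resistivity of 3 × 10⁻²⁰ gaussian units is an assumed reading of a figure that is damaged in the source, and the function name is introduced here for illustration.

```python
E_ESU = 4.803e-10    # magnitude of the electronic charge, e.s.u.
C_LIGHT = 2.998e10   # velocity of light, cm/s


def carrier_density_estimate(H_gauss, rho_gaussian):
    """Order-of-magnitude density (per cm^3) from H ~ n * e * c * rho_0."""
    return H_gauss / (E_ESU * C_LIGHT * rho_gaussian)


if __name__ == "__main__":
    # Field from the text; the resistivity is an assumed reading of the source.
    n = carrier_density_estimate(12200.0, 3e-20)
    print(f"n ~ {n:.1e} per cm^3")
```

Only the power of ten is significant here, since the relation between H and nεcρ_0 at the maximum holds only up to a factor of order unity.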

5. The transverse effects in zero magnetic field 

5.1. Single-band model with a time of relaxation. When a time of relaxation exists,
the coefficients of the transverse effects are readily calculated by means of equations
(3) to (7) together with (8) and I, (27), (28). For the present only the case of a
vanishingly small magnetic field will be considered, the question of the influence of
H being left until § 6. The formulae for the simplest model of perfectly free electrons
are obtained by suppressing all terms referring to the d-band in equations (3) to (7);


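\[ A_H = -\frac{1}{n\epsilon c}, \tag{29} \]
\[ B_{RL} = -\frac{\epsilon\tau}{mc}, \tag{30} \]
\[ B_{EN} = \frac{\pi^2 k^2 T}{3mc}\,\frac{\partial\tau}{\partial\zeta}, \tag{31} \]
\[ A_E = \frac{T}{nc\tau}\,\frac{\partial\tau}{\partial\zeta}, \tag{32} \]

τ being regarded as an arbitrary function of ζ. Analogous formulae hold for a metal
containing a single d-band, except that ε has to be replaced by −ε, so that A_H and
B_RL are positive instead of negative.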

Combining (29) and (30) and using the expression (Wilson 1936, p. 161)

\[ \sigma = \frac{n\epsilon^2\tau}{m} \tag{33} \]

for the electrical conductivity, it is found that

\[ B_{RL} = A_H\,\sigma. \tag{34} \]

Also, introducing A_EN = B_EN/κ = 3mB_EN/(π²k²nTτ),* (31) and (32) lead to

\[ A_E = T A_{EN}, \tag{35} \]

and, combining (34) and (35) and introducing A_RL = B_RL/κ, it is found that

\[ \frac{A_H A_{EN}}{A_E A_{RL}} = \frac{\pi^2}{3}\left(\frac{k}{\epsilon}\right)^2. \tag{36} \]

* It will be recalled that B_EN is taken to represent the measured value of the Ettingshausen-Nernst
coefficient (see § 2).



499 


Galvanomagnetic and thermomagnetic effects in metals

The relations (34), (35) and (36) have been known for a long time (Bridgman
1924), and Meissner (1935, p. 383) has shown that there is order-of-magnitude
agreement with the experimental values for room temperature. The experimental
evidence is considered further in § 7·2.
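
The way in which (34) to (36) follow from (29) to (33) can be checked numerically. The following is a minimal sketch and not part of the original derivation. It assumes the free-electron forms $A_H = -1/(nec)$, $B_{RL} = -e\tau/(mc)$, $B^i_{EN} = (\pi^2k^2T/3mc)\,\partial\tau/\partial\zeta$ and $A_E = (T/nc\tau)\,\partial\tau/\partial\zeta$, together with $\sigma = ne^2\tau/m$ and $\kappa = \pi^2k^2nT\tau/(3m)$. The power law $\tau = \tau_0\zeta^r$, the function names and the arbitrary units are illustrative assumptions.

```python
import math

def free_electron_coefficients(n, e, m, c, k, T, zeta, tau0, r):
    """Coefficients (29)-(32) for an ASSUMED power law tau = tau0 * zeta**r."""
    tau = tau0 * zeta**r
    dtau = r * tau / zeta                                # d(tau)/d(zeta)
    sigma = n * e**2 * tau / m                           # conductivity, (33)
    kappa = math.pi**2 * k**2 * n * T * tau / (3 * m)    # thermal conductivity
    return {
        "A_H": -1.0 / (n * e * c),                           # (29)
        "B_RL": -e * tau / (m * c),                          # (30)
        "B_EN": math.pi**2 * k**2 * T / (3 * m * c) * dtau,  # (31)
        "A_E": T / (n * c * tau) * dtau,                     # (32)
        "sigma": sigma,
        "kappa": kappa,
    }

def relation_ratios(co, k, e, T):
    """Return three ratios that equal 1 when (34), (35) and (36) hold."""
    A_EN = co["B_EN"] / co["kappa"]
    A_RL = co["B_RL"] / co["kappa"]
    L_n = (math.pi * k / e) ** 2 / 3
    r34 = co["B_RL"] / (co["A_H"] * co["sigma"])
    r35 = co["A_E"] / (A_EN * T)
    r36 = co["A_H"] * A_EN / (co["A_E"] * A_RL * L_n)
    return r34, r35, r36
```

Calling `relation_ratios` for the two limiting exponents $r = -\tfrac12$ and $r = \tfrac32$ returns three values equal to unity within rounding error, whatever values are chosen for the remaining parameters.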

Equations (29) to (32) have been discussed by Sommerfeld & Frank (1931) in the
special case where the free path is independent of $\zeta$ (and therefore $\tau$ is proportional
to $\zeta^{-1/2}$); the generalization for the case where $\tau$ is an arbitrary function of $\zeta$ is due to
Kohler (1941a). The Sommerfeld & Frank case is realized at extremely low temperatures,
where the effect of the impurities is dominant; at high temperatures,
however, $\tau$ is proportional to $\zeta^{3/2}$ (Wilson 1936, pp. 208, 223; compare also equation
(12) of the present paper). The formulae for the two limiting cases of very low and
very high temperatures are thus obtained from (29) to (32) by replacing $\partial\tau/\partial\zeta$ by
$-\tfrac12\tau/\zeta$ and $\tfrac32\tau/\zeta$ respectively. They are special cases of the generalized formulae
given in § 5·2 and need not, therefore, be discussed separately.


5·2. Single-band model: arbitrary temperature. At intermediate temperatures it
is not legitimate to assume that a time of relaxation exists, and equations (29) to
(32) cannot be used. By means of the interpolation formulae I, (55), (56), (71), (72)
for $V$, $W$, $X$, $Y$ and (19), (20) of the present paper for $P$, $Q$ it is possible, however, to
calculate the generalized expressions which are approximately valid over the whole
temperature range. The equations which replace (29) to (32) are found to be


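$$A_H = -\frac{1}{nec}, \tag{37}$$

$$B_{RL} = -\frac{3e\kappa_0}{\pi^2k^2ncT}, \tag{38}$$

$$B^i_{EN} = B^i_{EN,0} + \frac{3\kappa_0}{\pi^2k^2ncT}\,\mathfrak{S}_0, \tag{39}$$

$$A_E = \frac{T}{\kappa_0}\,B^i_{EN,0} + \frac{3\mathfrak{S}_0}{\pi^2k^2nc}, \tag{40}$$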


where $\kappa_0$ is given by (21), and $B^i_{EN,0}$ and $\mathfrak{S}_0$ are obtained by putting $H = 0$ in (18)
and (16) respectively.

[The explicit expressions (41) for $B^i_{EN,0}$ and (42) for $\mathfrak{S}_0$ are illegible in the source.]


It is now possible to discuss the predictions of the free electron model for the 
temperature variation of the coefficients over the whole temperature range. 

Equation (37) shows that the well-known formula for the Hall coefficient is of 
general validity as an interpolation formula (as was to be expected since it does not 




involve the time of relaxation). $A_H$ should therefore be independent of $T$ for all
temperatures, which is roughly in accord with the experimental results for many
normal metals (but see §§ 5·3 and 7·3 infra).

The temperature variation of $\kappa_0$ has been discussed by Makinson (1938). $\kappa_0$ is
independent of $T$ at high temperatures, rises to a maximum at very low temperatures
and then tends to zero as $T$ approaches zero (linearly for sufficiently small $T$).
Hence, according to (38), $B_{RL}$ is proportional to $T^{-1}$ at high temperatures, but
increases more rapidly than $T^{-1}$ at low temperatures; finally, when the effect of the
impurities is dominant, $B_{RL}$ tends to a constant value $-(nec\rho_r)^{-1}$, where $\rho_r$ is the
residual resistance. The alternative coefficient $A_{RL} = B_{RL}/\kappa_0$ is proportional to
$T^{-1}$ for all $T$.

Comparison of equations (21) and (41) shows that the expressions for $-3\kappa_0/(2nc\zeta)$
and $B^i_{EN,0}$ are very similar, the only difference arising from a single term which is
negligible except for temperatures close to the Debye temperature. For a qualitative
discussion it will be sufficient to put $B^i_{EN,0} = -3\kappa_0/(2nc\zeta)$.* Further, (42)
shows that it is possible to write $\mathfrak{S}_0 = x\pi^2k^2T/\zeta$, where $x$ decreases steadily from the
value 1 at high temperatures to the value $\tfrac13$ at very low temperatures.

Equations (39) and (40) now reduce to

$$B^i_{EN} = \frac{3\kappa_0}{nc\zeta}\left(x - \tfrac12\right), \tag{43}$$

$$A_E = \frac{3T}{nc\zeta}\left(x - \tfrac12\right). \tag{44}$$

At high temperatures $x = 1$ and $\kappa_0$ is independent of $T$, so that $B^i_{EN}$ is positive and
independent of $T$, while $A_E$ is positive and proportional to $T$. As $T$ decreases,
$B^i_{EN}$ and $A_E$ both decrease and become zero at that temperature for which $x = \tfrac12$.

At extremely low temperatures $x = \tfrac13$ and $\kappa_0$ is proportional to $T$; hence, as $x$
decreases below the value $\tfrac12$, $B^i_{EN}$ and $A_E$ become negative, pass through a minimum
and finally tend to zero as $T$ approaches zero (linearly for sufficiently small $T$).
The minimum occurs in the temperature region where the residual and ideal
resistivities are of the same order of magnitude.
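
The limiting values of $x$ quoted above can be tied to the two power laws of § 5·1 by a short calculation. The sketch below is illustrative only. It assumes (43) in the form $B^i_{EN} = (3\kappa_0/nc\zeta)(x - \tfrac12)$ and (31) in the form $B^i_{EN} = (\pi^2k^2T/3mc)\,\partial\tau/\partial\zeta$ with $\kappa = \pi^2k^2nT\tau/(3m)$, so that a power law $\tau \propto \zeta^r$ corresponds to $3(x - \tfrac12) = r$. The function names are hypothetical.

```python
from fractions import Fraction

def x_from_exponent(r):
    """Value of x equivalent to tau ~ zeta**r, assuming 3 * (x - 1/2) = r."""
    return Fraction(1, 2) + Fraction(r) / 3

def sign_of_en(x):
    """Sign (+1, 0 or -1) of the factor (x - 1/2) in the assumed forms of (43) and (44)."""
    d = Fraction(x) - Fraction(1, 2)
    return (d > 0) - (d < 0)
```

With $r = \tfrac32$ (high temperatures) this gives $x = 1$ and a positive sign; with $r = -\tfrac12$ (impurity scattering dominant) it gives $x = \tfrac13$ and a negative sign, the change of sign occurring at $x = \tfrac12$.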

The experimental results for the temperature variation of the coefficients are 
very meagre and always refer to high temperatures (except for the Hall coefficient); 
they are considered further in § 7·4. Measurements at low temperatures are essential
for a complete experimental test of the behaviour predicted by the free electron 
model. 

It is easily seen from equations (37) to (40) that the relations (35) and (36) among
the coefficients remain valid for all temperatures, but that (34) must, in the general
case, be replaced by

$$B_{RL} = \frac{L_0}{L_n}\,A_H\sigma_0, \tag{45}$$

* This relation is actually exact when H is infinite (since equations (18) and (21) are exact 
when H is infinite). 





where $\sigma_0$ is given by (17). At sufficiently high temperatures $L_0 = L_n$, but for normal
metals $L_0/L_n$ may be considerably less than unity over an extended temperature
range (compare Makinson 1938, figure 3. Note that his notation differs slightly
from ours). The experimental evidence relating to equation (45) is discussed in § 7·2.

5·3. Two-band model. It is now of interest to investigate to what extent the
results obtained above for a metal containing a single band are modified in the more
general case of two overlapping bands. It will be found that, although the single-band
model gives the right order of magnitude for the transverse effects, its more
detailed predictions regarding the temperature variation and inter-relation of the
coefficients (especially as regards the low temperature behaviour) cannot be expected
to hold for real metals except in special cases, and attempts to explain the experimental
facts on the basis of this model are not, in general, likely to be successful.

Retaining the terms which refer to the d-band in the formal expressions (3) to 
(7), it is found that the equations which replace (37) to (40) are 


$$A_H = -\frac{1}{ec}\,\frac{\sigma_{0s}^2/n_s - \sigma_{0d}^2/n_d}{(\sigma_{0s}+\sigma_{0d})^2}, \tag{46}$$

$$B_{RL} = -\frac{3e}{\pi^2k^2cT}\,\frac{\kappa_{0s}^2/n_s - \kappa_{0d}^2/n_d}{\kappa_{0s}+\kappa_{0d}}, \tag{47}$$

[The expressions (48) for $B^i_{EN}$ and (49) for $A_E$ are illegible in the source.]

the notation being the same as in § 4·2.

These equations show that $A_H$ and $B_{RL}$ are negative or positive according as the
conduction is predominantly due to the s-band or the d-band. $B^i_{EN}$ and $A_E$ are,
however, always positive at high temperatures, the s- and d-bands giving contributions
of the same sign. This is due to the fact that, while $A_H$ and $B_{RL}$ are odd functions
of the electronic charge $e$, $B^i_{EN}$ and $A_E$ are even functions of $e$, so that conduction
by 'positive holes' does not lead to anomalous effects in the latter case. There are,
however, many metals, including the monovalent metals Cu, Ag, Au, which have
negative Ettingshausen and Ettingshausen-Nernst coefficients at high temperatures.
This is clearly a more complicated effect than the occurrence of the positive
Hall and Righi-Leduc coefficients, and some more general model than those so far
considered would be necessary for its elucidation. The type of model required is
indicated by equations (31) and (32), which show that $B^i_{EN}$ and $A_E$ are proportional
to the energy derivative of the time of relaxation, and are therefore negative
provided that $\tau$ is a decreasing function of the energy at the surface of the Fermi
distribution.* No model which has this property at high temperatures has so far
been proposed; in addition to giving negative values for $B^i_{EN}$ and $A_E$, it would
presumably explain the positive thermoelectric powers of Cu, Ag and Au.


5·31. The temperature variation of the coefficients predicted by equations (46)
to (49) is, in general, more complicated than that predicted by the single-band model.

Writing $\sigma_{0s} = p\sigma_0$, $\sigma_{0d} = (1-p)\sigma_0$, the expression (46) for the Hall coefficient
becomes

$$A_H = -\frac{1}{ec}\left(\frac{p^2}{n_s} - \frac{(1-p)^2}{n_d}\right). \tag{50}$$

From (50) it follows that

$$\frac{dA_H}{dp} = -\frac{2}{ec}\left(\frac{p}{n_s} + \frac{1-p}{n_d}\right) < 0;$$


hence $A_H$ is smaller (algebraically), the larger the value of $p$, i.e. the larger the
proportion of the total electric current which is carried by the s-band. Temperature
variations of $A_H$ can be accounted for by allowing $p$ to vary with the temperature,
and such an argument has been used by Jones (1936) in his discussion of the Hall
effect in bismuth. However, if one accepts equation (17) for the electrical conductivity,
any temperature variation of $p$ is unlikely to be very marked and should in
any case set in only at very low temperatures where the residual and ideal resistivities
are of the same order of magnitude. The present model might therefore be used, at
best, to explain a change in $A_H$ by a factor of (say) 2 at very low temperatures.
Jones's argument requires a larger variation of $p$ than seems reasonable according to
the present considerations, and some other mechanism appears to be necessary to
account for his assumptions.
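
The dependence of $A_H$ on $p$ described here is easily tabulated. The sketch below is illustrative only: it assumes (50) in the form $A_H = -(1/ec)\,[p^2/n_s - (1-p)^2/n_d]$, works in units in which $ec = 1$ by default, and uses hypothetical function names.

```python
def hall_two_band(p, n_s, n_d, ec=1.0):
    """Assumed form of (50): Hall coefficient when a fraction p of the current is in the s-band."""
    return -(p**2 / n_s - (1.0 - p) ** 2 / n_d) / ec

def dhall_dp(p, n_s, n_d, ec=1.0):
    """Derivative of the assumed (50) with respect to p; negative for 0 <= p <= 1."""
    return -2.0 * (p / n_s + (1.0 - p) / n_d) / ec

if __name__ == "__main__":
    # Illustrative band populations only; they are not taken from the paper.
    for p in (0.0, 0.25, 0.5, 0.75, 1.0):
        print(p, hall_two_band(p, n_s=1.0, n_d=1.0))
```

At $p = 1$ the expression reduces to the single s-band value $-1/(n_sec)$ and at $p = 0$ to the single d-band value $+1/(n_dec)$, with a monotonic decrease in between.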

The Hall coefficients of many normal metals increase in absolute value as the
temperature is lowered, but become constant at extremely low temperatures. The
experimental data are considered in detail in § 7·3.

Similar considerations as to $A_H$ apply to $B_{RL}$, but here the temperature dependence
predicted by the free electron model (equation (38)) is probably more important
than any effects due to the two-band model, and no significant new effects are to
be expected. Equation (38) is therefore probably of wider validity than its derivation
would suggest, and may be regarded as an approximation formula for the temperature
variation of the Righi-Leduc coefficient of all metals. The detailed behaviour
of $B_{RL}$ as a function of temperature is obtained from (38) and Makinson's (1938)
discussion of the thermal conductivity; it has been considered qualitatively in § 5·2.
Measurements on a variety of metals at low temperatures would be required to
verify the general validity of equation (38).

* In deriving equations (31) and (32) it has been assumed that $\partial^2E/\partial|\mathbf{k}|^2$ is positive. The
more general criterion for anomalous values of $B^i_{EN}$ and $A_E$ is that
$\dfrac{\partial}{\partial\zeta}\left(\tau\,\dfrac{\partial^2E}{\partial|\mathbf{k}|^2}\right)$ should be negative.




For a qualitative discussion of $B^i_{EN}$ and $A_E$ it is sufficient to put $\sigma_{0s} = \sigma_{0d} = \tfrac12\sigma_0$,
$\kappa_{0s} = \kappa_{0d} = \tfrac12\kappa_0$, $n_sB^i_{EN,0s} = n_dB^i_{EN,0d} = -3\kappa_0/(4c\zeta)$ and $\mathfrak{S}_{0s} = -\mathfrak{S}_{0d} = x\pi^2k^2T/\zeta$,
where $\tfrac13 \le x \le 1$ (compare § 5·2). Writing also $L_n = \tfrac13(\pi k/e)^2$, $L_0 = \kappa_0/\sigma_0T$, equations
(48) and (49) reduce to

$$B^i_{EN} = \frac{3\kappa_0}{4c\zeta}\left(\frac{1}{n_s}+\frac{1}{n_d}\right)\left(x - \tfrac12 + x\,\frac{L_n}{L_0}\right), \tag{51}$$

$$A_E = \frac{3T}{4c\zeta}\left(\frac{1}{n_s}+\frac{1}{n_d}\right)\left(x - \tfrac12 + x\,\frac{L_n}{L_0}\right). \tag{52}$$

Apart from the fact that $1/n$ is replaced by $\tfrac14(1/n_s + 1/n_d)$, equations (51) and (52) differ
from the corresponding equations (43) and (44) for free electrons in having the
additional term $xL_n/L_0$. The temperature variation of $L_0$ has been discussed by
Makinson (1938; see also I, § 6·2). For normal metals $L_0$ is equal to $L_n$ at both high
and very low temperatures but may be considerably less at intermediate temperatures.
For metals such as bismuth, however, which have an abnormally small
number of free electrons, $L_0$ is in general greater than $L_n$ at intermediate temperatures.
At high temperatures, therefore, the sign and temperature variation of
$B^i_{EN}$ and $A_E$ are the same as for free electrons, only the magnitude of the coefficients
being different. The behaviour is, however, entirely different at low temperatures,
where the free electron model predicts that $B^i_{EN}$ and $A_E$ change sign. Equations
(51) and (52) show that $B^i_{EN}$ and $A_E$ are only negative when the condition


$$\frac{L_n}{L_0} < \frac{1}{2x} - 1 \tag{53}$$

is satisfied. Now $\tfrac13 \le x \le 1$, so that $1/(2x) - 1 \le \tfrac12$; hence $B^i_{EN}$ and $A_E$ are positive
at high and also at very low temperatures, where $L_0 = L_n$. Also (53) can never hold
if $L_0 \le L_n$, and therefore $B^i_{EN}$ and $A_E$ are always positive in this case. If $L_0 > L_n$,
however, it is possible that $B^i_{EN}$ and $A_E$ may become negative in some intermediate
temperature range. The detailed behaviour is, in general, complicated, but it can
be worked out from (51) and (52) for any particular case.
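
The sign criterion can be explored for particular values of $x$ and $L_0/L_n$. The sketch below is illustrative only and rests on two assumptions: that the temperature-dependent factor in (51) and (52) is $x - \tfrac12 + xL_n/L_0$, and that condition (53) reads $L_n/L_0 < 1/(2x) - 1$. The function names are hypothetical.

```python
def en_reduced(x, l0_over_ln):
    """Assumed temperature-dependent factor of (51) and (52): x - 1/2 + x * L_n / L_0."""
    return x - 0.5 + x / l0_over_ln

def condition_53(x, l0_over_ln):
    """Assumed form of (53): True when L_n / L_0 < 1 / (2x) - 1."""
    return 1.0 / l0_over_ln < 1.0 / (2.0 * x) - 1.0

if __name__ == "__main__":
    # Illustrative values only: x between 1/3 and 1, L_0/L_n from below to above unity.
    for x in (0.35, 0.5, 1.0):
        for ratio in (0.5, 1.0, 3.0, 10.0):
            print(x, ratio, en_reduced(x, ratio), condition_53(x, ratio))
```

On these assumptions the factor is negative exactly when the condition holds, and the condition is never met for $L_0 \le L_n$ with $\tfrac13 \le x \le 1$.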

Finally, equations (46) to (49) show that the simple relations (36) and (45) derived 
for free electrons are only valid for the two-band model in the special case where it 
is legitimate to assume that a time of relaxation exists (and where (45) reduces to (34)). 
Thus the theoretical predictions for free electrons appear to be generally valid 
at high temperatures, but cannot hold for multivalent metals at low temperatures 
except (at best) as order-of-magnitude relations. On the other hand, comparison 
of (48) and (49) shows that the relation (35) between the Ettingshausen-Nernst 
and the Ettingshausen coefficient continues to be valid for the two-band model at 
all temperatures, and is thus of a more general type than (36) and (45). 

These results of the electron theory concerning the relations among the coefficients
are in accordance with the thermodynamic arguments given by Meissner (1935,
p. 316). The three relations (34), (35) and (36) are derived there, but the argument
leading to (34) and (36) involves certain assumptions which are equivalent to
assuming the validity of the Wiedemann-Franz law. Now this law is known to be
a consequence of the existence of a time of relaxation (Wilson 1936, p. 176; I, p. 461),
and thus Meissner's derivation can only be valid for high temperatures. The derivation
of (35), however, appears to involve purely thermodynamic arguments only,
and this relation should therefore be a general property of all models at all temperatures.
There are unfortunately no experimental results with which to check
the validity of (35) at low temperatures; for the room temperature data regarding
the relations among the coefficients, see § 7·2.


6. The field variation of the transverse effects 


6·1. The free electron model leads to values for the coefficients of the transverse
effects which are independent of $H$ for all values of $H$ (at most, there is a negligible
effect of the type (18)).* This is because the field variation of the transverse effects,
like the magneto-resistance effect, depends essentially upon an average of the
departure of the energy surfaces from being a singly-connected set of spheres, and
the two-band model is therefore the simplest model which can be expected to give
results of the correct order of magnitude. The generalizations of equations (46) to
(49) for the case of a magnetic field of arbitrary magnitude are readily found to be


$$A_H = -\frac{1}{ec}\,\frac{\dfrac{\sigma_{0s}^2}{n_s} - \dfrac{\sigma_{0d}^2}{n_d} + \left(\dfrac{H}{ec}\right)^2\dfrac{n_s-n_d}{n_s^2n_d^2}\,\sigma_{0s}^2\sigma_{0d}^2}{(\sigma_{0s}+\sigma_{0d})^2 + \left(\dfrac{H}{ec}\right)^2\dfrac{(n_s-n_d)^2}{n_s^2n_d^2}\,\sigma_{0s}^2\sigma_{0d}^2}, \tag{54}$$

$$B_{RL} = -\frac{3e}{\pi^2k^2cT}\,\frac{\dfrac{\kappa_{0s}^2}{n_s} - \dfrac{\kappa_{0d}^2}{n_d} + \left(\dfrac{3eH}{\pi^2k^2cT}\right)^2\dfrac{n_s-n_d}{n_s^2n_d^2}\,\kappa_{0s}^2\kappa_{0d}^2}{\kappa_{0s}+\kappa_{0d} + \left(\dfrac{3eH}{\pi^2k^2cT}\right)^2\left(\dfrac{\kappa_{0s}}{n_s^2}+\dfrac{\kappa_{0d}}{n_d^2}\right)\kappa_{0s}\kappa_{0d}}. \tag{55}$$

[The expressions (56) for $B^i_{EN}$ and (57) for $A_E$ are illegible in the source; each ends with the
words '+ a similar term with s and d interchanged and the sign of $\mathfrak{S}$ changed'.]

* This is actually true only for effects measured under adiabatic conditions. The isothermal
effects for free electrons in general depend explicitly on $H$ even when a time of relaxation
exists (Kohler 1941a). Thus, for example, formula (56) for $B^i_{EN}$ does not become independent
of $H$ on putting $n_s = 0$ or $n_d = 0$. Even here, however, the major part of the field variation is
due to the departure of the energy surfaces from being a singly-connected set of spheres,
and the predictions of the free electron model cannot be in agreement with experiment.


6·2. Equation (54) is the generalization of the corresponding expression involving
the times of relaxation which is only valid at high temperatures, and which has been
used by Jones (1936) to discuss the Hall effect in bismuth.* The two cases $n_s = n_d$
and $n_s \neq n_d$ will be considered separately.

Case (i). $n_s = n_d = \tfrac12 n$

In this case equation (54) reduces to

$$A_H = -\frac{2}{nec}\,\frac{\sigma_{0s}-\sigma_{0d}}{\sigma_{0s}+\sigma_{0d}}, \tag{58}$$

and $A_H$ is therefore independent of $H$ for all values of $H$.

Case (ii). $n_s \neq n_d$

Writing, for simplicity, $\sigma_{0s} = \sigma_{0d} = \tfrac12\sigma_0$, (54) becomes

$$A_H = -\frac{1}{4ec}\left(\frac{1}{n_s}-\frac{1}{n_d}\right)\frac{1 - \dfrac{1}{4n_sn_d}\left(\dfrac{H\sigma_0}{ec}\right)^2}{1 + \dfrac{(n_s-n_d)^2}{16n_s^2n_d^2}\left(\dfrac{H\sigma_0}{ec}\right)^2}. \tag{59}$$

This equation shows that, as $H$ increases, the absolute value of $A_H$ decreases at first
(the change being initially proportional to $H^2$), and reaches the value zero when
$H = 2(n_sn_d)^{1/2}ec/\sigma_0$. As $H$ increases further, $A_H$ changes sign and increases steadily
in absolute value, and for very large fields it approaches the limiting value

$$(A_H)_{H\to\infty} = -\frac{1}{ec(n_s-n_d)}, \tag{60}$$

which is independent of the electrical conductivity.

The most extensive experimental work on the field dependence of $A_H$ has been
carried out on bismuth and its alloys, which show very marked effects even at room
temperature, and which can be discussed by the formal theory (for anisotropic
metals) already given by Jones. The sign reversal predicted by equation (59) has
been observed in bismuth-lead alloys for a certain range of compositions (Thomas
& Evans 1933; see also Jones 1936, p. 663).

The data for the normal metals, which are very scant and unsatisfactory, are
discussed in § 7·3.
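
The field dependence described in case (ii) can be reproduced directly from the two-band expression. The sketch below is illustrative only: it assumes (54) in the standard two-band form, with numerator $\sigma_{0s}^2/n_s - \sigma_{0d}^2/n_d + (H/ec)^2(n_s-n_d)\sigma_{0s}^2\sigma_{0d}^2/(n_s^2n_d^2)$ and denominator $(\sigma_{0s}+\sigma_{0d})^2 + (H/ec)^2(n_s-n_d)^2\sigma_{0s}^2\sigma_{0d}^2/(n_s^2n_d^2)$. Units with $ec = 1$ and the function name are assumptions.

```python
import math

def hall_coefficient(H, sig_s, sig_d, n_s, n_d, ec=1.0):
    """Two-band Hall coefficient in a field H, using the assumed form of (54)."""
    cross = (H / ec) ** 2 * sig_s**2 * sig_d**2 / (n_s**2 * n_d**2)
    num = sig_s**2 / n_s - sig_d**2 / n_d + cross * (n_s - n_d)
    den = (sig_s + sig_d) ** 2 + cross * (n_s - n_d) ** 2
    return -num / (ec * den)

if __name__ == "__main__":
    # Illustrative numbers only: equal band conductivities, unequal populations.
    n_s, n_d, sigma0 = 2.0, 1.0, 1.0
    H_zero = 2.0 * math.sqrt(n_s * n_d) / sigma0   # field at which A_H vanishes (ec = 1)
    for H in (0.0, 0.5 * H_zero, H_zero, 10.0 * H_zero):
        print(H, hall_coefficient(H, sigma0 / 2, sigma0 / 2, n_s, n_d))
```

With these assumptions the coefficient starts at $-(1/4ec)(1/n_s - 1/n_d)$, passes through zero at $H = 2(n_sn_d)^{1/2}ec/\sigma_0$ and tends to $-1/[ec(n_s - n_d)]$; for $n_s = n_d$ it does not depend on $H$.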

* Jones's formula is more general than ours in that it applies to anisotropic metals; as was
pointed out in § 1·2, however, this is a purely formal generalisation of the theory.



6·3. For the Righi-Leduc effect there are again two cases to consider, according
as $n_s$ is or is not equal to $n_d$.

Case (i). $n_s = n_d = \tfrac12 n$

In this case

$$B_{RL} = -\frac{6e}{\pi^2k^2ncT}\,\frac{\kappa_{0s}-\kappa_{0d}}{1 + 4\left(\dfrac{3eH}{\pi^2k^2ncT}\right)^2\kappa_{0s}\kappa_{0d}}, \tag{61}$$

which shows that the absolute value of $B_{RL}$ decreases steadily as $H$ increases. There
is no saturation in strong fields, $B_{RL}$ being proportional to $H^{-2}$ for large values of $H$.

Case (ii). $n_s \neq n_d$

Writing, for simplicity, $\kappa_{0s} = \kappa_{0d} = \tfrac12\kappa_0$, it is found that

$$B_{RL} = -\frac{3e\kappa_0}{4\pi^2k^2cT}\left(\frac{1}{n_s}-\frac{1}{n_d}\right)\frac{1 - \dfrac{1}{4n_sn_d}\left(\dfrac{3eH\kappa_0}{\pi^2k^2cT}\right)^2}{1 + \dfrac{n_s^2+n_d^2}{8n_s^2n_d^2}\left(\dfrac{3eH\kappa_0}{\pi^2k^2cT}\right)^2}. \tag{62}$$

The behaviour is therefore similar to that of $A_H$. As $H$ increases, $B_{RL}$ decreases at
first in absolute value, changes sign when $H = 2\pi^2(n_sn_d)^{1/2}k^2cT/(3e\kappa_0)$, then increases
in absolute value and eventually tends to the limiting value

$$(B_{RL})_{H\to\infty} = -\frac{3e\kappa_0}{2\pi^2k^2cT}\,\frac{n_s-n_d}{n_s^2+n_d^2}. \tag{63}$$
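
The corresponding behaviour of $B_{RL}$ can be checked in the same way. The sketch below is illustrative only: it assumes (55) in the form obtained by adding the thermal conductivity tensors of the two bands, writes $a = 3e/(\pi^2k^2cT)$, and works in units with $a = 1$ by default. The function name and the numerical values are assumptions.

```python
import math

def righi_leduc(H, k_s, k_d, n_s, n_d, a=1.0):
    """Two-band Righi-Leduc coefficient, assumed form of (55), with a = 3e / (pi^2 k^2 c T)."""
    g2 = (a * H) ** 2
    num = (k_s**2 / n_s - k_d**2 / n_d
           + g2 * k_s**2 * k_d**2 * (n_s - n_d) / (n_s**2 * n_d**2))
    den = k_s + k_d + g2 * k_s * k_d * (k_s / n_s**2 + k_d / n_d**2)
    return -a * num / den

if __name__ == "__main__":
    # Illustrative numbers only: kappa_0s = kappa_0d = kappa_0 / 2 with kappa_0 = 1.
    n_s, n_d, kappa0 = 2.0, 1.0, 1.0
    H_zero = 2.0 * math.sqrt(n_s * n_d) / kappa0   # sign-change field in units a = 1
    for H in (0.0, H_zero, 100.0 * H_zero):
        print(H, righi_leduc(H, kappa0 / 2, kappa0 / 2, n_s, n_d))
```

Under these assumptions the sign change occurs at $aH\kappa_0 = 2(n_sn_d)^{1/2}$ and the strong-field limit is $-a\kappa_0(n_s-n_d)/[2(n_s^2+n_d^2)]$, while for $n_s = n_d$ the coefficient falls off as $H^{-2}$.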

6·4. Since there are no experimental results on the field variation of $B^i_{EN}$ and
$A_E$, and since the formulae are very complicated, it is not worth while giving such
a detailed discussion as for $A_H$ and $B_{RL}$. The following results give some idea as
to the possible behaviour of the coefficients. Considering first $B^i_{EN}$, neglect the
$H$-variation of $B^i_{EN,s}$, $B^i_{EN,d}$, $\mathfrak{S}_s$ and $\mathfrak{S}_d$, and, as in § 5·3, put $\sigma_{0s} = \sigma_{0d} = \tfrac12\sigma_0$,
$\kappa_{0s} = \kappa_{0d} = \tfrac12\kappa_0$, $n_sB^i_{EN,s} = n_dB^i_{EN,d} = -3\kappa_0/(4c\zeta)$ and $\mathfrak{S}_s = -\mathfrak{S}_d = x\pi^2k^2T/\zeta$,
where $\tfrac13 \le x \le 1$.

Case (i). $n_s = n_d = \tfrac12 n$

Writing $L_n = \tfrac13(\pi k/e)^2$, $L_0 = \kappa_0/\sigma_0T$, $\rho_0 = 1/\sigma_0$, equation (56) reduces to

[Equation (64), and the limiting value (65) of $B^i_{EN}$ for $H \to \infty$, are illegible in the source.]

It is readily shown that $dB^i_{EN}/dH \lessgtr 0$ according as [condition illegible in the source]. Hence, at
high temperatures, as $H$ increases from zero to infinity, $B^i_{EN}$ decreases steadily from the value (51) to
the value (65); at sufficiently low temperatures, however, $B^i_{EN}$ increases as $H$
increases.

Case (ii). $n_s \neq n_d$

In this case the behaviour of $B^i_{EN}$ is complex, but for large values of $H$

[Equation (66), which gives $B^i_{EN}$ for large $H$, is illegible in the source.]

there being no saturation.

6·5. To discuss $A_E$ the same approximations as in § 6·4 are sufficient.

Case (i). $n_s = n_d = \tfrac12 n$

Equation (57) reduces to

[Equation (67) is illegible in the source.]

so that $A_E$ increases steadily with $H$, the increase being proportional to $H^2$ for all
values of $H$.

Case (ii). $n_s \neq n_d$

In this case $A_E$ saturates for large $H$, the saturation value being

[Equation (68) is illegible in the source.]


It is not at present possible to compare the theoretical results of §§ 6·3 to 6·5 with
observation, owing to the lack of sufficient experimental data. Experiments at low 
temperatures on metals whose electrical resistance does and does not saturate in 
strong fields are desirable to determine to what extent the behaviour of real metals 
is in agreement with the predictions of the two-band model. 


7. Survey of the experimental results

7·1. It has already been pointed out in § 4·3 that the only reliable measurements
on the change of the thermoelectric power are for beryllium. These show that the
effect predicted by the single-band model is much too small, but that the results
given by the two-band model are correct as regards order of magnitude.

The data on the transverse effects have been referred to in passing in the preceding
sections. In the present section the results available are collected together in order
to see what conclusions can be drawn from them. The data have been obtained
(unless otherwise mentioned) from the survey articles by Borelius (1935) and Meissner
(1935), where references to the original experimental papers will be found.
Ferromagnetic metals have been excluded from the discussion.

7·2. The relations among the coefficients. As was remarked in § 5·3, the relation
(35) is a consequence of pure thermodynamics and should therefore be of general
validity. Table 1 contains values of the ratio $A_E/A_{EN}T$ derived from the two sources
mentioned above. All the values shown refer to ordinary temperatures; there are
as yet no data at all for low temperatures. According to (35), the quantity shown in
table 1 should have the value unity for all metals. It is evident from the discrepancies
between the two values obtained for one and the same metal that, while there is
order-of-magnitude agreement with the theoretical value, the experimental accuracy
is quite insufficient to allow any quantitative conclusions to be drawn regarding the
extent and significance of the deviations from (35). (The discrepancies are probably
to some extent due to the fact that, since the directly measured Ettingshausen-Nernst
coefficient is $B^i_{EN}$ and not $A_{EN}$, a knowledge of the thermal conductivity $\kappa$
is required for compiling table 1, and $\kappa$ has not, in general, been measured on the
same specimen as was used for measuring the transverse effects.) Thus a detailed
experimental confirmation of (35) will have to wait until the experimental accuracy
has been improved (and the measurements have been extended to cover the whole
temperature range).


Table 1. Values of $A_E/A_{EN}T$ (to verify equation (35))

          $A_E/A_{EN}T$ from                  $A_E/A_{EN}T$ from
metal   Borelius,   Meissner,     metal   Borelius,   Meissner,
        p. 431      p. 383                p. 431      p. 383
Cu      1·05        0·76          Pd      1·23        0·82
Ag      1·22        1·34          Al      1·43        1·73
Au      0·07        0·07          Bi      0·94        2·88
Zn      1·37        4·25          As      0·98        —
Cd      0·75        7·0           Sb      1·25        0·06


Table 2. Values of $B_{RL}/A_H\sigma_0$ and $L_0/L_n$ (to verify equation (45))

          $B_{RL}/A_H\sigma_0$ from                  $L_0/L_n$ from
metal   Borelius, p. 431   Meissner, p. 383   Borelius, p. 379   Meissner, p. 189
Cu      0·78               0·05               0·92               0·93
Ag      0·87               0·40               0·95               0·97
Au      1·03               0·89               0·96               0·96
Zn      0·85               0·70               0·95               1·02
Cd      0·74               1·24               0·99               0·98
Ir      0·09               —                  1·02               1·02
Pd      0·59               0·02               1·00               1·00
Pt      1·9                —                  1·03               1·03
Al      0·09               0·49               —                  0·89
Bi      2·4                0·035 (?)          1·35               1·1 to 1·9
As      —                  0·26               —                  —
Sb      0·39               0·30               —                  1·1 to 1·6
W       —                  0·09               1·25               1·18
Mo      —                  0·74               1·07               —


Table 2 shows some values of the ratios $B_{RL}/A_H\sigma_0$ and $L_0/L_n$ for room temperature.
These ratios should be equal according to equation (45), while at sufficiently
high temperatures (45) should reduce to the old relation (34). The discrepancies
between the two values of $B_{RL}/A_H\sigma_0$ obtained for any one metal are to be contrasted
with the good agreement in the case of $L_0/L_n$ and show clearly that the experimental
data on the transverse effects must be treated with grave suspicion. The experimental
accuracy is obviously insufficient to distinguish between (34) and (45) at room
temperature or to verify or reject either; and to detect any difference between (34)
and (45) one would have to go to temperatures where $L_0/L_n$ is very different from
unity. It must be remembered, however, that (45) is no longer valid for the two-band
model at low temperatures, so that quantitative agreement may be expected, at
best, for the monovalent metals only.

Equation (36), which requires the ratio $A_HA_{EN}/(A_EA_{RL}L_n)$ to be unity, is valid
at all temperatures for free electrons, but only at high temperatures for the two-band
model. The experimental data for room temperature are shown in table 3, and it is
again clear that only order-of-magnitude verification is possible.

Table 3. Values of $A_HA_{EN}/(A_EA_{RL}L_n)$ (to verify equation (36))

          $A_HA_{EN}/(A_EA_{RL}L_n)$ from         $A_HA_{EN}/(A_EA_{RL}L_n)$ from
metal   Borelius,   Meissner,     metal   Borelius,   Meissner,
        p. 431      p. 383                p. 431      p. 383
Cu      1·09        1·86          Pd      1·56        1·17
Ag      0·34        1·56          Al      0·98        0·90
Au      1·45        1·62          Bi      —           1·15
Zn      0·78        3·2           As      —           5·7
Cd      1·69        1·06          Sb      —           4·1


The present position as regards the relations among the coefficients may be summed 
up as follows. The old relations (34) to (36) had previously been derived only for 
free electrons in the limit of high temperatures. The present theory extends the 
treatment in two ways, first by using a more general model and secondly by considering
the whole temperature range. At ordinary temperatures the old relations
are found to be effectively unaltered, and the value of the present treatment lies in 
increasing our confidence in their general validity for real metals. New results are 
obtained for low temperatures, equation (34) being replaced by (45) for free electrons, 
and both (34) and (36) being replaced by more complicated expressions for a metal 
with two overlapping bands; even at low temperatures, however, the new results 
do not differ from the old ones as regards orders of magnitude. The experimental data 
available at present all refer to high temperatures and are not sufficiently reliable 
to be compared in detail with the theory, but there is order-of-magnitude agreement 
for all metals. To verify the low temperature predictions experiments of greatly 
improved accuracy are necessary, care being taken that all the effects are measured 
on the same specimen. 

7·3. The Hall effect in normal metals. There are few reliable measurements of
the Hall coefficients of normal metals at low temperatures or in strong fields. Table 4
contains some values of $A_H$ for various pure metals at different temperatures (but
in constant magnetic field). As regards the field variation, the values shown in
table 4 are found to be fairly accurately independent of $H$, even at low temperatures,
for fields up to about 10,000 gauss (cadmium is exceptional in showing a somewhat
complicated dependence on $H$ which may be due to experimental error).

Table 4. Hall effect in normal metals at different temperatures
$-A_H \times 10^{25}$ (gaussian units) from Meissner, pp. 338-340

metal   290° K    20·3° K   14·6° K
Cu      5·5       7·3       7·3
Ag      8·9       11·3      11·1
Au      8·1       10·9      10·9
Pd      7·6       15·4      15·5
Cd      -6·1      -22·4     -24·8


With the possible exception of cadmium, the temperature variation shown in
table 4 and the absence of any field variation could be explained by using the
expression (58) for the Hall coefficient of the two-band model and assuming that the
relative proportions of the electric current carried by the two bands may vary with
the temperature (compare the discussion in § 5·3). For palladium, for example,
which is a transition element and which is known from magnetic evidence to have
0·5 electrons per atom in the s-band and the same number of vacant levels in the
d-band (Wilson 1939, p. 76), the value of $p$ (the proportion of the total current which
is carried by the s-band) required to account for the observations is 0·69 at room
temperature and 0·88 at very low temperatures; such values are by no means
unreasonable. For the monovalent metals, however, to which the two-band model is
least applicable and where the temperature variation of $A_H$ is in any case not very
pronounced, it is better to use equation (37) of the free electron model (which requires
$A_H$ to be always independent of $T$ and $H$) as an approximation formula, and to
ascribe the observed temperature dependence to some as yet unknown mechanism.
The results for cadmium (if genuine) also seem difficult to fit in with the predictions
of the two-band model.
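
The palladium figures can be checked by simple arithmetic. The sketch below is illustrative only: it assumes (58) in the form $A_H = -(2/nec)(2p - 1)$, obtained by writing $\sigma_{0s} = p\sigma_0$, $\sigma_{0d} = (1-p)\sigma_0$, and reads the values of $p$ as 0·69 at room temperature and 0·88 at very low temperatures. The function names are hypothetical.

```python
def hall_58(p, nec=1.0):
    """Assumed form of (58) with sigma_0s = p * sigma_0: A_H = -(2 / nec) * (2p - 1)."""
    return -2.0 * (2.0 * p - 1.0) / nec

def hall_ratio(p_low, p_room):
    """Ratio of the low-temperature to the room-temperature Hall coefficient."""
    return hall_58(p_low) / hall_58(p_room)

if __name__ == "__main__":
    predicted = hall_ratio(0.88, 0.69)   # values of p quoted in the text
    observed = 15.4 / 7.6                # palladium entries of table 4 (20.3 K and 290 K)
    print(predicted, observed)
```

The predicted ratio is close to 2, which is to be compared with the ratio of about 2·03 between the palladium entries of table 4 at 20·3° K and 290° K.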

There is an almost complete lack of reliable data for liquid helium temperatures,
but one interesting result refers to tin, whose Hall coefficient is very small and
negative at ordinary temperatures but becomes positive at extremely low temperatures.
This behaviour is readily interpreted by means of equation (58) if it is
supposed that $p$ is just greater than $\tfrac12$ at high temperatures but just less than $\tfrac12$ at
low temperatures.

In general it may be said that the present theory gives, in most cases, qualitatively
correct results for the behaviour of $A_H$ over the whole range of temperatures and
magnetic fields, and gives detailed agreement in some special cases. However, no
complete survey will be possible until much additional experimental evidence has
become available, particularly for low temperatures.



7·4. The temperature variation of the transverse effects. The remaining coefficients
have been measured at ordinary temperatures only. Table 5 contains the results of
measurements by Hall (1925, 1937) on all four effects (the same table has been given
by Meixner (1939)). At temperatures above the Debye temperature all the quantities
shown in table 5 should be independent of $T$. This follows from our discussion
of the one- and two-band models in § 5, but a general proof for the anisotropic case
has been given by Meixner (1939), and the result may hence be expected to be a
general one for all metals. The table shows that the observed temperature dependence
agrees with the theoretical predictions within the probable limits of experimental
error, both for the monovalent metals Au and Cu and for the transition
element Pd (a curious exception, however, is $A_E/T$ for Au).


Table 5. The temperature variation of the transverse effects at high temperatures
(gaussian units)

metal            temperature   $10^{25}A_H$   $TB_{RL}$   $B^i_{EN}$   $A_E/T$
                 (° K)
Au, Θ ≈ 175      345           -7·73          —           —            -0·39
                 330           —              -8·68       -6·03        —
                 298           —              -8·94       -6·03        —
                 290           -7·83          —           —            -1·38
Cu, Θ ≈ 315      358           -6·98          -1·29       -6·7         -1·60
                 328           -5·97          -1·32       -7·0         -1·57
                 298           -5·96          -1·45       -7·2         -1·60
Pd, Θ ≈ 270      340           —              -1·29       +11·2        —
                 337           -9·38          —           —            +20·3
                 318           -9·50          -1·33       +10·9        +18·7
                 299           -9·39          —           —            +18·0
                 298           —              -1·45       +10·9        —

[The powers of ten multiplying the last three columns are illegible in the source.]


While, at ordinary temperatures, the present theory merely serves to confirm
Meixner's general predictions for two special cases, it provides the first theoretical
results for the low temperature region. These have been discussed in § 5; they show
that the temperature variation is, in general, more complicated than at high temperatures
and that there are characteristic differences in the behaviour of the one-
and two-band models, especially as regards B_EN and A_E. (It must be remembered,
however, that no model has yet been worked out for which B_EN and A_E are negative
at high temperatures, and it is hence dangerous to place too much reliance on the
temperature variation predicted by our formulae when considering, e.g., the noble
metals.) An extension of Hall's measurements to temperatures well below the
Debye temperature (and to other mono- and multivalent metals) may be expected
to lead to interesting results, but further discussion must be postponed until such
measurements have become available.

The author wishes to thank Mr A. H. Wilson for suggesting this work and for 
much help with the preparation of the manuscript. 




512 E. H. Sondheimer 


References

Borelius, G. 1935 Handbuch der Metallphysik, vol. 1.1, part II: 'Grundlagen des metallischen
Zustandes. Physikalische Eigenschaften'. Leipzig: Akademische Verlagsgesellschaft
M.B.H.

Bridgman, P. W. 1924 Phys. Rev. 24, 644.

Dube, G. P. 1938 Proc. Camb. Phil. Soc. 34, 669.

Grüneisen, E. & Erfling, H. D. 1939 Ann. Phys., Lpz., (5), 36, 367.

Hall, E. H. 1925 Phys. Rev. 26, 820.

Hall, E. H. 1937 Proc. Nat. Acad. Sci., Wash., 23, 600.

Jones, H. 1936 Proc. Roy. Soc. A, 155, 663.

Kohler, M. 1940 Ann. Phys., Lpz., (5), 38, 283.

Kohler, M. 1941a Z. Phys. 118, 37.

Kohler, M. 1941b Z. Phys. 118, 312.

Kohler, M. 1941c Ann. Phys., Lpz., (5), 40, 1.

Makinson, R. E. B. 1938 Proc. Camb. Phil. Soc. 34, 474.

Meissner, W. 1935 Handbuch der Experimentalphysik, vol. 11.2, part II: 'Galvano- und
thermomagnetische Effekte'. Leipzig: Akademische Verlagsgesellschaft M.B.H.

Meixner, J. 1939 Ann. Phys., Lpz., (5), 35, 701.

Peierls, R. 1931 Ann. Phys., Lpz., (5), 10, 97.

Sommerfeld, A. & Bethe, H. 1933 Handbuch der Physik (2nd ed.), vol. XXIV/2, chapter 3:
'Elektronentheorie der Metalle'. Berlin: Julius Springer.

Sommerfeld, A. & Frank, N. H. 1931 Rev. Mod. Phys. 3, 1.

Sondheimer, E. H. & Wilson, A. H. 1947 Proc. Roy. Soc. A, 190, 436.

Thomas, W. R. & Evans, E. J. 1933 Phil. Mag. 16, 329.

Wilson, A. H. 1936 The theory of metals. Cambridge University Press.

Wilson, A. H. 1937 Proc. Camb. Phil. Soc. 33, 371.

Wilson, A. H. 1939 Semiconductors and metals. Cambridge University Press.


Measurements in the reaction zone of a Bunsen flame
By H. Klaukens and H. G. Wolfhard
(Communicated by Sir Alfred Egerton, Sec.R.S. - Received 7 November 1947)

[Plate 13]

It is possible to make direct measurements in the reaction zone of a Bunsen flame by
using a 'low-pressure flame', because the flame is then drawn out and enlarged. The
temperature-course through the flame front is measured. The position of the Schlieren
picture is deduced and may be compared with the actual photographed Schlieren picture.
The heat flow in the reaction zone is calculated; it is found responsible for the formation
of a considerable preheating zone.


1. Introduction 

The behaviour of an ordinary Bunsen flame burning with acetylene and oxygen
or air at low pressure was described in a former publication (Wolfhard 1943). It
was shown that it is possible to burn a mixture of acetylene-oxygen or of acetylene-air
down to a few mm. Hg pressure, using a burner of large diameter in a closed
vessel. The diameter of the burner must be increased proportionately (figure 1) in
order to obtain a stable flame as the pressure (P) is lowered. The linear extension
(δ) of the flame is approximately δ ~ 1/P in comparable regions of stability, as is
seen in figure 1. The regions of stability come together at a certain minimum gas
flow which depends on the pressure. In the lowest part of the stability region, the



Figure 1. Stability regions of the C2H2-air flame for stoichiometric mixture. Ordinate: velocity
of the cold mixture in the burner pipe. The numbers in the stability region are the mass-flow
of C2H2. φ is the diameter of the burner pipe.




[Figure 2 labels: inner cone (flame front); top of burner; C2H2-O2 flame; C2H2-air flame.]

Figure 2. The shape of a flame near the 'minimum point': pressure 20 mm. Hg.


flame has an unusual appearance and consists of the reaction zone with the hot
gases above it. The outer cone does not exist because the flame is separated. The
inner cone lies like a flat disk on the top of the burner (figure 2). It is a very
difficult problem to make optical measurements in the reacting zone as long as the
inner cone remains cone-shaped. If, however, the reaction zone is effectively a flat
disk, then the problem becomes very simple because different parts of the reaction
zone do not overlap. The reaction zone which has at 1 atm. a thickness of 
about jfo mm. grows bigger when the pressure diminishes. The thickness d is also 
nearly proportional to 1 /P, so that at 2 mm. Hg we have 1 cm. (the thickness 
depends also on the composition of the mixture). Such a thick and flat reaction 
zone is very favourable for the study of its physical and chemical properties. 
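The inverse proportionality between thickness and pressure can be checked with a short calculation. The sketch below is only an illustration: the function name `scaled_length` is hypothetical, and the rule d ~ 1/P is taken as exact, whereas the text calls it only nearly so. Starting from the quoted 1 cm. at 2 mm. Hg, it scales the thickness back to 1 atm. (760 mm. Hg).

```python
def scaled_length(length_ref_mm, pressure_ref_mmhg, pressure_mmhg):
    """Scale a flame length, assuming it is inversely proportional to pressure.

    This treats d ~ 1/P as exact, which is an idealisation of the
    'nearly proportional' behaviour reported in the text.
    """
    if pressure_ref_mmhg <= 0 or pressure_mmhg <= 0:
        raise ValueError("pressures must be positive")
    return length_ref_mm * pressure_ref_mmhg / pressure_mmhg


# Reference point from the text: 1 cm. (10 mm.) thickness at 2 mm. Hg.
thickness_at_1_atm_mm = scaled_length(10.0, 2.0, 760.0)
print(f"estimated thickness at 760 mm. Hg: {thickness_at_1_atm_mm:.3f} mm.")
```

Under this assumption the result is a few hundredths of a millimetre, the order of magnitude stated for the reaction zone at 1 atm.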

The reaction zone is visibly not homogeneous. With a stoichiometric mixture it
gives, starting from below, a weak green light corresponding to the C2-bands; in
the middle of the zone the radiation consists essentially of the strong C2 and CH
bands, giving a nearly white light; at the top end of the zone the C2-bands disappear
and the CH bands are dominant, giving a blue corona (figure 3). The thickness for
an acetylene-air flame is somewhat greater than that for an acetylene-oxygen
flame, and the diameter of the burner on which the flame must burn is also bigger;
hence, at a given pressure, the acetylene-air flame gives a thicker optical layer.

[Figure 3 labels, from top to bottom: blue (weak radiation); white (very bright); green (weak radiation).]

Figure 3. The zones of a C2H2-O2 flame near the 'minimum point'.



2. The position of the Schlieren picture

Van der Poll & Westerdijk (1941) published a Schlieren photograph of a
propane-air flame burning at 1 atm., which showed that the Schlieren picture and
the picture of the 'luminous cone' are not the same: the Schlieren picture lies
distinctly inside the luminous cone. This result was unexpected and important. The
Schlieren image must be situated where the gradient of the refractive index (n) is
greatest. The refractive index is changed in the reaction zone by the rise in temperature
as well as by the change of the components. The temperature rises from about 300 to about
3000° K for an acetylene-oxygen flame. The density after the reaction is therefore
1/10 of the previous density. The refractive indices of different gases such as C2H2,
O2, CO, CO2, H2O, vary from (n − 1) × 10^6 = 257 to 600 at 1 atm. and 0° C. It can
only be decided by experiment whether the change of composition or temperature
is having the predominant effect in the different parts of the reaction zone. In any
case, something important is happening in front of the inner cone before the
molecules begin to radiate.




It was necessary first to confirm that this result was not only a property of the
propane-air flame, and it was shown that the interesting acetylene low pressure
flames already described behaved similarly. At low pressure the absolute change
of (n − 1) in the reaction zone is very small, therefore a very sensitive arrangement
was necessary (figure 4). The distance from the slit to the Schlieren blind was about
10 m. The whole light beam was shielded by a paper tube to avoid movements of
the air. Observation was easy for acetylene-air flames and propane-air flames, but
the light emitted from the acetylene-oxygen flame at 1 atm. was so strong that the
Schlieren light, even from a big mercury high-pressure lamp, was scarcely visible.
Therefore, for the acetylene-oxygen flame, Schlieren photographs were taken only
with an excess of oxygen, so that radiation was reduced. Another difficulty was
that the biggest burner for a laminar burning acetylene-oxygen flame at 1 atm. had
about 1 mm. diameter, so that the picture of the cone became very small. The two
images were usually photographed together, the Schlieren blind being arranged to
be drawn from above.
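The need for a sensitive arrangement at low pressure can be made concrete with a rough estimate. The sketch below assumes that (n − 1) is proportional to gas density and that the gas is ideal, so that (n − 1) scales as pressure divided by absolute temperature; the function name `refractivity` and the value 300 × 10^-6 are illustrative assumptions, the latter chosen only to lie inside the range 257 to 600 quoted in the text. Changes of composition are ignored.

```python
def refractivity(refractivity_stp, pressure_mmhg, temperature_k):
    """Return (n - 1) at a given state, assuming it scales with gas density.

    refractivity_stp is (n - 1) at 760 mm. Hg and 273.15 K. The ideal-gas law
    makes the density proportional to pressure / temperature.
    """
    return refractivity_stp * (pressure_mmhg / 760.0) * (273.15 / temperature_k)


# Illustrative value inside the 257e-6 to 600e-6 range quoted in the text.
N_MINUS_1_STP = 300e-6

# Cold mixture (about 300 K) and burnt gas (about 3000 K) at 40 mm. Hg.
cold = refractivity(N_MINUS_1_STP, 40.0, 300.0)
hot = refractivity(N_MINUS_1_STP, 40.0, 3000.0)

print(f"change of (n - 1) at 40 mm. Hg:  {cold - hot:.2e}")
print(f"same change at 760 mm. Hg:       {(760.0 / 40.0) * (cold - hot):.2e}")
```

The tenfold temperature rise reduces (n − 1) to one-tenth in both cases, but the absolute change available to the Schlieren arrangement falls in proportion to the pressure.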


[Figure 4 labels: Hg high-pressure lamp; slit; filter which passes only the green Hg line.]

Figure 4. Arrangement for the photograph of the Schlieren picture.

Description of the results 

( 1 ) Acetylene-air flame 

(a) At 1 atm.: The Schlieren image always lies far ahead of the luminous cone,
even with excess of oxygen or acetylene: in figure 5, plate 13, the inner cone
is the Schlieren image. The distance of the intensity maximum of the Schlieren
image from that of the luminous cone becomes a little smaller when the amount
of acetylene is decreased.

(b) At low pressure: Below 40 mm. Hg the density, and therefore the absolute
change of (n − 1), is too small for a Schlieren image to be obtained without extreme
sensitivity. Figure 6, plate 13, shows an inner cone at about 80 mm. Hg. For a flat
cone at 40 mm. Hg the Schlieren image lies about 7 mm. in front of the luminous cone.





(2) Propane-air flame 

Propane-air flames always show a big separation between the two images. 

(3) Acetylene-oxygen flame 

(a) At 1 atm.: The above-mentioned difficulties prevented photographic work,
so that only visual observation was possible. With excess oxygen, the Schlieren
image was inside the cone; the separation was, however, essentially smaller than
for an air flame.

(b) At low pressure: It was not possible to photograph both cones together,
because they partially overlap, although the Schlieren image lies somewhat below.
It is easy to explain why the separation between the two images is only observable
with difficulty for an oxygen flame in comparison with an air flame. The rise of
temperature in the reaction diminishes the refractive index. This is, however,
counterbalanced by the increase of the refractive index if intermediate organic
compounds with higher refractive index are present at the beginning of the reaction.
In an air flame, however, the (n − 1) curve is caused almost solely by the temperature,
because the amount of N2 present in the air does not permit so much absolute
change of the refractive index by intermediate components, which must have a
considerably lower concentration. The change of the refractive index by the temperature
is however nearly the same as in an oxygen flame, because the ratio of the temperature
of the cold mixture and the hot gases is in both cases nearly the same.

The separation of the Schlieren image and the luminous cone is, therefore,
a property common to acetylene and propane flames and probably to all hydrocarbon
flames, indicating that a rapid rise of the temperature must begin appreciably
before the luminosity arises.

In the Schlieren cone the refractive index must become smaller in an upward
direction, because the light is refracted downward, as is indicated by the fact that
the Schlieren blind must be drawn from above. The temperature effect is therefore
predominant.

3. The temperature distribution in the reaction zone 

The thick and flat nature of the reaction zone of flames burning at a few mm. Hg 
allows a reasonably accurate measurement of the rise in temperature by means of 
a thermocouple. It would not be expected that at these low pressures the reaction 
will take a completely different course because the normal flame velocity is the 
same in the whole region between 760 and a few mm. Hg for an acetylene-oxygen 
flame and nearly the same for an acetylene-air flame. The advantages of conducting 
experiments at low pressure are the following:

(1) At 1 atm. the reaction zone has a thickness of some hundredths of a mm. 
and therefore a temperature gradient of about 10* to 10* °C/cm. In such a tem¬ 
perature field, even with very thin wires, there is such a high heat flow that all 
details are lost. In the low pressure flame the temperature gradient is more than a
hundred times as small, so that there is a possibility of measuring the real increase
of the temperature.

(2) At 1 atm., even in an air flame, the temperature of the wires becomes
so high that a platinum/platinum-iridium thermocouple will melt. Furthermore,
platinum wires give a high catalytic effect which will distort the course of the real
temperature changes. At low pressure, the heat transfer from gas to wire is
smaller, so that the temperature difference between the gas and the wire increases.
In an acetylene-oxygen flame at about 5 mm. Hg the temperature of the thermocouple
does not rise higher than about 1300° C, so that Hoskins thermocouples
(Chromel-Alumel) or Ni-NiCr can be used, the real temperature being estimated
afterwards.

(3) The presence of the thermocouple in the flame influences its shape. If the 
temperature rise is to be measured, the junction of the thermocouple must be 
visible with a telescope in the image of the luminous flame. In an ordinary flame 
the surroundings at the tip of the thermocouple are changed in the manner shown 
in figure 7, hence it is very difficult to measure the position of the thermocouple 
relative to the flame front. However, if the flame is near the minimum point of the 
stability region, it has the smallest extension possible at this pressure. Each part 
of the reaction zone influences the other; if this were not the case, then the flame 
might be supposed to consist of two independent parts and only one such part 
would be the smallest flame. The effect of this is that the thermocouple indeed 
influences the flame front, but only in such a way that the whole flame front rises 
a little when it is introduced, and the effect is easily corrected. 



Figure 7. The influence of the thermocouple on the flame in different 
parts of the stability region. 


For the measurement of the temperature, the procedure was as follows (figure 8): 
the thermocouple was brought into the flame from below, supported by tubes of
ceramic, which were put one into the other like a telescope. The flow of the mixture 
of the gases was therefore interfered with as little as possible. The Reynolds number 
referred to the diameter of the wire was about ^. The thermocouple consisted of 


1/10 mm. Ni-NiCr or 3/10 mm. Hoskins: both were used to eliminate the influence of
the heat conductivity in the wire. There was a small difference between the measurements,
but sufficiently small that the 1/10 mm. wire could be taken to give nearly
the correct temperature distribution across the flame, disregarding the influence
of radiation. The influence of radiation cannot be eliminated by reducing the
diameter of the wire, because both the radiation and the heat transfer from the
flame to the wire are surface effects, in contrast to the heat conductivity in the wire,
which is a volume effect. The thermocouple could be moved up and down so that
it was possible to make measurements across the flame front. The thermoelectric
e.m.f. was measured with a millivoltmeter whose total resistance was about 5000 Ω,
and the loss of e.m.f. by the resistance of the thermocouple could be neglected.






The results are shown in figures 9a and 9b. The abscissa shows the vertical extension
of the flame in mm.; the ordinate the temperature in degrees centigrade and also
in 1/T (T = ° abs. temp.), and with the scale at the right-hand side the calculated
intensity of the Schlieren image (i.e. the slope of the 1/T curve). The extension of
the flame is plotted in the figures as it is seen in the telescope. The top of the burner
is marked; the two lightly shaded inner limits are the contrast zones of the luminous
cone: between these limits the intensity is greatest. The two outer limits show the
first and last impression of light which could be seen in the telescope. The supposed
intensity distribution of the radiation of the luminous zone is drawn in dotted lines.
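The construction of the calculated Schlieren intensity from the slope of the 1/T curve can be sketched numerically. The example below uses an invented, smooth temperature profile (300 to 2000 °K across 10 mm.); it is not the measured curve of figure 9, and the helper names are hypothetical. It shows only that the steepest part of the 1/T curve lies upstream of the steepest part of the temperature curve itself, because a given temperature step changes 1/T most where T is still low.

```python
import math


def central_gradient(xs, ys):
    """Central-difference slope dy/dx at interior points, as (x, slope) pairs."""
    return [
        (xs[i], (ys[i + 1] - ys[i - 1]) / (xs[i + 1] - xs[i - 1]))
        for i in range(1, len(xs) - 1)
    ]


def position_of_max(pairs):
    """Return the x at which the slope has its largest magnitude."""
    return max(pairs, key=lambda pair: abs(pair[1]))[0]


# Hypothetical profile: a smooth rise from 300 K to 2000 K centred at 5 mm.
xs = [0.05 * i for i in range(201)]  # height in mm.
temps_k = [300.0 + 1700.0 / (1.0 + math.exp(-(x - 5.0) / 0.8)) for x in xs]

# Schlieren intensity is taken as the slope of 1/T, as in the text.
x_schlieren = position_of_max(central_gradient(xs, [1.0 / t for t in temps_k]))
x_steepest = position_of_max(central_gradient(xs, temps_k))

print(f"maximum slope of 1/T at {x_schlieren:.2f} mm.")
print(f"maximum slope of T   at {x_steepest:.2f} mm.")
```

With a real data set the lists `xs` and `temps_k` would be replaced by the corrected thermocouple readings.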

Attention had to be paid to catalytic effects. These were demonstrated by
a platinum wire brought into the flame from below. As soon as its upper end had
become red hot, the whole wire glowed down through the whole mixture. The
measurements were commenced some cm. below the flame in the cold mixture.
The thermocouple was then always raised about 0.5 mm.: the millivoltage, as well
as the position of the end of the thermocouple as seen in the telescope, being
observed. The voltmeter readings usually vary rapidly. Sometimes, however, it
took nearly 10 sec. before the reading stopped changing, thereby indicating that
catalytic effects were falsifying the result. A further check as to the presence of
catalytic effects is the following: if the thermocouple is far inside the flame, then
the leads glow down to the beginning of the reaction zone. The Ni wire showed more
catalytic effect than the Ni-Cr wire, the latter bearing an oxide layer. If the Ni wire
showed this glow for a longer distance than the Ni-Cr wire then catalytic effects
were present. Only flames with an excess of acetylene showed such catalytic effects.

The correction for the error due to radiation is made in the following manner: the
end of the thermocouple is a little fused bead of 0.6 mm. diameter between 0.1 mm.
Ni and Ni-Cr wires and is covered with an oxide layer from welding. The heat
loss is:

$$L = F \sigma T^{4} \varepsilon,$$

where ε = emission coefficient, F = area of the sphere.

The heat transfer to the wire is:

$$Q = \alpha F (T_2 - T_1),$$

where T_2 = temperature of gas, T_1 = temperature of thermocouple.

The heat transfer coefficient (α) depends on Nusselt's number (Nu):

$$Nu = \frac{\alpha d}{\lambda},$$

where d = diameter of the sphere, λ = heat conductivity coefficient.

The Nu number depends on the Reynolds number, which can be calculated exactly:

$$Re = \frac{v d \rho}{\eta},$$

where v = velocity of the gas in the flame, d = diameter of the sphere,
ρ = density of the gas, η = viscosity.





The dependence Nu = f(Re) has only been measured for a long straight wire by
Hilpert (1933) and King, and also only for Reynolds numbers down to about 1. The
dependence of the Nusselt number on the Reynolds number is assumed to be nearly
the same for a straight wire as for a sphere, so that the difference in temperature
between the thermocouple and the flame can be estimated. It is not necessary for
this correction to be absolutely exact, because in these measurements only the
relative vertical distribution of the temperature is of interest.
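The radiation correction described above amounts to a steady-state heat balance on the bead: the heat received from the gas equals the heat radiated. A minimal sketch follows, assuming that the temperature in the radiation term is that of the thermocouple. The function name and every numerical input (emission coefficient 0.5, Nusselt number 2, conductivity 0.1 W/m·K) are illustrative assumptions, not values taken from the paper; only the 0.6 mm. bead diameter is from the text. SI units are used for convenience.

```python
STEFAN_BOLTZMANN = 5.670e-8  # W m^-2 K^-4


def gas_temperature(t_couple_k, emissivity, nusselt, conductivity_w_mk, bead_diameter_m):
    """Estimate the gas temperature from the thermocouple reading.

    Balances convective gain, alpha * F * (T_gas - T_couple), against
    radiative loss, emissivity * sigma * F * T_couple**4. The bead area F
    cancels, and alpha follows from Nu = alpha * d / lambda.
    """
    alpha = nusselt * conductivity_w_mk / bead_diameter_m
    radiative_loss_per_area = emissivity * STEFAN_BOLTZMANN * t_couple_k ** 4
    return t_couple_k + radiative_loss_per_area / alpha


# Hypothetical inputs; only the 0.6 mm. bead diameter comes from the text.
estimate = gas_temperature(
    t_couple_k=1400.0,
    emissivity=0.5,
    nusselt=2.0,
    conductivity_w_mk=0.1,
    bead_diameter_m=0.6e-3,
)
print(f"estimated gas temperature: {estimate:.0f} K")
```

Because the bead area cancels from the balance, the correction depends on the bead diameter only through the heat transfer coefficient, which is consistent with the remark that both radiation and heat transfer are surface effects.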



Figure 9a. Temperature distribution of a C2H2-O2 flame. Pressure 4.3 mm. Hg.
1.90 cm.3/sec. C2H2. 12.2 cm.3/sec. O2.

Consider now the course of the temperature curve in figure 9b, beginning some
cm. below the flame in the cold mixture. The thermocouple shows a temperature
perhaps 20° C too high. The whole vessel is heated and therefore also the quartz
tube containing the thermocouple. By heat radiation it acquires a somewhat
higher temperature than the mixture is supposed to have. At 12.4 cm. the rise of
temperature begins. The position in which this first rise takes place is definite and
is not affected by external factors. The temperature increases now with a gradient
of about 1.5 × 10^3 °C cm.^-1. By comparison of the temperature gradient obtained
with the 0.3 and 0.1 mm. wire thermocouple the true gradient is estimated to be
about 10 % higher than represented in figure 9b. A correction of such a size does
not cause much change in the character of the curve. Catalytic effects do not seem to
affect this measurement. A short distance beyond the reaction zone the temperature
decreases again due to heat loss by radiation and the heat conductivity of the gas.
Plotting 1/T, the gradient of the density, and with it the gradient of the refractive
index, is obtained. The maximum of this gradient gives the position of the Schlieren


Figure 9b. Temperature distribution of a C2H2-air flame. Pressure 14.8 mm. Hg.
115 cm.3/sec. C2H2. 16.5 cm.3/sec. air.


image. It is seen that the Schlieren image should be far in front of the visible flame
front, as is observed in the experiment. Thus it is shown that the temperature rise
begins far before the radiation from the C2 and CH molecules commences. Figure 9a
gives a temperature curve of an oxygen-flame at 4.3 mm. Hg. Figure 10 gives the
distance of the maximum of the Schlieren image from the intensity maximum of
the luminous region, depending on the percentage of the acetylene in air or oxygen.
Especially good agreement is shown by the fact that the distance decreases a little
with decrease of the acetylene in the mixture. It was not possible to use the same
excess of acetylene with an oxygen flame as for an air flame, so both curves do not
cover the same range, but it will be noticed that the distance between the centre
of the Schlieren image and the luminous cone is smaller for an oxygen than for an
air flame. For this comparison the pressure of the flames was reduced to 10 mm. Hg,
making use of the relation that the linear extension of the flame is inversely
proportional to the pressure. This fact, together with the possibility of the presence
of intermediate organic compounds (see above), explains the difficulties of the
separation and the partial overlapping of the two graphs for an oxygen-flame.



[Figure 10 axis: abscissa A, from 0 to 3.]

Figure 10. Distance between centre of Schlieren and flame-light pictures, and thickness of
flame-light cone. Curves: complete thickness of flame-light cone; distance between top of
Schlieren picture and flame-light cone. Pressure reduced to 10 mm. Hg. A = air/(11.9 × C2H2)
for air flame, A = O2/(2.5 × C2H2) for oxygen flame.

The determination of the temperature distribution across the acetylene-oxygen
flame (figure 9a) is more accurate than that for the acetylene-air flame (figure 9b)
because the pressure is lower, and therefore the extension of the flame is longer.
Therefore, this curve is worth more detailed consideration.

Errors due to the radiation from the thermocouple must first be corrected, and
so the curve t (corr.) obtained (figure 11). The highest temperature may be wrong
by about 300° C, but this does not change the character of the curve. In the




Klaukens and Wolfhard    Proc. Roy. Soc. A, volume 193, plate 13

Figure 5. Schlieren picture of a C2H2-air flame at 1 atm.

Figure 6. Schlieren picture of a C2H2-air flame at 80 mm. Hg.

(Facing p. 522)





middle part of the curve (about 2.0 cm.) there is a temperature gradient of 250° C/mm.
This means that a heat current B streams in the direction of the cold mixture:

$$B = \lambda F \frac{\partial t}{\partial x},$$

where λ = heat conductivity coefficient, F = area of the flame,
∂t/∂x = temperature gradient.

[Figure 11 axes and curve labels: ordinate ticks 0 to 3000; abscissa, vertical extension of the flame (cm.), ticks 4 to 0. Curves: t (°C uncorr.); real course of temp. (t corr.); temp. rise by heat conductivity; temp. rise by reaction; velocity of reaction.]

Figure 11. The heat conductivity in the flame. C2H2-O2 flame. 4.3 mm. Hg.
1.90 cm.3/sec. C2H2. 12.3 cm.3/sec. O2.







Dividing the vertical extension of the flame into different sections,
(1 to 2); (2 to 3); (3 to 4) ... and considering the section (9 to 10), the slope of the
curve is nearly the same at position 9 as at 10, therefore as much heat is coming
in at 10 as going out at 9. The rise of the temperature (about 500° C) is therefore
caused entirely by the reaction. The change occurs where the slope of the curve is not
constant. Considering the section (6 to 7), at 7 there is a heat flow of 7.4 kcal./h;
at 6 a heat flow of B = 3.0 kcal./h. The difference of 4.4 kcal./h increases the temperature
by about 270° C. The measured rise of temperature is the same within the accuracy of the
measurement. This means that no reaction is necessary in this region to explain
the temperature rise. Such calculations can be made for all sections, and so the
result is obtained that up to the centre of the section (7 to 8) the whole rise of the
temperature can be explained by the heat flow due to conductivity alone. The
dotted curve gives the rise of the temperature due to the change of the slope of the
measured temperature curve and, to a smaller extent, to the temperature dependence
of the heat conductivity coefficient. Considering next section (12 to 13), in this case
more heat is coming out than is going in. This means that the heat flow from that
region reduces the temperature rise which would be caused by the reaction alone.
The dotted line therefore drops in this part of the flame zone. The difference
between the measured temperature curve and the dotted line is the temperature
curve which would be caused by the reaction alone. The slope of this curve is the
reaction velocity (strong line). The accuracy of the reaction velocity curve obviously
cannot be very high, and it is therefore unprofitable to consider any further the
reaction velocity curve in relation to the location of the visible flame picture; that
must wait until more accurate measurements are available. These could be obtained
by conducting experiments at about 1 mm. Hg, or with thinner thermocouples.
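The section-by-section heat balance can be written as a one-line calculation: the net heat conducted into a section, divided by the heat capacity of the gas passing through it per unit time, gives the temperature rise attributable to conduction. The sketch below is an illustration under stated assumptions: the gas flows of figure 9a are taken as measured at 0° C and 1 atm., and a mean molar heat of 7.9 cal./mol.°C is assumed for the mixture. Neither assumption is stated in the paper, and the function names are hypothetical.

```python
def heat_capacity_flow(volume_flow_cm3_s, molar_heat_cal_mol_k, molar_volume_cm3=22414.0):
    """Heat capacity of the gas stream in kcal. per hour per degree.

    Assumes an ideal gas whose volume flow is quoted at 0 C and 1 atm.
    """
    mol_per_hour = volume_flow_cm3_s * 3600.0 / molar_volume_cm3
    return mol_per_hour * molar_heat_cal_mol_k / 1000.0


def section_temperature_rise(heat_in_kcal_h, heat_out_kcal_h, capacity_kcal_h_k):
    """Temperature rise in a section caused by net conducted heat alone."""
    return (heat_in_kcal_h - heat_out_kcal_h) / capacity_kcal_h_k


# Flows from figure 9a (C2H2 + O2); the molar heat is an assumed mean value.
capacity = heat_capacity_flow(1.90 + 12.2, molar_heat_cal_mol_k=7.9)

# Section (6 to 7): 7.4 kcal./h entering at 7, 3.0 kcal./h leaving at 6.
rise = section_temperature_rise(7.4, 3.0, capacity)
print(f"temperature rise from conduction alone: {rise:.0f} C")
```

With these assumptions the result is of the same order as the 270° C quoted for section (6 to 7); closer agreement should not be expected, since the reference conditions of the flow and the molar heat are guesses.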

The facts which can be gathered from this investigation are: 

1. The heat flow heats the cool mixture to a temperature of about 800° C. The
usual ignition temperatures lie at about 300° C for acetylene-oxygen and acetylene-air
flames, but at 1 atm. with an oxygen flame we have induction times of about
1 × 10^-6 sec. in the flame, and it is very uncertain if a temperature of 800° C is
sufficient to explain the start of inflammation as a thermal explosion.

2. The heat flow forms a preheating zone which is larger in an air flame than in
an oxygen flame.

3. The Schlieren picture appears in the first temperature rise, at about 200° C.
In this region the whole temperature rise is due to heat conductivity, and the
reaction velocity seems to be zero.

We wish to thank Sir Alfred Egerton for his interest in the work and for his 
help in its preparation for publication. 

References 

Hilpert, R. 1933 Forsch. IngWes. p. 216.

Van der Poll, A. N. T. & Westerdijk, T. 1941 Z. techn. Phys. 22, 29.

Wolfhard, H. G. 1943 Z. techn. Phys. 24, 208.



The influence of incombustible vapours on the limits of
inflammability of gases and vapours in air

J. H. Burgoyne, D.Sc. and G. Williams-Leir

Department of Chemical Engineering and Applied Chemistry,
Imperial College, London

(Communicated by Sir Alfred C. Egerton, Sec.R.S. - Received 12 November 1947)


The influence of some incombustible vapours upon the upward propagation of flame through
mixtures of combustible gases and vapours with air contained in a 11 in. glass tube has been
examined and some comparisons made with results in a 4 in. tube. From the data obtained
with the smaller tube it has been possible to plot diagrams for each of the combustibles with
inert vapour and air showing completely the compositions of inflammable and non-inflammable
mixtures. From these, the limiting 'safe' composition of mixtures of inert vapour with air
and with the combustible respectively have been derived.

The combinations studied were hydrogen, methane, ethylene, carbon monoxide, n-hexane,
cyclohexane and benzene with methyl bromide; hydrogen and n-hexane with
difluorodichloromethane, and methane with carbon tetrachloride. A sample of coal gas, the composition
of which was determined, has also been examined with methyl bromide as the inert vapour,
and it is shown that the limit curve agrees fairly well with that calculated by Le Chatelier's
rule from the curves for its constituents.
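Le Chatelier's rule, in its usual form, states that the limit L of a mixture of combustibles satisfies 100/L = Σ p_i/L_i, where p_i is the percentage of constituent i in the combustible part of the mixture and L_i its own limit in air. The sketch below illustrates the rule only; the function name is hypothetical, and the limits used (4 % and 12 %) are round illustrative figures, not results from this paper.

```python
def le_chatelier_limit(percentages, limits):
    """Limit of inflammability of a mixed combustible by Le Chatelier's rule.

    percentages: share of each constituent in the combustible (any common
                 scale; they are normalised to 100 here).
    limits:      limit of each constituent alone in air, per cent by volume.
    """
    if len(percentages) != len(limits) or not percentages:
        raise ValueError("need one limit per constituent")
    total = sum(percentages)
    if total <= 0 or any(limit <= 0 for limit in limits):
        raise ValueError("percentages and limits must be positive")
    return 100.0 / sum(
        (100.0 * share / total) / limit for share, limit in zip(percentages, limits)
    )


# Illustrative figures only: equal parts of two gases with limits 4 % and 12 %.
mixed_limit = le_chatelier_limit([50.0, 50.0], [4.0, 12.0])
print(f"calculated limit of the mixture: {mixed_limit:.2f} % in air")
```

The same function may be applied point by point along the limit curves of the constituents to build a calculated curve for a gas of known composition, which is the kind of comparison the summary describes for coal gas.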


Introduction 

It is well known that the limits of inflammability of combustible gases in air are 
narrowed and ultimately extinguished by additions of incombustible gas or vapour. 
A large amount of actual data on this subject has been collected by Coward 
& Jones ( 1938 ) and there have been more recent publications on the subject too 
numerous to mention individually. 

Recent tendencies in connexion with fire extinguishing techniques have however 
aroused interest in certain materials, concerning which the literature provides only 
limited information. Outstanding among these is methyl bromide. Another material 
which has practical possibilities as a fire extinguisher at the present time is
difluorodichloromethane (freon 12), which has hitherto found its chief use as a refrigerant.

Dufraisse, Vieillefosse & Le Braz (1933) showed that, under specified conditions,
10 % of methyl bromide in the air supply extinguished the flame of coal gas
burning at a jet, and Olsen & Graddis (1938) published the results of some measurements
relating to the suppression of flame propagation in methane-air mixtures.
On plotting their figures in a suitable manner, it is found that they indicate that
the presence of 5.8 % of methyl bromide in air suppresses flame propagation with
methane in any proportion, and that the presence of 41.2 % in methane suppresses
flame propagation in air.

This incomplete information has now been supplemented by systematic measurements
from which limits of inflammability have been plotted completely for
ternary systems comprising methyl bromide and air, together with representative
combustible gases and vapours. The combustibles concerned are hydrogen, carbon
monoxide, methane, ethylene, a sample of coal gas of known composition, and the
vapours of n-hexane, cyclohexane and benzene.

Measurements have also been made with freon, but referring only to hydrogen and
n-hexane as the combustibles. However, the systematic measurements with methyl
bromide, as well as existing data with carbon dioxide and nitrogen as extinguishers,
allow a reasonable estimate of its effect on other combustibles to be made from
these two cases along with those of methane (Jones & Perrott 1930), propane and
n-butane (Jones & Scott 1946).

Finally, the effect of carbon tetrachloride on methane-air flames has been studied. 
This has already been done by Coward & Jones (1926) and thus a link is provided 
between our present work and that of the U.S. Bureau of Mines, where, in work 
extending over many years, experimental conditions similar to our own have been 
employed. 

Experimental 
Apparatus and method 

General. The conditions adopted for the main determinations of limits of
inflammability were similar to those defined by Jones (1929) as being reasonably
representative of conditions in large volumes of gas. The tube in which mixtures 
were prepared and submitted to test for flame propagation was of pyrex, about 
5 ft. long and If in. bore. It was set up vertically with the upper end closed and 
for the purpose of testing the prepared mixture the lower end was momentarily 
uncovered and a 1 in. coal gas flame applied. If flame then propagated throughout 
the complete length of the tube the mixture was considered to be within the 
limiting composition, but not otherwise. In order to verify the assumption that 
an increase in tube diameter over the figure given does not materially alter the 
limits, selected mixtures were examined using a 4 in. diameter tube, but otherwise 
under the conditions described. 

Two different methods were employed for preparing the mixtures to be tested. 
When using the combustible gases (hydrogen, carbon monoxide, methane, ethylene, 
and coal gas) with methyl bromide a flow method was used. Although this was 
somewhat extravagant of materials it eliminated mixing difficulties, which, as is 
well known, can be serious with a long tube. For mixtures incorporating vapours
of substances liquid at ordinary temperatures (hexane, cyclohexane, benzene and
carbon tetrachloride) the flow method is less simply applied, and a static method
was preferred, in which the tube was evacuated, the mixture made up from its 
constituents by pressures, and thoroughly mixed by means of a fan as used by 
Egerton & Powling (1948). The two methods are described in detail in the following 
paragraphs. 

Flow method. Into the upper end of the ignition tube were fed continuously the 
three gases composing the mixture for test (combustible, air and methyl bromide). 



Influence of incombustible vapours on inflammability of gases 527 

These were stored in cylinders and their rates of flow were controlled by reducing 
valves followed by hand-operated needle valves, using calibrated capillary flow 
gauges for observation. In the case of methyl bromide the reducing valve was not 
needed, but in order to avoid cooling of the liquid by evaporation the container was 
immersed in a large volume of water at room temperature. 

The relative rates of flow of the gases determined the composition of the mixture 
flowing down the ignition tube. Mixing was facilitated by means of a glass wool 
plug in the head of the tube and there was every indication that this was quite 
effective. The total rate of flow of mixture was controlled so that the tube was 
swept out once in about 30 seconds. 
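
As an illustration of how the relative rates of flow fix the composition, the following Python sketch converts three volumetric flow rates into percentages by volume. The function name and the flow figures are hypothetical and are not taken from the measurements; complete mixing of the three streams is assumed.

```python
def composition_percent(flows):
    """Return percentages by volume from volumetric flow rates.

    flows: dict mapping component name to flow rate (any consistent unit).
    """
    total = sum(flows.values())
    if total <= 0:
        raise ValueError("total flow must be positive")
    return {name: 100.0 * rate / total for name, rate in flows.items()}


# Hypothetical flow-gauge readings in litres per minute (illustrative only).
flows = {"methane": 0.30, "methyl bromide": 0.10, "air": 4.60}
mix = composition_percent(flows)

for name, pct in mix.items():
    print(f"{name}: {pct:.1f} %")
```

Only the ratios of the flows matter, so the gauges need be calibrated only relative to one another for this purpose.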

The lower end of the ignition tube was closed by a brass disk, carrying a £ in. 
lead-off tube for waste gases, and seating on a flange sealed to the ignition tube. 
In order to test the inflammability of the mixture in the tube at any time, the brass 
disk was slid off and a 1 in. coal gas flame applied to the mouth of the tube. 
A critical point was normally decided on ten failures to propagate or two
propagations, whichever first occurred. The difference between a pair of points, one within
and the other outside the limit of inflammability, did not usually exceed 2 mm. of 
flow gauge head. This represented from 1 % of the amount of the component of 
which there was most in the mixture under consideration, to a maximum of 3 % 
of a component of which there was only a small proportion present. 
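
The stopping rule just described may be written out as a short routine. This is a sketch only: the function name and the 'undecided' outcome for an incomplete series of trials are assumptions introduced for illustration.

```python
def classify_mixture(trials, max_failures=10, needed_propagations=2):
    """Apply the rule 'ten failures or two propagations, whichever first'.

    trials: iterable of booleans, True when flame travelled the whole tube.
    Returns 'inside' (within the limit), 'outside', or 'undecided'.
    """
    failures = 0
    propagations = 0
    for propagated in trials:
        if propagated:
            propagations += 1
        else:
            failures += 1
        if propagations >= needed_propagations:
            return "inside"
        if failures >= max_failures:
            return "outside"
    return "undecided"  # series stopped before either count was reached


print(classify_mixture([False, True, False, True]))  # two propagations
print(classify_mixture([False] * 10))                # ten failures
```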

In order to extinguish flames which continued to burn at the top of the tube
following propagation, a fourth gas connexion was provided through which carbon 
dioxide could be introduced if required. 

On account of the low luminosity of hydrogen-air flames, a platinum resistance 
thermometer in a balanced bridge circuit was installed at the head of the ignition 
tube for the purpose of detecting such flames. A thermocouple detector was also 
used with these and other mixtures. 

Before the apparatus was developed into the form described, the question of 
technique was carefully considered, and a number of preliminary tests were carried 
out. It was shown by means of comparative measurements upon stationary 
mixtures, that the small downward velocity imparted to the gases had no appreciable 
effect upon the location of the limits of flame propagation. That this is so is due
to the fact that flame speeds do not fall off progressively to zero as the limit is 
approached, but always have a finite value even at the limit. If, then, the rate of 
linear flow of the gases is small compared with the limit flame speed, the readings 
of limit mixture composition are not affected. 
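
A rough figure for the downward velocity can be had from the dimensions already given (a tube about 5 ft. long swept out once in about 30 seconds), assuming uniform plug flow. In the sketch below the limit flame speed and the tolerated ratio are hypothetical placeholders, since neither is stated in the text.

```python
FT_TO_CM = 30.48  # exact conversion, centimetres per foot


def linear_velocity_cm_s(tube_length_ft, sweep_time_s):
    """Mean gas velocity for a tube swept out once in sweep_time_s (plug flow assumed)."""
    return tube_length_ft * FT_TO_CM / sweep_time_s


def flow_is_small(velocity, limit_flame_speed, max_ratio=0.25):
    """True if the flow velocity is small compared with the limit flame speed.

    max_ratio is a hypothetical tolerance, not a figure from the paper.
    """
    return velocity <= max_ratio * limit_flame_speed


v = linear_velocity_cm_s(5.0, 30.0)   # about 5 cm/s for the tube described
HYPOTHETICAL_LIMIT_SPEED = 25.0       # cm/s, illustrative value only
ok = flow_is_small(v, HYPOTHETICAL_LIMIT_SPEED)
print(f"flow velocity {v:.2f} cm/s, small compared with flame speed: {ok}")
```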

In experiments with the lower end of the ignition tube open to the atmosphere
it was found that, in spite of the downward gas flow, there was evidence of a
tendency for air to diffuse a short distance up the tube, thus weakening the
prepared mixture.

Static method. The apparatus used for measurements involving vapours of 
materials liquid at ordinary temperatures is illustrated diagrammatically in 
figure 1. The pyrex ignition tube, which was of the standard dimensions previously 




specified and was closed at the lower end by a vacuum-tight removable brass cap, 
was evacuated together with the blower and circulating tubes by means of a Hyvac
pump. The liquid constituent was allowed to evaporate into the circuit from a bulb 
until the required pressure, as indicated by the mercury manometer, was reached. 
The other gas or vapour was similarly admitted, as required, from a cylinder 
controlled by a needle-valve. The blower was started and air was let in through 
calcium chloride, a length of capillary tube limiting its rate of entry. When 
atmospheric pressure had been reached and the constituents well mixed the blower 



Figure 1. Static method. Key to diagram: A air inlet via calcium chloride drying tube;
B mercury manometer; C ignition tube; D vacuum-tight removable brass cap; E scrubber
(potassium hydroxide solution) with spray trap and catchpot; F potassium hydroxide drying
tower; G Hyvac pump; H blower-mixer; J bulb containing liquid constituent; K gas cylinder.

was stopped, the brass cap removed from the ignition tube and the standard flame 
applied. After the test, air was drawn through the ignition tube by the Hyvac pump,
sweeping the contents through the caustic potash scrubbers. The cap was then 
replaced and re-evacuation commenced. 
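
The making up of a mixture by pressures may be illustrated as follows. The sketch assumes ideal-gas behaviour, so that partial pressure is proportional to percentage by volume, and takes the final pressure as 760 mm. of mercury; the actual barometric pressure on a given day would be used in practice. The function name is hypothetical, and the composition chosen is the 'peak' n-hexane mixture of table 2.

```python
def filling_schedule(components, total_pressure=760.0):
    """Manometer readings for a mixture made up by partial pressures.

    components: list of (name, percent_by_volume) in order of admission,
                excluding air, which is admitted last to make up the balance.
    Returns a list of (name, partial_pressure, cumulative_reading) in mm Hg.
    """
    schedule = []
    reading = 0.0
    for name, percent in components:
        partial = total_pressure * percent / 100.0
        reading += partial
        schedule.append((name, partial, reading))
    if reading >= total_pressure:
        raise ValueError("constituents leave no room for air")
    schedule.append(("air", total_pressure - reading, total_pressure))
    return schedule


# Liquid constituent first, then the second vapour, then air.
schedule = filling_schedule([("n-hexane", 2.1), ("methyl bromide", 7.05)])
for name, partial, reading in schedule:
    print(f"{name}: add {partial:.2f} mm, manometer reads {reading:.2f} mm")
```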

The condensable vapours were, of course, absorbed very rapidly by rubber 
surfaces and especially by rubber grease. The former were minimized by using
butted joints for glass tubes; hydrocarbon-resistant lubricants were used for 
glass taps. Some difficulty was at first experienced in obtaining a good vacuum 
seal at the cap without the use of lubricant. The detail diagram in figure 2 shows 
the method by which this was overcome. A totally enclosed rubber washer was
compressed by means of a screw on to a § in. flange round a f in. hole. 




Preliminary determinations were made with a form of apparatus in which the
constituents assembled in the ignition tube were stirred by a manually operated
piston. This was carried on a wire which left the top of the tube through two
coaxial rubber tubes tightened on to it with a screw clip. Glycerol was used as
lubricant.



Figure 2. Detail of brass closure.


Results obtained with the later form of apparatus showed no marked differences 
from the preliminary ones, but the improved method of stirring was less laborious 
and could be operated during admission of air to complete the mixtures. The latter 
was found to be desirable on account of a tendency for vapours to condense on the 
tube walls due to regional compression as air was admitted to the stagnant mixture.
Such condensate was not easily re-evaporated by piston mixing. 

Materials. The sources and characteristics of the materials employed are given
below. The gases, with the exception of air in the static method, were not specially
dried. Being derived from high-pressure cylinders, however, their moisture content
should be unimportant for present purposes. In the case of the liquids, benzene, 
hexane, and cyclohexane, a considerable amount of vapour was pumped off before 
use. The last part of the liquid was also rejected so that only the middle fraction 
was employed in the tests. 

(1) Hydrogen. From the British Oxygen Co. Ltd. Guaranteed purity of 99.2 to
99.5 %. Analysis by combustion showed 99.6 %.

(2) Carbon monoxide. Prepared by the reaction of formic and sulphuric acids
at 100° C. The product was freed from acid gases by scrubbing with caustic potash
solution and compressed. It showed more than 99 % absorption in ammoniacal
cuprous chloride.

(3) Methane. Derived from colliery blowhole gas (initially 96.5 %) which was
freed from CO2 by caustic potash and liquefied. The liquid was evaporated under
pressure into cylinders, giving a product of practically 100 % purity.

(4) Ethylene. From the British Oxygen Co. Ltd. Prepared by the reaction of
ethyl alcohol with sulphuric acid. Had a guaranteed purity of 97.5 % rising to
99 %; 99.4 % was found to be absorbed by fuming sulphuric acid.




(5) Coal gas. Compressed from mains supply (Gas Light and Coke Co. Ltd.):
benzole extracted. Found by analysis to have the following composition: CO2, 2.7 %;
unsaturated hydrocarbons, 3.4 %; O2, 1.0 %; CO, 12.3 %; H2, 48.6 %; CH4, 20.6 %;
C2H6, 0.6 %; N2 (by difference), 10.8 %.

(6) Benzene. ‘Crystallizable’. From Hopkin and Williams Ltd.

(7) n-Hexane. B.D.H. Laboratory Reagent (‘free from aromatic hydrocarbons’).

(8) Cyclohexane. From Research Laboratory, Eastman Kodak Co., Rochester, N.Y.

(9) Air. Compressed from laboratory atmosphere without special treatment of
any kind for the flow method. Drawn from atmosphere through calcium chloride
for the static method.

(10) Methyl bromide. From May and Baker Ltd. ‘Pure’ grade.

(11) Carbon tetrachloride. From Hopkin and Williams Ltd. ‘Pure’ grade.

(12) Freon 12. From the Frigidaire Corporation. A low temperature vacuum
distillation revealed the presence of a small amount of impurity less volatile than
freon 12. In practice, therefore, the last 10 to 20 % of the contents of a cylinder
were rejected. The fraction used had a vapour density agreeing within the limits
of measurement with the formula density of CCl2F2.

Results 

Influence of tube diameter on limits for combustible-air mixtures. Comparative 
measurements were made, in 1 J in. and 4 in. diameter pyrex tubes, of the limits in air 
of hydrogen, carbon monoxide, ethylene, n-hexane and cyclohexane. The results are
given in Table 1. As far as lower limits are concerned there is evidently little effect of 
tube diameter above 2 in.* This is in accordance with accepted data. 

Figures for the upper limits call for more consideration. Those for hydrogen and 
carbon monoxide show little diameter effect, but with hexane and with ethylene 
there is a rather unexpected lowering with increase of tube diameter. 

Table 1. Influence of tube diameter on limits of inflammability
in air (static method)

                    limits (%) in If in. tube    limits (%) in 4 in. tube
combustible            lower       upper            lower       upper
hydrogen                4.0        73.8              4.0        75.0
carbon monoxide        15.8        68.6             14.6        68.4
ethylene                3.15        †                3.20       22.7
cyclohexane             1.60        —                1.33        6.20
n-hexane                1.46        7.60             1.46        5.70

† 32.0 using flame detector; 28 to 29 % by visual observation.


* The discrepancy between the lower limits for carbon monoxide is not more than is to be
expected, in this case, in experiments in which the moisture content of the gases is not strictly
controlled.

Use of flame detector. The use of a resistance thermometer or thermocouple as
flame detector at the top of the tube was found to facilitate the placing of limits
in most cases, but was indispensable for the lower limit of hydrogen on account of
low luminosity. In general, a substantial throw was registered within the
inflammable range, no matter how close to the limit. In other words, there was
a discontinuity in the magnitude of the throw in passing through the limit. The
lower limit of hydrogen provided an exception, however, for here the throw
decreased regularly to zero at the limit. No doubt this exceptional behaviour is
connected with the fact, first noted by Coward & Brinsley (1914), that as the lower
limit for hydrogen (upward propagation) is approached combustion becomes
progressively less complete. In the presence of more than about 10 % of methyl
bromide the variation of throw at the lower limit of hydrogen became discontinuous.

In only one case did the use of the flame detector reveal anything entirely
unexpected. This was with ethylene (static method), with which the detector
continued to give throws well above the visually estimated upper limit of 28 to
29 %. With the aid of the detector the upper limit was finally located at 32.0 %.
In the 4 in. tube, however, the detector indicated no flame capable of indefinite
propagation above the visual upper limit, which was 22.7 %.



Figure 3. Limit curves.

Limits of inflammability of combustible-methyl bromide-air mixtures. In figure 3
the limit curves as determined in a 1| in. tube for hydrogen, carbon monoxide, 
methane, ethylene (flow method) and n-hexane vapour (static method) with air
and methyl bromide are shown together. In these diagrams the limiting percentages
(on a basis of volume) of combustible and methyl bromide for flame propagation
under the conditions of measurement are plotted respectively as abscissa and
ordinate. The balance to 100 % at any point represents the proportion of air. All
points in the areas enclosed by the curves represent mixtures of the three
constituents which are capable of flame propagation under the experimental
conditions described. Points outside the curves represent mixtures which are incapable
of flame propagation.

The upper limit for hydrogen in the absence of methyl bromide was not
determined in the flow apparatus but a value of 73.8 % was obtained in the static



apparatus and this has been adopted. The adjacent part of the extinction limit 
curve was interpolated to meet the part determined experimentally, and is shown 
by a broken line in figure 3. 

The relations given in these diagrams are the result of drawing smooth curves 
through experimentally determined points representing mixtures on the limit 
of flame propagation. From large scale drafts of these, points were read off 
systematically and are tabulated in table 2. The table also contains the results for 
coal gas, cyclohexane and benzene, and for n-hexane in a 4 in. tube. The vapour
pressures of cyclohexane and benzene at the laboratory temperatures prevailing 
when this work was done were insufficient to enable their upper limits (in absence 
of methyl bromide) to be reached. The fact that the area of inflammability of the 
hexane-methyl bromide system was found to be less in the 4 in. than in the smaller 
tube is difficult to explain. It seems however that this observation is associated 


Table 2. Composition of mixtures (incorporating methyl bromide) at
the limit of inflammability, derived from smoothed curves

methyl              hydrogen         carbon monoxide        ethylene
bromide (%)      lower    upper      lower    upper      lower    upper
    0             4.0     73.8*      15.3     67.4        3.0     24.5
    0.5           4.0     64.0       15.0     62.6        3.25    21.8
    1.0           4.0     57.5       15.0     57.8        3.4     19.25
    1.5           3.9     52.6       15.0     53.6        3.55    17.0
    2.0           3.9     48.3       15.1     49.6        3.6     15.0
    2.5           3.8     45.0       15.2     45.6        3.6     13.35
    3.0           3.8     41.8       15.3     41.9        3.5     12.1
    3.5           3.8     39.2       15.4     38.3        3.45    11.2
    4.0           3.8     36.7       15.6     34.8        3.4     10.45
    4.5           3.7     34.5       15.6     31.5        3.3      9.95
    5.0           3.7     32.5       15.6     28.2        3.2      9.45
    5.5           3.7     30.7       15.6     25.0        3.1      8.95
    6.0           3.7     29.0       15.7     21.3        3.0      8.5
    6.5           3.6     27.6        —        —          2.9      8.0
    7.0           3.6     26.3        —        —          2.8      7.55
    7.5           3.6     25.2        —        —          2.7      7.05
    8.0           3.6     24.1        —        —          2.65     6.55
    8.5           3.6     22.9        —        —          2.45     6.05
    9.0           3.5     21.7        —        —          2.3      5.6
    9.5           3.5     20.5        —        —          2.1      5.1
   10.0           3.2     19.1        —        —          2.0      4.6
   10.5           3.0     17.8        —        —          1.95     4.1
   11.0           2.7     16.4        —        —          2.0      3.65
   11.5           2.5     14.8        —        —          2.35     3.15
   12.0           2.4     13.3        —        —           —        —
   12.5           2.6     11.6        —        —           —        —
   13.0           3.1      9.8        —        —           —        —
   13.5           5.3      7.8        —        —           —        —

peak (%):
  combustible         6.7                17.7                 2.75
  methyl bromide     13.7                 6.2                11.65



Table 2 (cont.)

methyl              methane             coal gas            n-hexane
bromide (%)      lower    upper      lower    upper      lower    upper
    0             5.20    13.50       5.0     33.0        1.45     7.50
    0.5           5.52    12.14       5.15    27.9        1.50     5.75
    1.0           5.70    11.00       5.3     26.4        1.55     5.3
    1.5           5.80    10.02       5.4     24.4        1.60     4.9
    2.0           5.80     9.20       5.55    23.0        1.65     4.6
    2.5           5.72     8.56       5.65    21.85       1.70     4.25
    3.0           5.54     8.00       5.8     20.8        1.70     3.95
    3.5           5.40     7.54       5.9     19.85       1.75     3.7
    4.0           5.46     7.12       6.0     18.95       1.75     3.5
    4.5           5.64     6.56       6.1     18.0        1.75     3.3
    5.0            —        —         6.1     17.1        1.75     3.1
    5.5            —        —         6.0     16.2        1.75     2.9
    6.0            —        —         5.8     15.35       1.75     2.7
    6.5            —        —         5.65    14.5        1.75     2.5
    7.0            —        —         5.45    13.7        1.90     2.25
    7.5            —        —         5.2     12.9         —        —
    8.0            —        —         5.0     12.1         —        —
    8.5            —        —         4.75    11.2         —        —
    9.0            —        —         4.5     10.3         —        —
    9.5            —        —         4.2      9.25        —        —
   10.0            —        —         4.05     8.1         —        —
   10.5            —        —         4.15     6.8         —        —

peak (%):
  combustible         6.0                 4.85                2.1
  methyl bromide      4.7                11.0                 7.05


methyl            cyclohexane           benzene        n-hexane in 4 in. tube
bromide (%)      lower    upper      lower    upper      lower    upper
    0             1.50     †          1.65     †          1.45     5.70
    0.5           1.55     †          1.70     †          1.50     5.1
    1.0           1.55     †          1.75     †          1.55     4.65
    1.5           1.60     †          1.80     †          1.60     4.3
    2.0           1.65     †          1.85     †          1.65     4.0
    2.5           1.70     †          1.90     †          1.70     3.8
    3.0           1.75     †          1.95     †          1.75     3.6
    3.5           1.80     †          2.00     †          1.80     3.45
    4.0           1.80     †          2.05     †          1.85     3.3
    4.5           1.85     †          2.10     †          1.90     3.15
    5.0           1.90     †          2.15     †          1.95     2.95
    5.5           1.95     †          2.15     3.35       2.0      2.7
    6.0           2.00     3.3        2.15     3.05        —        —
    6.5           2.00     3.1        2.05     2.85        —        —
    7.0           2.05     2.8        1.90     2.60        —        —
    7.5            —        —         1.95     2.45        —        —

peak (%):
  combustible         2.25                2.25                2.2
  methyl bromide      7.4                 7.75                6.0

* Value determined in static apparatus.
† Not determined by the authors.





with that already recorded (see table 1)—that the ordinary upper limit for hexane 
in air is lowered in the wider tube. 

The lowering effect of methyl bromide on the lower limit of some of the
combustibles examined (e.g. hydrogen) is noteworthy. The effect is probably accountable
on thermal grounds and may be interpreted as showing the contribution which the
methyl bromide itself makes to the heat of reaction. It is known that it has
a fairly large heat of combustion (184.7 kcal./mol.) (Heilbron 1934) and has been
caused to ignite within narrow limits in air (13.5 to 14.5 %) by very powerful
electrical ignition sources (Jones 1928). From the practical point of view, it is
important to remember that proportions of methyl bromide below the extinguishing
amount can render certain combustible gases more, rather than less, inflammable,
in the sense of lowering the lower limit of flame propagation.

Also included in table 2 are the mixture compositions represented by the ‘peaks’
of the several curves (maximum percentage of methyl bromide). It can be seen 
from these figures as well as from figure 3 that extinction of limits by methyl 
bromide is increasingly difficult in the order methane, carbon monoxide, ethylene, 
hydrogen. This is in accordance with previous experience in connexion with 
nitrogen and carbon dioxide as extinguishing agents (Coward & Jones 1938), and 
it will be noted that the order is that of the ratio upper/lower limit percentage of 
combustible in air. 
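
The correspondence between the two orderings can be checked directly from the figures of table 2. The short sketch below ranks the four gases by the ratio of upper to lower limit in air and by the peak percentage of methyl bromide; the variable names are introduced here for illustration only.

```python
# (lower limit, upper limit) in air, per cent, from the first row of table 2.
limits_in_air = {
    "methane": (5.20, 13.50),
    "carbon monoxide": (15.3, 67.4),
    "ethylene": (3.0, 24.5),
    "hydrogen": (4.0, 73.8),
}

# Peak percentage of methyl bromide on each limit curve, from table 2.
peak_methyl_bromide = {
    "methane": 4.7,
    "carbon monoxide": 6.2,
    "ethylene": 11.65,
    "hydrogen": 13.7,
}


def limit_ratio(gas):
    """Ratio of upper to lower limit of the gas in air."""
    lower, upper = limits_in_air[gas]
    return upper / lower


by_ratio = sorted(limits_in_air, key=limit_ratio)
by_peak = sorted(peak_methyl_bromide, key=peak_methyl_bromide.get)

for gas in by_ratio:
    print(f"{gas}: ratio {limit_ratio(gas):.2f}, peak {peak_methyl_bromide[gas]} %")
```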

In table 3 are the results, tabulated as in table 2, for systems incorporating the 
other extinguishing materials, namely difluorodichloromethane (freon 12) with 
hydrogen and with n-hexane, and carbon tetrachloride with methane. The latter 
system has been completely explored by Coward & Jones (1926) and our present 
results are in good agreement with theirs. 

Appearance of flames. The variation in the character of the flame of each
combustible in the presence of methyl bromide as the mixture composition passes
round the boundary of the inflammable region, from the lower to the upper limit,
will be described.

At the extremity of the lower limit side of the curve, in the absence of methyl 
bromide, the flame of each combustible except hydrogen was its own characteristic 
shade of pale blue. The flame front was a clear-cut dome but usually had a less
distinct rear boundary. As methyl bromide was added, at the expense mainly of 
air, a luminous yellow blended with the lower limit flame. Shortly before the peak 
of the limit curve was reached, the yellow became sufficiently dense to obscure any 
other colour, and the flame increased in length to eight inches or more. Passing 
down the other side of the curve, increasing combustible at the expense mainly 
of methyl bromide, the flames remained luminous, and in the case of benzene, 
became sooty. When a mixture was outside this part of the curve it burned with 
a luminous flame in the atmosphere outside the tube. Within the curve, as the 
upper limit was approached, a more elaborate phenomenon occurred with ethylene, 
hexane, cyclohexane, benzene and coal gas. External burning was accompanied 
by a regular dome-shaped flame travelling up the tube. The expansion produced 




by the partial combustion in the tube drove the products down to supply the 
external flame. When the internal flame went out at the top of the tube, the 
ensuing contraction drew in the secondary flame which then travelled irregularly 
a foot or two up the tube and dwindled away. Very near the upper limit, though 
the flames of benzene were luminous, those of hexane and cyclohexane were
blue-violet. A little further from the limit, at 0.72 % hexane for example, this translucent
flame permitted the observation of a dull orange-red core, concentric with the
flame-front, of perhaps half the diameter of the tube. This became involuted in
a complicated scroll form as the flame proceeded.


Table 3. Composition of mixtures (other than those incorporating
methyl bromide) at limit of inflammability

                                      combustible %
incombustible   hydrogen-freon 12   n-hexane-freon 12   methane-carbon tetrachloride
vapour (%)       lower    upper      lower    upper         lower    upper
    0             4.0*    73.8*      1.40     7.6             †        †
    1              —       —         1.55     7.25           5.5       †
    2              —       —         1.65     6.95           5.8      13.3
    3              —       —         1.75     6.7            5.9      13.0
    4              —       —         1.8      6.45           6.0      12.8
    5             4        —         1.9      6.2            6.2      12.5
    6              —       —         2.0      6.0            6.3      12.3
    7              —       —         2.05     5.75           6.4      12.0
    8              —       —         2.15     5.65           6.6      11.4
    9              —       —         2.3      5.4            6.9      10.9
   10             4.5     45         2.45     5.15           7.2      10.3
   11              —       —         2.7      4.9            7.6       9.7
   12              —       —         2.95     4.55            —        —
   13              —       —         3.35     4.05            —        —
   15             5.5     37          —        —              —        —
   20             6       29          —        —              —        —
   25             7       22          —        —              —        —
   30             8       16          —        —              —        —

peak (%):
  combustible        10                  3.6                     8.3
  incombustible      35                 13.5                    12.0

* From table 2.
† Not determined by the authors.


A special phenomenon was noted near the lower limit of hydrogen. (At the lower 
limit the flame was invisible and special measures were necessary for its detection, 
as described previously.) With just sufficient methyl bromide to make them visible, 
numbers of yellow spots of roughly 2 mm. diameter circulated rapidly within the 
rising dome. Sometimes the dome was striated with yellow and the spots appeared 
to drip from it. Phenomena of this kind have been previously observed by Coward 
& Brinsley (1914) and an explanation in terms of diffusion suggested by Harteck 
was supported by Goldmann (1929). 




Flame-balls of a different character occurred with benzene near its upper limit.
Three or four luminous balls of about 1 cm. diameter passed up the tube within 
a flame-front which, as with some of the more critical mixtures, was less well defined. 

Safe mixtures. The chief contribution of this investigation is the complete 
delimitation of flame-propagating mixtures comprising some incombustible vapours 
and air together with representative combustible gases and vapours. A particular 
use of data of the kind obtained is to establish ‘limiting safe mixtures’ of the 
incombustible vapours with either air or combustible. Thus, if in diagrams of the 
kind shown in figure 3 a tangent is drawn from the origin to the extinction limit 
curve, the constant proportion of extinguisher to combustible thus represented is 
that of a limiting mixture of the two, which under the conditions of experiment 
cannot form explosive mixtures with air. Similarly, a tangent from the point 
combustible = 100 % represents a limiting proportion of extinguisher to air giving 
an atmosphere in which the combustible cannot propagate flame. The two mixtures 
thus defined have a certain practical significance and may be referred to as ‘limiting
safe mixtures’ with respect to the third component. The content of incombustible
vapour in such mixtures, as determined from the present measurements, is shown
in table 4.

Table 4. Limiting safe mixtures

                                            limiting safe percentage
                                            of incombustible vapour
combustible         incombustible           with air   with combustible
hydrogen            methyl bromide            14.8          86
carbon monoxide     methyl bromide             7.6          28
methane             methyl bromide             6.1          45
ethylene            methyl bromide            12.0          85
coal-gas            methyl bromide            11.6          72
n-hexane            methyl bromide             7.2          80
cyclohexane         methyl bromide             7.6          78
benzene             methyl bromide             7.9          79
hydrogen            freon 12                  39            78
n-hexane            freon 12                  14.0          80
methane             carbon tetrachloride      13.1          60
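
The tangent construction can be approximated numerically. A line through the origin corresponds to a fixed proportion of incombustible vapour to combustible, and a line through the point combustible = 100 % to a fixed proportion of incombustible vapour to air; the tangents are the steepest such lines that still touch the curve. The sketch below applies this to a few of the ethylene points of table 2 near the peak. It is an approximation only, working from tabulated points and not from the smooth curve, and the function name is hypothetical.

```python
def limiting_safe_percentages(curve):
    """Approximate the two tangents from points on a limit curve.

    curve: list of (combustible %, incombustible %) on the limit curve.
    Returns (per cent incombustible in its mixture with air,
             per cent incombustible in its mixture with combustible).
    """
    with_air = max(100.0 * y / (100.0 - x) for x, y in curve)
    with_combustible = max(100.0 * y / (x + y) for x, y in curve)
    return with_air, with_combustible


# Ethylene with methyl bromide: points near the peak, from table 2.
ethylene_curve = [
    (2.0, 10.0), (1.95, 10.5), (2.0, 11.0), (2.35, 11.5),  # lower-limit side
    (2.75, 11.65),                                          # peak
    (3.15, 11.5), (3.65, 11.0), (4.1, 10.5),                # upper-limit side
]

with_air, with_combustible = limiting_safe_percentages(ethylene_curve)
print(f"with air: {with_air:.1f} %, with combustible: {with_combustible:.1f} %")
```

The figures so obtained lie close to the 12.0 and 85 % given for ethylene in table 4; small differences are to be expected because only tabulated points are used.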


Calculation of extinction limits for coal gas. Jones (1929) has shown how the
effect of extinguishing materials upon the limits of many combustible gaseous
mixtures can be approximately calculated using Le Chatelier’s rule as extended
by Coward (1919). This states that the limit for a gaseous mixture containing
$p_1$, $p_2$, $p_3$ etc. % of gases having limits of $N_1$, $N_2$, $N_3$ etc. is given by:

\[
\text{limit} = \frac{100}{\dfrac{p_1}{N_1} + \dfrac{p_2}{N_2} + \dfrac{p_3}{N_3} + \text{etc.}}
\]

The rule applies both to lower and upper limits.
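
As a minimal sketch of the rule, the function below evaluates the expression for any number of constituents. The function name is hypothetical, and the 50:50 mixture of hydrogen and carbon monoxide is an invented example, using the lower limits in air of table 2 (4.0 and 15.3 %).

```python
def le_chatelier_limit(constituents):
    """Limit of a mixture by Le Chatelier's rule.

    constituents: list of (percentage in the mixture, limit of that gas in air).
    The percentages must add up to 100.
    """
    total = sum(p for p, _ in constituents)
    if abs(total - 100.0) > 1e-6:
        raise ValueError("percentages must add up to 100")
    return 100.0 / sum(p / n for p, n in constituents)


# Hypothetical equal-volume mixture of hydrogen and carbon monoxide.
lower_limit = le_chatelier_limit([(50.0, 4.0), (50.0, 15.3)])
print(f"calculated lower limit: {lower_limit:.2f} %")
```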




The method may be applied in the present work to calculate, for instance, the 
limits of a 1:1 coal gas-methyl bromide mixture as follows (see table 5). 


Table 5. Calculation of the limits for a 1:1
coal gas-methyl bromide mixture in air

Columns: (1) percentage or parts per 100; (2) parts of methyl bromide added;
(3) parts CO2 added; (4) parts N2 added; (5) parts combustible and diluent;
(6) percentage of diluted mixture; (7) lower limit; (8) upper limit.

constituent     (1)      (2)     (3)     (4)     (5)      (6)     (7)     (8)
C2H4            3.4     10.2      —       —     13.6      7.0    10.8    14.7
CO             12.3      2.5      —       —     14.8      7.6    18.4    31.8
H2             48.6     87.3      —       —    135.9     69.5    10.4    21.0
CH4            14.0       —       —      7.0    21.0     10.8     7.5    19.0
CH4             7.2       —      2.7      —      9.9      5.1     7.4    17.4
CO2             2.7       —       —       —       —        —       —       —
N2              7.0       —       —       —       —        —       —       —
air             4.8       —       —       —       —        —       —       —
totals        100.0    100.0     2.7     7.0   195.2    100.0      —       —


The coal gas (100 parts by volume) may be sub-divided as in column (1) into
combustible, air, nitrogen and carbon dioxide. Allocating methyl bromide (100 parts
in all) to C2H4, CO and H2 as in column (2), we get mixtures whose total volume is
as shown in column (5), with limits derived from figure 3, as in (7) and (8). Allocating
the 2.7 parts of CO2 to some of the methane (column (3)) and the 7.0 parts of
nitrogen to the rest (column (4)), and employing Jones’ (1929) extinction limit
curves, we get further mixtures whose volumes are shown in column (5), having
limits as in (7) and (8). The percentage constitution of the diluted mixture (expressed
in terms of combustible constituents and allocated diluents) is given in column (6).
Using Le Chatelier’s rule and allowing for 4.8 % of air present in the coal gas, we
have, from the figures in columns (6) and (7)


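\[
\text{lower limit} = \frac{100}{\dfrac{7.0}{10.8} + \dfrac{7.6}{18.4} + \dfrac{69.5}{10.4} + \dfrac{10.8}{7.5} + \dfrac{5.1}{7.4}} \times \frac{100}{95.2} = 10.6\,\%.
\]

Employing the figures in columns (6) and (8), we have,

\[
\text{upper limit} = \frac{100}{\dfrac{7.0}{14.7} + \dfrac{7.6}{31.8} + \dfrac{69.5}{21.0} + \dfrac{10.8}{19.0} + \dfrac{5.1}{17.4}} \times \frac{100}{95.2} = 21.5\,\%.
\]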

Using this method of calculation, other mixtures of coal gas and methyl bromide 
have been examined, with the results given in table 6. Comparative experimental
limits are also given. 
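
The arithmetic of table 5 can be retraced in a few lines. The sketch below rebuilds columns (5) and (6) from the allocations and then applies Le Chatelier's rule with the correction for the air in the coal gas. The division of the methane into 14.0 and 7.2 parts is inferred here from the totals in column (5) (21.0 less 7.0 of nitrogen, and 9.9 less 2.7 of carbon dioxide); the names used are hypothetical.

```python
# Each group: (parts combustible, parts diluent, lower limit, upper limit).
groups = {
    "C2H4 + CH3Br": (3.4, 10.2, 10.8, 14.7),
    "CO + CH3Br": (12.3, 2.5, 18.4, 31.8),
    "H2 + CH3Br": (48.6, 87.3, 10.4, 21.0),
    "CH4 + N2": (14.0, 7.0, 7.5, 19.0),
    "CH4 + CO2": (7.2, 2.7, 7.4, 17.4),
}
AIR_IN_COAL_GAS = 4.8  # parts of air per 100 parts of coal gas

# Column (5): parts of combustible plus allocated diluent in each group.
parts = {name: g[0] + g[1] for name, g in groups.items()}
grand_total = sum(parts.values())

# Column (6): percentage of each group in the diluted mixture.
percent = {name: 100.0 * p / grand_total for name, p in parts.items()}


def mixture_limit(limit_index):
    """Le Chatelier limit of the diluted mixture, corrected for air in the gas.

    limit_index: 2 for the lower limits, 3 for the upper limits.
    """
    denominator = sum(percent[name] / groups[name][limit_index] for name in groups)
    return (100.0 / denominator) * 100.0 / (100.0 - AIR_IN_COAL_GAS)


lower = mixture_limit(2)
upper = mixture_limit(3)
print(f"total parts {grand_total:.1f}; lower {lower:.1f} %, upper {upper:.1f} %")
```

Unrounded percentages are carried through here, whereas the worked example in the text uses the rounded figures of column (6); the results agree to the first decimal place.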






Table 6. Comparison of calculated and measured limits of coal gas

ratio:              calculated limits        measured limits
methyl bromide      of diluted mixture       of diluted mixture
/ coal gas          lower      upper         lower      upper
    0                5.5       37.4           5.0       33.0
    0.25             6.8       28.1           6.7       22.6
    0.5              8.1       25.3           8.7       20.8
    1.0             10.6       21.5          11.6       18.8
    2.0             13.4       16.7          13.4       16.4
    2.93            14.4       15.8          slightly beyond extinction limit


It will be seen that, while agreement is good as regards the lower limit side of the 
curve, there is rather more divergence on the upper limit side. For most practical 
purposes, however, the errors involved (of the order of 10 %) would not be very 
serious, and it would generally be permissible to employ the method of calculation 
outlined both for upper and lower limits. 


Principal conclusions 

(1) A flow method for the determination of limits of flame propagation in 
complex gas mixtures has been developed and found to give satisfactory results 
less tediously than do static methods. In the form used it is only applicable where 
all components of the mixture are gaseous at atmospheric temperature. 

(2) For mixtures of which one constituent is condensable a static method has
been preferred, but special stirring arrangements have been found necessary. 

(3) By means of the two methods described limits of flame propagation under
standard conditions have been fully determined for ternary mixtures involving
a number of combustibles mixed individually with methyl bromide and air. The
combustibles were placed in the following order of increasing susceptibility to
extinction by methyl bromide: hydrogen, ethylene, coal gas (composition as stated),
benzene, cyclohexane, n-hexane, carbon monoxide and methane. Benzene,
cyclohexane and n-hexane were approximately equally susceptible.

(4) Complete limits have also been delineated for mixtures with air of hydrogen
and freon 12; n-hexane and freon 12; and methane and carbon tetrachloride.

(5) In some cases additions of methyl bromide appreciably lowered the lower
limit of flame propagation, although in all cases a sufficient addition of the
extinguisher caused the upper and lower limits to converge to the point
of coincidence. The initial lowering effect was not shown with freon 12 or carbon
tetrachloride.

(6) The effect of methyl bromide upon the limits of the sample of (debenzolized) 
coal gas examined was calculated from data for hydrogen, ethylene, carbon 
monoxide and methane, by the use of Coward's extension of Le Chatelier's rule. 
Agreement with experimental results was within about 10 %. 




This work was carried out in the Department of Chemical Technology, Imperial 
College, London as part of the research programme of the Joint Fire Research 
Organization of the Department of Scientific and Industrial Research and Fire 
Offices’ Committee. The authors are indebted to the Director for permission to 
publish this account. 

The authors are also indebted to Dr C. S. Chow, of this Department, for measurements 
of the purity of the gases employed and for the detailed analysis of the 
sample of coal gas. 


References 

Coward, H. F. 1919 J. Chem. Soc. 115, 27. 

Coward, H. F. & Brinsley, F. 1914 J. Chem. Soc. 105, 1859. 

Coward, H. F. & Jones, G. W. 1926 Ind. Engng. Chem. 18, 970. 

Coward, H. F. & Jones, G. W. 1938 U.S. Bur. Mines Bull. 279 (revised). 

Dufraisse, C., Vieillefosse, R. & Le Braz, J. 1933 C.R. Acad. Sci., Paris, 197, 162. 

Egerton, A. C. & Powling, J. 1948 Proc. Roy. Soc. A, 193, 172, 190. 

Goldmann, F. 1929 Z. Phys. Chem. 5, B, 307. 

Heilbron, I. M. 1934 Dictionary of organic compounds. London: Eyre & Spottiswoode. 

Jones, G. W. 1928 Ind. Engng. Chem. 20, 367. 

Jones, G. W. 1929 U.S. Bur. Mines Tech. Pap. no. 450. 

Jones, G. W. & Perrott, G. St J. 1930 U.S. Bur. Mines Rept. Invest. 3042. 

Jones, G. W. & Scott, F. E. 1946 U.S. Bur. Mines Rept. Invest. 3908. 

Olsen, J. C. & Graddis, A. H. 1938 Ind. Engng. Chem. 30, 308. 


Decay of isotropic turbulence in the initial period 

By G. K. Batchelor, Trinity College, University of Cambridge 
and A. A. Townsend, Emmanuel College, University of Cambridge 

(Communicated by Sir Geoffrey Taylor, F.R.S. Received 4 December 1947) 

In a previous paper the authors described direct measurements of all the terms in the equation 
for the rate of change of mean square vorticity in isotropic turbulence. The present paper is 
concerned with developments arising from the earlier work and with the experimental 
verification of some recent theoretical investigations. The results of measurements of the 
turbulent intensity $u'$ and of $\lambda$ are presented; these establish that $u'^{-2}$ and $\lambda^2$ are each 
proportional to the time of decay provided that the time is not too large. Within this initial 
period of the decay, the double and triple velocity correlation functions are found to maintain 
their form, i.e. to be self-preserving, for small values of the distance $r$ between the two points 
at which the correlations are taken. For larger separations the double velocity correlation 
function changes its form slightly during decay and direct measurements of $\lambda$ and of the 
integral scale $L$ show that $\lambda/L$ increases during the decay. Theoretical predictions about 
the shape of the correlation function, for limited ranges of $r$, at high and at low Reynolds 
numbers are compared with measurements. 

Theory has shown that the above decay law cannot persist indefinitely, and the present 
experiments confirm that the decay law changes in the expected direction when the time is 
large. A division of the life-history of the turbulence into initial, transition and final periods 
is suggested; within the initial period, a classification based on the Reynolds number is also 
possible. Some speculations on the interpretation of the initial period are presented. 






1. Introduction 

In a previous paper (Batchelor & Townsend 1947, referred to hereafter as I) the 
authors have analyzed measurements of the rate of change of mean square vorticity 
in isotropic turbulence. It has been shown theoretically that two separate physical 
processes contribute to the change in the vorticity. The first process is an average 
extension of the vortex lines due to the random diffusive motion, giving a positive 
contribution to $d\omega'^2/dt$, where $\omega'^2$ is the mean square of any vorticity component, 
which is directly related to the skewness of the probability distribution of the rate 
of extension of fluid elements aligned in any direction. The second process is a 
dissipation of the vorticity due to the effect of viscosity, and the contribution to 
$d\omega'^2/dt$ depends on the fourth derivative at the origin of the double velocity 
correlation function. Both of these contributions and the value of $d\omega'^2/dt$ itself were 
measured at different stages in the decay of the turbulence. The agreement made it 
clear that it is permissible to apply the theory of isotropic turbulence to the motion 
behind a grid in a uniform stream and that the measurements described were 
essentially correct. 

The plan of measuring all terms in an equation derived from the property of 
isotropy was also adopted for the energy balance. This equation contains only the 
mean energy per unit mass, $\tfrac{3}{2}u'^2$, and the dissipation length parameter $\lambda$, in addition 
to the independent variable $t$ representing time. $u'^{-2}$ and $\lambda^2$ were found to increase 
linearly with $t$, so that the Reynolds number $R_\lambda = u'\lambda/\nu$ was approximately 
constant for the range of grid Reynolds numbers and decay times used in the 
experiments. Once again the measurements were found to satisfy the equation. 

Another result which emerged from the experiments was that there is some 
measure of self-preservation of shape of the functions $f(r)$ and $k(r)$ which describe the 
double and triple velocity correlations. $f$ and $k$ were found to be functions of the 
variable $r/\lambda$ alone, not depending directly on $t$, so far as terms up to the fourth and 
third degree respectively in their expansions in powers of $r$ are concerned. This 
property of self-preservation of the correlation functions is intimately related to 
the law of energy decay. One of us has therefore made a full theoretical analysis 
(Batchelor 1948, to be referred to as II) of the consequences of various kinds of 
self-preservation hypotheses under different Reynolds number conditions, and it 
has been found possible to make a number of predictions. 

The present paper is in a direct sequence with the work of I and II. Its first aim 
is to determine the validity of the energy decay law reported in I over as wide a range 
of grid Reynolds numbers and decay times as the wind tunnel in the Cavendish 
Laboratory permits. The second aim is to examine the double velocity correlation 
at different decay times in order to see the extent of the property of self-preservation 
of shape. Finally, as many of the theoretical deductions of II as come within the 
scope of the measurements will be put to the test. 

The measurements of the various statistical quantities associated with the 
turbulence have been made wholly with a hot-wire anemometer and wherever possible, 
direct measurements obtained by a suitable electrical analysis of the wire output 
have been preferred. In this way direct measurements of $u'$, $\lambda$, $k_0'''\lambda^3$ and $f_0^{iv}\lambda^4$ as 
described in I, and of $\int_0^\infty f\,dr$, $g(r)$ and $k(r)$ have been made. Details of the methods 
are described fully in a separate paper by one of us (Townsend 1947). The wind 
tunnel used in the experiments is situated in the Cavendish Laboratory, Cambridge, 
and has been described in I. As is customary, the time of decay in the idealized 
problem of homogeneous isotropic turbulence will be identified with the time $x/U$ 
which has elapsed since any point distance $x$ from the turbulence-producing grid 
and moving with the mean stream was at the grid. 

2. Notation to be used 

$u'^2$ = mean square fluctuation of any velocity component. 

$\omega'^2 = 5u'^2/\lambda^2$ = mean square fluctuation of any vorticity component. 

$x$ = distance from the grid. 

$M$ = periodic spacing of the turbulence-producing grid. 

$d$ = diameter of the rods comprising the grid. 

$U$ = velocity of the uniform stream relative to the grid. 

$R_M = UM/\nu$: Reynolds number associated with the grid. 

$f(r)$, $g(r)$, $k(r)$: double and triple velocity correlation coefficients in the sense used 
in I and first defined by v. Kármán & Howarth (1938). 

$\psi = r/\lambda$. 

$R_\lambda = u'\lambda/\nu$: Reynolds number of the turbulence. 

$L = \int_0^\infty f\,dr = 2\int_0^\infty g\,dr$: integral scale of the turbulence. 

$s$ = skewness (with sign reversed) of the probability distribution of the rate of 
extension in any direction. 

$l$ = maximum value of $r$ for which self-preservation of $f$, $g$ and $k$ occurs during 
the initial period. 

$C_D$ = drag coefficient of grid. 

3. The significance of the 'initial period of decay' 

As a result of the experiments described in I and the theoretical analysis in II, 
it has become clear that there are two distinct phases in the life-history of isotropic 
turbulence produced by a grid, with a transitional phase connecting them. The 
phase which may be termed the initial period of decay is characterized by the 
existence of the energy decay law 

$$u'^{-2} \propto t, \qquad \lambda^2 = 10\nu t. \tag{3.1}$$





The measurements reported in I were made behind several grids ($M$ = 1.27, 2.54 
and 5.08 cm.) over ranges which lay within the limits $x/M = 20$ to $x/M = 130$ and 
the turbulence was in every case evidently within the initial period of decay. The 
range $x/M < 20$ is of course excluded from the system of classification since the 
turbulence takes a little time to become uniform (in the lateral direction) and 
isotropic. It remains to be seen whether it is possible to define an initial period of 
decay by the laws (3.1) for all values of the grid Reynolds number. The experiments 
of I covered the range $R_M = 5.5 \times 10^3$ to $4.4 \times 10^4$ and experiments to be described 
herein confirm the validity of the laws (3.1) in the initial period of decay for values 
of $R_M$ at least down to 600. 

Now in II it was shown theoretically that the only possible regime at very large 
times of decay is one in which the correlation functions are completely self-preserving 
and the law of energy decay is 

$$u'^{-2} \propto t^{5/2}, \qquad \lambda^2 = 4\nu t. \tag{3.2}$$

The double velocity correlation function has the form 

$$f(r) = e^{-r^2/2\lambda^2} \tag{3.3}$$

and the effect of the triple correlation is negligible. The authors propose that this 
phase be termed the final period of decay. There will also be a transitional period 
of decay when the mechanics of the turbulent motion is changing in type. As yet 
there is no definite information about the magnitudes of the decay times defining 
the extent of the different phases, but the experiments described below and others 
previously reported fix upper limits to the initial period for a few values of $R_M$. 

It is possible to make, for the initial period of this fundamental division of the life- 
history of the turbulence into three phases, an equally fundamental classification 
depending on the magnitude of the Reynolds number. The basis for the classification 
lies in the relative effects on the turbulence of the inertia and viscous terms of the 
Navier-Stokes equations and a convenient analytical criterion can be obtained 
from the equation for decay of mean square vorticity. It was seen in I that this 
equation can be written as 

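$$\frac{3\sqrt{5}}{14}\,\frac{d\omega'^2}{dt} = \tfrac{1}{2}s\,\omega'^3 - \frac{f_0^{iv}\lambda^4}{R_\lambda}\,\omega'^3. \tag{3.4}$$

Using the decay law (3.1) appropriate to the initial period, the left side becomes 
$-\dfrac{30}{7}\dfrac{\omega'^3}{R_\lambda}$ and the equation reduces to 

$$-\frac{30}{7R_\lambda} = \tfrac{1}{2}s - \frac{f_0^{iv}\lambda^4}{R_\lambda}. \tag{3.5}$$
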

The first term on the right side is derived from the inertia terms of the Navier- 
Stokes equations and represents the creation of mean square vorticity due to the 
diffusive increase in length of vortex lines. The second term on the right side repre¬ 
sents the destruction of vorticity due to viscosity. Thus, the ratio of the viscous to 
the inertia effect is 


$$\frac{2f_0^{iv}\lambda^4}{sR_\lambda} = 1 + \frac{60}{7sR_\lambda}. \tag{3.6}$$





The experiments of I established for the initial period of decay and 

$$0.5 \times 10^4 < R_M < 4.5 \times 10^4$$

that the skewness factor $s$ is approximately an absolute constant with a value close 
to 0.39. Hence the factor which governs the relative importance of the viscous and 
inertia effects on the rate of change of mean square vorticity is 

$$1 + \frac{22}{R_\lambda}. \tag{3.7}$$

The number $R_\lambda$ clearly plays a fundamental role in the theory of isotropic 
turbulence and the authors propose to call it the Reynolds number of the turbulence. 
Since $R_\lambda$ is constant during the initial period of the decay, the value of $R_\lambda$ is a 
convenient and significant index to the type of turbulence occurring throughout that 
period. Three roughly separate regimes depending on the magnitude of $R_\lambda$ can be 
distinguished. The regime of large Reynolds numbers of turbulence is such that 
$22/R_\lambda \ll 1$ and the contributions to $d\omega'^2/dt$ from the viscous and inertia terms are 
approximately equal and opposite. At intermediate Reynolds numbers the factor 
$22/R_\lambda$ is of order unity and the ratio of the two contributions to $d\omega'^2/dt$ is significantly 
different from unity. Finally, at small Reynolds numbers of turbulence, $22/R_\lambda \gg 1$, 
and the effect of the triple correlations on $d\omega'^2/dt$ is negligible compared with the 
viscous dissipation. 
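
The factor governing the three regimes can be evaluated directly. The following is a minimal sketch, assuming the ratio of viscous to inertia effects is $1 + 60/(7sR_\lambda)$ with $s = 0.39$ as quoted above; the function names and the threshold `band` used to label a regime are illustrative choices, not values taken from the paper.

```python
# Skewness factor quoted in the text for the initial period.
S = 0.39


def viscous_to_inertia_ratio(r_lambda, s=S):
    """Ratio of the viscous to the inertia contribution: 1 + 60 / (7 s R_lambda)."""
    return 1.0 + 60.0 / (7.0 * s * r_lambda)


def regime(r_lambda, s=S, band=10.0):
    """Label the regime from the size of 60 / (7 s R_lambda).

    `band` is an arbitrary illustrative threshold for "much less than" and
    "much greater than" unity; the paper gives no numerical cut-off.
    """
    x = 60.0 / (7.0 * s * r_lambda)
    if x < 1.0 / band:
        return "large"
    if x > band:
        return "small"
    return "intermediate"


# The numerical coefficient that appears as "22" in the text.
coefficient = 60.0 / (7.0 * S)

if __name__ == "__main__":
    print(round(coefficient, 2))
    for r in (1.0, 20.3, 58.0, 1000.0):
        print(r, round(viscous_to_inertia_ratio(r), 3), regime(r))
```

With these assumptions the values of $R_\lambda$ between about 14 and 58 met in the experiments fall in the intermediate regime.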

Each of these three regimes has its own body of theory. The existence of large 
Reynolds numbers of turbulence is basic to the theory of locally isotropic turbulence, 
first put forward by Kolmogoroff (1941) and developed more recently by one of us 
(Batchelor 1947). There is no need to describe here the results of this theory except 
to remark that it predicts that the skewness factor $s$ is an absolute constant, and the 
experimental evidence (see I) for this result in the range of intermediate Reynolds 
numbers thus shows that there is some overlapping of the Reynolds number 
categories. There are other results of the theory, e.g. concerning the form of the 
function $f(r)$, which do not appear to be true at intermediate Reynolds numbers. 
There is as yet insufficient evidence to be certain that the energy decay law (3.1), 
which has been taken to define the initial period of decay, continues to hold as $R_\lambda$ 
becomes large (the measurements described below confirm it as far as $R_\lambda = 60$), 
nor is this decay law necessary to the theory of locally isotropic turbulence, so that 
this part of the classification is provisional. 

No unqualified deductions about the turbulence at intermediate Reynolds numbers 
have been made, but a few results have been obtained in II on the assumption of 
limited self-preservation of the correlation functions, i.e. that $f$ and $k$ are functions 
of $r/\lambda$ only, provided $0 \le r < l$, where $l$ is an unspecified length which is not too large. 
There were found to be two possible ways of proceeding from this assumption. 
According to the first method of solution the law of energy decay is given by (3.1), 
whereas the second method gives a different decay law which asymptotes to (3.1) 
as $t$ increases. Since there is no means of predicting the rate of approach to (3.1), 
measurement of the energy decay law is not sufficient to choose between the two 
solutions. However they make different predictions about the correlation functions 
and the evidence presented below is sufficient to show that one solution is correct. 
When $R_\lambda$ is small, it has been found possible (see II) to predict the form of the 
function $f(r)$ for as large a range of $r$ as $f(r)$ is self-preserving during decay. 

The experiments reported in I and those to be described below are concerned 
largely with the initial period of decay and with intermediate Reynolds numbers 
of turbulence. For three values of $R_M$ the measurements have been made over 
a decay range which is sufficiently large to extend beyond the initial period and into 
the transitional period. Some results have been obtained under the conditions of 
large and small Reynolds numbers of turbulence and these show quite good 
agreement with the appropriate theoretical deductions. 


4. Résumé of theoretical results to be examined 

The deductions which will be compared with experiment are set out in the 
following table. Reference should be made to the papers quoted for other deductions, 
from the same hypotheses, which do not lie within the scope of the experiments. 


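conditions / hypothesis / deductions 

Initial period of decay 

  Conditions: $22/R_\lambda \ll 1$. 
  Hypothesis: Kolmogoroff's similarity hypotheses. 
  Deductions: 
    (a) $g(r) = 1 - \tfrac{2}{3}C\,[\ldots]$ for a limited range of $r$ [remainder of the 
        expression illegible], where $C$ is an absolute constant 
    (b) $k_0'''\lambda^3$ and $f_0^{iv}\lambda^4$ are constant if $R_\lambda$ is constant during decay 
    (c) $s$ is an absolute constant 
    (see I for verification) 

  Conditions: $22/R_\lambda = O(1)$. 
  Hypothesis: $f(r) = f(r/\lambda)$, $k(r) = k(r/\lambda)$, for $0 \le r < l$. 
  Deductions: 
    either (a) $R_\lambda$ = constant during decay, and 
        $$f'' + f'\left(\frac{4}{\psi} + \frac{5\psi}{2}\right) + 5f + \tfrac{1}{2}R_\lambda\left(k' + \frac{4k}{\psi}\right) = 0, \quad \text{for } 0 < r < l$$ 
        (dashes denote differentiation with respect to $\psi$), 
    or (b) an energy decay law which asymptotes to $R_\lambda$ = constant, and 
        $$f'' + \frac{4}{\psi}f' + \left(\tfrac{5}{2} + \tfrac{7}{12}sR_\lambda\right)\psi f' + 5f = 0, \quad \text{for } 0 < r < l$$ 

  Conditions: $22/R_\lambda \gg 1$. 
  Hypothesis: $f(r) = f(r/\lambda)$, for $0 \le r < l$. 
  Deductions: $R_\lambda$ = constant, and 
        $$f'' + f'\left(\frac{4}{\psi} + \frac{5\psi}{2}\right) + 5f = 0, \quad \text{for } 0 < r < l$$ 

Final period of decay 

  Deductions: $u'^{-2} \propto t^{5/2}$, $\lambda^2 = 4\nu t$ and $f(r) = e^{-r^2/2\lambda^2}$ 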





5. Measurements of $u'$ and $\lambda$ 

Measurements of the quantities $u'$ and $\lambda$ which occur in the energy equation, viz. 

$$\frac{du'^2}{dt} = -\frac{10\nu u'^2}{\lambda^2}, \tag{5.1}$$

were made at various distances from a turbulence-producing grid. Four similarly 
shaped biplane grids constructed of circular cylinders of diameter $3M/16$ were used; 
the values of $M$ were 5.08, 2.54, 1.27 and 0.635 cm. Measurements of $\lambda$ were corrected 
for finite wire-length. The data are presented in figures 1 to 4. These diagrams 
contain the results described in I, together with new results for the smallest grid, 
and for the larger grids at different wind speeds. They summarize all the authors' 
experiments concerning the decay law (3.1), which is regarded as a characteristic 
of the initial period of decay. 



Figure 1. $U^2/u'^2$ vs. $x/M$ ($U$ = 643 cm.sec.$^{-1}$). × $M$ = 0.635 cm.; • $M$ = 1.27 cm. 

Figure 2. $U^2/u'^2$ vs. $x/M$ ($U$ = 1286 cm.sec.$^{-1}$). + $M$ = 2.54 cm.; ○ $M$ = 5.08 cm. 


It will be seen from figures 1, 2 and 3 that most of the measurements of $U^2/u'^2$ 
lie about a linear variation with $x/M$; only at the large values of $x/M$ obtained with 
the smallest mesh ($M$ = 0.635 cm.) do the points depart appreciably from a straight 
line. It is thus possible to write 

$$\frac{U^2}{u'^2} = a\left(\frac{x}{M} - \frac{x_0}{M}\right) \tag{5.2}$$

for an initial range of values of $x/M$, where $a$ and $x_0$ are constants. The energy 
equation (5.1) then requires 

$$\lambda^2 = \frac{10\nu}{U}(x - x_0) = \frac{10M^2}{R_M}\left(\frac{x}{M} - \frac{x_0}{M}\right). \tag{5.3}$$

The rather curious but well-known result that $\lambda^2$ depends only on the viscosity and 
time of decay, depending not at all on $M$ or $U$ directly, is a consequence of any 
decay law which makes $u'^2$ vary as some power of $x$ or $t$. In the plot of $\lambda^2$ vs. $x$ in 
figure 4, lines are drawn with slope $10\nu/U$ and they represent well the experimental 
variation, with the exception of the range $x/M > 180$ obtained with the smallest 
mesh. The experimental points would in no case be represented better by a straight 
line of slope more than 10 % different from $10\nu/U$. Thus the power of $(x - x_0)$ which 
represents the variation of $u'^{-2}$ in these experiments does not differ from unity by 
more than 10 %. 

Figure 3. $U^2/u'^2$ vs. $x/M$ ($U$ = 150 cm.sec.$^{-1}$, $M$ = 0.635 cm.). 

Figure 4. $\lambda^2$ vs. $x$. Curves I and II, slope = 2.26 cm. ($U$ = 643 cm.sec.$^{-1}$); 
curve III, slope = 1.13 cm. ($U$ = 1286 cm.sec.$^{-1}$). × $M$ = 0.635 cm.; • $M$ = 1.27 cm.; 
+ $M$ = 2.54 cm.; ○ $M$ = 5.08 cm. 
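
The remark that $\lambda^2$ depends only on the viscosity and the time of decay for any power-law decay can be checked with a short calculation. The sketch below assumes $u'^2 = A\,t^{-n}$ and inserts it in the energy equation $du'^2/dt = -10\nu u'^2/\lambda^2$, which gives $\lambda^2 = 10\nu t/n$. The function names, the amplitude and the value of $\nu$ are illustrative assumptions, not quantities taken from the paper.

```python
# Assumed round value of the kinematic viscosity of air in cm^2/sec (not stated in the text).
NU = 0.15


def lambda_squared(n, nu, t):
    """lambda^2 implied by the energy equation when u'^2 varies as t**(-n)."""
    return 10.0 * nu * t / n


def energy_residual(n, nu, t, amplitude=1.0, h=1e-6):
    """Residual of du'^2/dt + 10 nu u'^2 / lambda^2 for u'^2 = amplitude * t**(-n).

    The time derivative is taken by a central difference, so the residual is
    zero only to within the truncation and rounding error of that estimate.
    """
    def u2(tt):
        return amplitude * tt ** (-n)

    dudt = (u2(t + h) - u2(t - h)) / (2.0 * h)
    return dudt + 10.0 * nu * u2(t) / lambda_squared(n, nu, t)


if __name__ == "__main__":
    t = 2.0
    # n = 1 reproduces lambda^2 = 10 nu t; n = 5/2 reproduces lambda^2 = 4 nu t.
    print(lambda_squared(1.0, NU, t), 10.0 * NU * t)
    print(lambda_squared(2.5, NU, t), 4.0 * NU * t)
    print(energy_residual(1.0, NU, t), energy_residual(2.5, NU, t))
```

Neither the amplitude of the decay law nor any property of the grid enters `lambda_squared`, which is the point made in the text.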

The energy equation also requires the extrapolations of $U^2/u'^2$ and $\lambda^2$ to vanish 
at the same value of $x$. This virtual origin at $x = x_0$ represents the position at which 
the turbulence is effectively created with infinite energy. Values of $x_0$ given by the 
straight lines best fitting observations of $U^2/u'^2$ and $\lambda^2$ (in the case of $\lambda^2$ the slope of 
the lines was taken as $10\nu/U$ and only the position was varied to improve the fit) 
are set out in table 1 and are approximately in accord. 


Table 1. Measurements of u' and λ behind similar grids 

M        U             R_M            x_0/M            x_0/M         d(U²/u'²)/d(x/M)   R_λ 
(cm.)    (cm.sec.⁻¹)                  (U²/u'² → 0)     (λ² → 0) 

0.635     643          0.28 × 10⁴     20               12            128                14.4 
1.27      643          0.55           12               13            132                20.3 
2.54      643          1.10           10                7            130                28.7 
5.08      643          2.21            3                5              -                41 
0.635    1286          0.55            9                -            140                20.3 
1.27     1286          1.10           15                5            151                28.7 
2.54     1286          2.21            8                3            126                41 
5.08     1286          4.42            6                3              -                58 


Many investigators in the past have found that $U^2/u'^2$ when plotted as a function 
of $x/M$ appears to be approximately independent of $U$ and $M$ for similar grids. The 
present measurements support this conclusion. Apart from the small change in 
virtual origin of the turbulence with $R_M$, there is a little variation (apparently 
random) in the slope factor $a$ as shown in table 1. 

From (5.2) and (5.3) 

$$R_\lambda = \left(\frac{10R_M}{a}\right)^{1/2}, \tag{5.4}$$

and $R_\lambda$ is constant during the initial period of decay. Knowledge of $a$ for the types of 
grids commonly used in experiments therefore enables $R_\lambda$ to be calculated for given 
values of $U$ and $M$ and, as has already been seen, the value of $R_\lambda$ is a significant index 
to the state of the turbulence. The data in table 1 show that the average value of $a$ 
for the various values of $M$ and $U$ is 134 (no slope factor is deduced from 
measurements with $M$ = 5.08 cm. since these could only be made over a very small range of 
values of $x/M$); this figure is appropriate to biplane grids for which the ratio of rod 
spacing $M$ to diameter $d$ is 5.33. The values of $R_\lambda$ calculated from (5.4) with $a$ = 134 
are shown in the last column of table 1. They represent well the values of $u'\lambda/\nu$ 
obtained from the measurements of $u'$ and $\lambda$ except for the smallest values of $x/M$ 
where the influence of the disparities in the virtual origins appropriate to the $U^2/u'^2$ 
and $\lambda^2$ curves is strong. 
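
The last column of table 1 can be reproduced from the grid Reynolds numbers. The sketch below assumes the relation $R_\lambda = (10R_M/a)^{1/2}$, which follows from multiplying the linear laws for $U^2/u'^2$ and $\lambda^2$; the slope factors and values of $R_M$ are copied from table 1, while the function and variable names are illustrative.

```python
import math

# Slope factors d(U^2/u'^2)/d(x/M) listed in table 1 (rows with M = 5.08 cm. give none).
SLOPES = [128, 132, 130, 140, 151, 126]

# Grid Reynolds numbers appearing in table 1.
GRID_REYNOLDS = [0.28e4, 0.55e4, 1.10e4, 2.21e4, 4.42e4]


def mean_slope(slopes=SLOPES):
    """Unweighted average of the measured slope factors."""
    return sum(slopes) / len(slopes)


def r_lambda(r_m, a=134.0):
    """Turbulence Reynolds number from the grid Reynolds number and slope factor a."""
    return math.sqrt(10.0 * r_m / a)


if __name__ == "__main__":
    print("average a =", mean_slope())
    for r_m in GRID_REYNOLDS:
        print(int(r_m), round(r_lambda(r_m), 1))
```

The computed values agree with the tabulated 14.4, 20.3, 28.7, 41 and 58 to within the rounding of the table.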

Measurements behind the grid of smallest $M$ extended beyond the range of validity 
of the decay law (5.2). For $x/M > 180$ the measured values of $U^2/u'^2$ and $\lambda^2$ at all 
speeds show a rate of energy decay which is more rapid than that which holds in the 
initial period. The extent of the initial period of decay will probably depend on the 
grid Reynolds number and on the initial correlation functions created by the grid, 
but the tunnel available to the authors is too short to allow any systematic results 
to be established. There appears to be a slow diminution in the extent of the initial 
period as $R_M$ increases (see table 2 below). 

Dryden (1943) has discussed the measurements of $u'$ published by several authors 
and has compared them with the decay law (5.2). He finds moderately good 
agreement in all cases, particularly at the smaller values of $x/M$. In some cases the 
deviations found by Dryden can be interpreted as the beginning of the transitional period 
of decay. This is almost certainly true of v. Kármán's data. Von Kármán (1938) has 
presented measurements of $u'$ behind four grids of the woven wire type with different 
values of $M/d$ which, when plotted in the appropriate manner, show that $U^2/u'^2$ is 
initially proportional to $x/M$, but in some cases increases more rapidly for larger 
values of $x/M$. The essential features of v. Kármán's data are set out in table 2 for 
comparison with the measurements given by the authors. The point at which the 
linear decay law ceases to be valid is necessarily rather indefinite and the estimates 
given in the table are only useful as a guide. 


Table 2. Grids of different shape 

source            M        M/d     UM/ν           upper limit to      d(U²/u'²)/d(x/d)   C_D d(U²/u'²)/d(x/M) 
                  (cm.)                           extent of           (initial period)   (initial period) 
                                                  initial period 

v. Kármán         1.27     2.10    1.00 × 10⁴     x/M > 210            7.0               128 
v. Kármán         1.27     4.76    1.41           x/M = 229 (?)        3.1               100 
v. Kármán         1.27     4.76    1.00           137                 20.8                96 
v. Kármán         1.27     6.04    2.00           113                 23.0                90 
v. Kármán         1.27     6.04    1.00           113                 23.0                90 
v. Kármán         1.27     11.6    2.00, 1.00     no discernible initial period 
present authors   0.635    5.33    0.064          120                 average for        average for 
present authors   0.635    5.33    0.28           180                 different M and    different M and 
present authors   0.635    5.33    0.56           200                 U (see table 1)    U (see table 1) 
                                                                      = 25.1             = 105 



It has already been mentioned that the slope factor $a$ of equation (5.2) appears 
to be independent of $R_M$ for similar grids. It must thus be wholly determined by the 
characteristics of the grid which produces the turbulence. Two characteristics of 
the grid are relevant. The first is the drag per unit area, since it is the work done 
against this force which governs the amount of turbulent energy created by the grid. 
The second is the geometry of the grid, since this determines the length 
characteristics of the initial turbulence. These grid effects may be taken into account by writing 

$$\frac{U^2}{u'^2} = \frac{\beta}{C_D}\left(\frac{x}{M'} - \frac{x_0}{M'}\right), \tag{5.5}$$

where $\beta$ is an absolute constant, $C_D \cdot \tfrac{1}{2}\rho U^2$ is the drag of unit cross-sectional area of 
the grid and $M'$ is an effective unit of length. $M'$ will depend more on the spacing of 
the grid elements than on their dimensions. 




Now when $M/d$ is large, the drag of each grid element will be unaffected by the 
presence of its neighbours and $C_D$ is proportional to $d/M$. This explains v. Kármán's 
observation that his measurements of $U^2/u'^2$ for the various grids came closer to 
coincidence when plotted against $x/d$ rather than against $x/M$. However, table 2 
shows there is still considerable variation in the value of $d(U^2/u'^2)/d(x/d)$ for the smaller 
values of $M/d$ where the interference drag of the grid elements is appreciable. There 
have been many measurements of the drag of square grids of parallel circular 
cylinders in the past, and the formula which seems to represent the data best is* 



Using this formula, the values of $C_D$ for the grids used by v. Kármán and the present 
authors have been estimated and the last column of table 2 shows that the values 
of $C_D\,d(U^2/u'^2)/d(x/M)$ are more nearly constant than those of $d(U^2/u'^2)/d(x/d)$. The agreement 
is now as good as could be expected in view of the uncertainty of the estimates of 
$C_D$ and of the difficulty in making absolute measurements of $u'$; hence, so far as 
can be told from the data in table 2, the effective mesh length $M'$ in (5.5) is 
approximately equal to $M$. $\beta$ is given approximately by the average of the figures in the 
last column, with a total weight of unity for each group of readings which apply to 
one value of $M/d$, and (5.5) can be written approximately as 

$$\frac{U^2}{u'^2} = \frac{106}{C_D}\left(\frac{x}{M} - \frac{x_0}{M}\right). \tag{5.7}$$
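
The practical use of this relation is to estimate the decay of intensity behind a grid of known drag. The sketch below assumes the form $U^2/u'^2 = (106/C_D)(x/M - x_0/M)$ with $M' = M$; the function names are illustrative, and the drag coefficient printed for the authors' grids is merely the value implied by the measured slope $a$ = 134, not a figure quoted in the paper.

```python
# Constant in the approximate decay relation quoted in the text.
BETA = 106.0


def slope_factor(c_d, beta=BETA):
    """Slope a = d(U^2/u'^2)/d(x/M) implied by a drag coefficient C_D."""
    return beta / c_d


def implied_drag_coefficient(a, beta=BETA):
    """Drag coefficient that would give a measured slope factor a."""
    return beta / a


def intensity_ratio(x_over_m, x0_over_m, c_d, beta=BETA):
    """U^2/u'^2 at a distance x/M behind the grid, with virtual origin x0/M."""
    return slope_factor(c_d, beta) * (x_over_m - x0_over_m)


if __name__ == "__main__":
    c_d = implied_drag_coefficient(134.0)
    print(round(c_d, 3))
    # Illustrative station: x/M = 100 with a virtual origin at x0/M = 10.
    print(round(intensity_ratio(100.0, 10.0, c_d), 1))
```

The relation should be used with the caution expressed in the text until further measurements are available for grids of other shapes.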


Further measurements are necessary before this relation can be applied with 
confidence to grids of arbitrary shape. 

The formula (5.7) can also be written as 


which is more appropriate to the idealized problem of the decay of strictly 
homogeneous turbulence, since the factors related to the grid do not occur explicitly. 
$u_0'^2$ is the initial intensity of the turbulence created by the grid and is proportional 
to $C_D U^2$. $t_0$ is the time interval between the instant at which the extrapolated 
intensity would have been infinite and the initial instant at which it is $u_0'^2$. Evidently 
$t_0$ is proportional to $M/U$, as would be expected on the grounds that $M$ and $U$ are 
the factors most relevant to a time scale. 


* See L. F. G. Simmons & C. F. Cowdray, Measurements of the aerodynamic forces acting on 
porous screens, Aero. Res. Coun. Unpub. Rep. 8920 (1945). 




6. Measurements of $g(r)$ 

(a) Limited self-preservation of g(r) 

The two double velocity correlation functions $f(r)$ and $g(r)$ are related by the 
continuity condition, viz. 

$$g = f + \tfrac{1}{2}r\frac{\partial f}{\partial r}, \tag{6.1}$$

and a measurement of either is sufficient to determine the general double velocity 
correlation. Measurement of the lateral correlation is more convenient practically, 
and most of the $g(r)$ curve has been determined at a number of distances downstream 
from the grid for several values of $U$ and $M$. Figures 5 to 8 exhibit the results. Each 
diagram refers to a single value of $R_M$ and shows a family of curves of $g$ against $r/\lambda$, 
each member of the family referring to a particular stage in the decay. The relative 
positions of the curves are sensitive to the values of $\lambda$ at each value of $x/M$ and so 
$\lambda$ has been obtained from a smoothed experimental curve of $\lambda$ against $x/M$. The 
method of measurement of $g$ was less accurate at small values of $r$, but the parabolic 
variation of $g$ near the origin has previously been verified and may be assumed 
with confidence. 

Accurate self-preservation of the function $g$ seems to hold for $r/\lambda$ lying between 
0 and some figure of the order of 1, the range being larger at the higher Reynolds 
numbers. A more satisfactory method of determining the extent of the self-preserving 
property is to measure directly the derivatives of $f(r)$ at $r = 0$ at different distances 
from the grid. Measurements of $f_0^{iv}\lambda^4$, described in I, were found to be approximately 
constant during the decay. However, as mentioned in that paper, direct 
measurement of higher derivatives presents considerable difficulties. 

The chief feature of each family is that, apart from within the self-preserving 
region, the values of $r/\lambda$ corresponding to a given value of $g$ decrease with increase 
of $x/M$. That is to say, outside the self-preserving region, the curves $g(\psi)$ of any 
family have roughly the same shape but contract laterally as $x/M$ increases. Note 
however, that the change with $x/M$ in the outer parts of the curves cannot be 
described exactly by such a contraction, for each curve must satisfy the identity 

$$\int_0^\infty \psi\,g(\psi)\,d\psi = 0$$

deduced from (6.1). The correlation functions $f$ and $g$ would be expected to diminish 
in absolute value at large values of $r/\lambda$ with increase of $x/M$, since it was shown 
in II that 

$$u'^2\int_0^\infty f\,r^4\,dr = \Lambda$$

is constant during decay, and consequently, from (6.1), 

$$\int_0^\infty \psi^4 f(\psi)\,d\psi = -\tfrac{2}{3}\int_0^\infty \psi^4 g(\psi)\,d\psi \propto t^{-3/2}.$$

This relation is clearly consistent with a tendency for the curves of a family to 
contract slightly as $t$, or $x/M$, increases, since the absolute value of $g$ diminishes with 
increase of $r/\lambda$ when $r/\lambda$ is large. 
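
The two integral relations that follow from the continuity condition can be verified numerically for any smooth, rapidly decaying $f$. The sketch below takes the Gaussian form $f = e^{-\psi^2/2}$ purely as a convenient test function (it is the final-period form, not a measured curve), builds $g = f + \tfrac{1}{2}\psi f'$ by a central difference, and integrates by the trapezoidal rule. All names, the integration limit and the step sizes are illustrative assumptions.

```python
import math


def f(psi):
    """Test correlation function; the Gaussian is an illustrative choice."""
    return math.exp(-0.5 * psi * psi)


def g(psi, h=1e-5):
    """Lateral correlation from the continuity condition g = f + (psi/2) df/dpsi."""
    dfdpsi = (f(psi + h) - f(psi - h)) / (2.0 * h)
    return f(psi) + 0.5 * psi * dfdpsi


def integrate(fn, upper=12.0, n=24000):
    """Composite trapezoidal rule on [0, upper]; upper stands in for infinity."""
    step = upper / n
    total = 0.5 * (fn(0.0) + fn(upper))
    for i in range(1, n):
        total += fn(i * step)
    return total * step


# First moment of g, which the identity requires to vanish.
first_moment = integrate(lambda p: p * g(p))

# Fourth moments of f and g, which should satisfy m4_g = -(3/2) m4_f.
m4_f = integrate(lambda p: p ** 4 * f(p))
m4_g = integrate(lambda p: p ** 4 * g(p))

if __name__ == "__main__":
    print(first_moment)
    print(m4_g, -1.5 * m4_f)
```

Both relations depend only on the continuity condition and on $f$ falling off fast enough at large $\psi$, so the same check applies to other test functions.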

Figure 5. Transverse correlation. $U$ = 643 cm.sec.$^{-1}$, $M$ = 0.635 cm. 
• $x/M$ = 40, $\lambda$ = 0.208 cm.; + $x/M$ = 80, $\lambda$ = 0.298 cm.; 
× $x/M$ = 160, $\lambda$ = 0.450 cm.; □ $x/M$ = 200, $\lambda$ = 0.505 cm. 

Figure 6. Transverse correlation. $U$ = 643 cm.sec.$^{-1}$, $M$ = 1.27 cm. 


/*00 

It follows from these considerations that the area g{fr)dft diminishes with 

1 

xjM, and thence that the integral scale L y « I /dr * 2 gdr t increases with 

Jo Jo 

xjM less rapidly than does A. This is borne out by direct measurements of L f the 



552 


G. K. Batchelor and A. A. Townsend 


results of which are shown in figures 10 and 11 . Figure 10 shows (L/M)* plotted 
against xjM and on the basis of similar evidence Dryden ( 1943 ) has represented the 
experimental variation by a straight line. At best this is an approximation, since 


[Figure 7. Transverse correlation. U = 643 cm sec⁻¹, M = 2·54 cm. • x/M = 20, λ = 0·254 cm; + x/M = 40, λ = 0·421 cm; ⊗ x/M = 60, λ = 0·546 cm; □ 2nd solution deduced from limited self-preservation. Ordinate g, abscissa r/λ.]

[Figure 8. Transverse correlation. U = 1286 cm sec⁻¹, M = 1·27 cm.]

an exact linear variation would imply complete self-preservation. It is doubtful
whether the variation of L with x obeys any simple power law. Figure 11 shows that
λ/L increases most rapidly when x/M is small, and is consistent with figures 5 to 8.
The value of λ/L in the final period of decay is found from (3·3) to be √(2/π) = 0·80,




and this is presumably the asymptote toward which the curves of figure 11 ultimately
tend. A point of interest about the measurements of scale is that at the virtual origin
of the turbulence, L/M is in the neighbourhood of 0·3 or 0·4.
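The final-period value of λ/L can be reproduced numerically. This is a minimal sketch, assuming that (3·3) gives the longitudinal correlation f = exp(−r²/2λ²) in the final period (the form consistent with (6·7)) and that L is the integral of f over r; the function names and the choice of λ are illustrative only.

```python
import math

def f_final(r, lam):
    """Assumed final-period longitudinal correlation, f = exp(-r^2 / (2 lam^2))."""
    return math.exp(-r * r / (2.0 * lam * lam))

def integral_scale(lam, r_max_factor=12.0, n=20000):
    """Trapezoidal estimate of L = integral of f dr from 0 to r_max_factor * lam."""
    r_max = r_max_factor * lam
    h = r_max / n
    total = 0.5 * (f_final(0.0, lam) + f_final(r_max, lam))
    for i in range(1, n):
        total += f_final(i * h, lam)
    return total * h

lam = 0.5  # arbitrary value in cm; the ratio lam / L does not depend on it
ratio = lam / integral_scale(lam)
print(ratio, math.sqrt(2.0 / math.pi))  # the two numbers should agree closely
```

Both printed values are close to 0·80, the asymptote quoted for the curves of figure 11.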



[Figure 9. Transverse correlation. U = 1286 cm sec⁻¹, M = 5·08 cm. x/M = 20, λ = 0·316 cm. Abscissa r/λ.]












(b) Predictions about g

It was stated in II that measurements of g would help to establish the validity of
one of the two alternative deductions from the postulate of limited self-preservation.
The first solution, which gives a decay law such that R_λ is constant, makes no
prediction about f or g, but the second solution gives a decay law such that R_λ tends
to a constant together with the equation

r+l/'+(t+iVw/'+5/=o (»■<*). (0-4) 

where s is the skewness factor found in I to be an absolute constant close to 0·39,
and R_λ in (6·4) represents the asymptotic value. The solution of (6·4) can be obtained
in the form of an indefinite integral and has been computed for a number of the
values of R_λ at which measurements of g were made. Comparison of the measured
and calculated curves of g as a function of ψ for one value of R_λ is shown in figure 7
and is typical of comparisons at other Reynolds numbers. The agreement is not
good, particularly when it is recalled that one of the constants in the solution was
chosen to make f₀ⁱᵛλ⁴ agree with the value found by direct measurement (see I and II).
Thus there is reason to reject the second of the two alternative deductions from
limited self-preservation, and further evidence given in §7 shows that the first
solution is valid.

(c) Solution for small values of R_λ

A deduction from the postulate of limited self-preservation which is consistent 
with either of the solutions just mentioned is that, in the initial period of decay, 

$$f'' + f'\left(\frac{4}{\psi} + \frac{5\psi}{2}\right) + 5f = 0 \tag{6·5}$$

for r < l, when R_λ is sufficiently small. This equation describes the effect of viscosity
on the smaller eddies of the turbulence in the initial period. The solution which
makes f = 1 at ψ = 0 is

$$f(\psi) = \tfrac{3}{2}\int_0^1 (1-t)^{1/2}\, e^{-5\psi^2 t/4}\, dt. \tag{6·6}$$


The corresponding solution for g(ψ) can be obtained from (6·1) and numerical values
are shown in figure 5. This figure also contains the measured values of g at the lowest
Reynolds number used in the correlation experiments. The agreement is good for
r/λ < 1·5, except perhaps for the measurements close to the grid where accurate
values of λ are difficult to obtain. The measured values of g become negative in the
neighbourhood of r/λ = 2 whereas the solution (6·6) is everywhere positive; this
sets an upper limit to the region of self-preservation.
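A numerical sanity check of the small-R_λ solution can be sketched as follows. The code assumes that equation (6·5) reads f″ + f′(4/ψ + 5ψ/2) + 5f = 0 and that its regular solution with f(0) = 1 is the Kummer function M(1, 5/2, −5ψ²/4). Both the closed form and the function names are assumptions of this sketch, not expressions quoted from the paper. The solution is evaluated by its power series and by an equivalent integral, and the differential equation is checked by central differences.

```python
import math

def f_series(psi, terms=200):
    """Kummer series M(1, 5/2, z) = sum of z^n / (5/2)_n, with z = -5 psi^2 / 4."""
    z = -1.25 * psi * psi
    term, total = 1.0, 1.0
    for n in range(terms):
        term *= z / (2.5 + n)  # ratio of successive series terms
        total += term
    return total

def f_integral(psi, n=2000):
    """Simpson estimate of 3 * int_0^1 s^2 exp(-a (1 - s^2)) ds, a = 5 psi^2 / 4."""
    a = 1.25 * psi * psi
    h = 1.0 / n

    def integrand(s):
        return 3.0 * s * s * math.exp(-a * (1.0 - s * s))

    total = integrand(0.0) + integrand(1.0)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * integrand(i * h)
    return total * h / 3.0

def ode_residual(psi, h=1e-3):
    """Left-hand side of the assumed equation (6.5), using central differences."""
    f0, fp, fm = f_series(psi), f_series(psi + h), f_series(psi - h)
    d1 = (fp - fm) / (2.0 * h)
    d2 = (fp - 2.0 * f0 + fm) / (h * h)
    return d2 + d1 * (4.0 / psi + 2.5 * psi) + 5.0 * f0

for psi in (0.5, 1.0, 2.0):
    print(psi, f_series(psi), f_integral(psi), ode_residual(psi))
```

The series and the integral should agree, and the residual should be small compared with the individual terms of the equation.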

Some departure from the solution (6·6) at x/M = 200 might be expected, since
it has already been found that this position is slightly outside the initial period. The
solution appropriate to the final period is obtained from (3·3) and (6·1) as


$$g = \left(1 - \frac{r^2}{2\lambda^2}\right) e^{-r^2/2\lambda^2} \tag{6·7}$$




and this curve is shown in figure 5 for comparison. The departures from the solution
(6·6) at large values of x/M are in the direction to be expected from an approach
to (6·7).

The value of R_λ for the measurements in figure 5 was 14·4, and the range of 'small'
Reynolds numbers of turbulence such that the inertia forces play no part in the
determination of the correlation function g (or f) therefore has its upper limit near
this value. However, at this value of R_λ the ratio of the viscous to the inertia effect
in the balance of mean square vorticity is only 2·5, so that the inner region of the
correlation curve must be less sensitive to the effect of the inertia forces than the
vorticity.

(d) Large Reynolds numbers

The present experiments were not designed to make possible a comparison of
Kolmogoroff's theory of local isotropy with experiment, but the single measurement
of the function g made at R_M = 4·42 × 10⁴ (see figure 9) may be used for this purpose.
R_λ is here equal to 58 so that the ratio of the viscous to the inertia effect in the
balance of vorticity is equal to 1·38. The theory predicts (Kolmogoroff 1941;
Batchelor 1947) that

$$g(r) = 1 - \tfrac{2}{3}\,C\left(\frac{15\,r}{R_\lambda\,\lambda}\right)^{2/3} \tag{6·8}$$

provided η ≪ r ≪ L, where C is an absolute constant, η is a length of order λ/√R_λ
and L is the integral scale of turbulence. The curve of this shape which appears to
fit the observations quite well is

$$g(r) = 1 - 0{\cdot}36\left(\frac{r}{\lambda}\right)^{2/3}.$$

The prediction (6·8) is thus successful if

$$C = 0{\cdot}54\left(\frac{R_\lambda}{15}\right)^{2/3} = 1{\cdot}33,$$

which is quite close to the value 1·5 suggested by Kolmogoroff from a comparison
with measurements made by Dryden. Other predictions from the theory of local
isotropy (Batchelor 1947) have already been verified by the experiments described
in I.
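The arithmetic behind the quoted constant can be reproduced in a few lines. The sketch assumes that (6·8) has the form g = 1 − (2/3)C(15r/R_λλ)^(2/3), so that matching it to the fitted curve g = 1 − 0·36(r/λ)^(2/3) gives C = (3/2)(0·36)(R_λ/15)^(2/3); the function name is illustrative.

```python
def kolmogoroff_constant(fit_coeff, r_lambda):
    """Constant C implied by a fit g = 1 - fit_coeff * (r / lam)^(2/3).

    Assumes the prediction g = 1 - (2/3) C (15 r / (R_lambda lam))^(2/3).
    """
    return 1.5 * fit_coeff * (r_lambda / 15.0) ** (2.0 / 3.0)

C = kolmogoroff_constant(fit_coeff=0.36, r_lambda=58.0)
print(round(C, 2))  # 1.33
```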


7. Measurement of the triple correlation


It was shown in II that the two alternative deductions from the hypothesis of
limited self-preservation lead to different predictions about the triple velocity
correlation function k. According to the first solution, the decay law is such that
R_λ is constant, and

Mr*) +,|f+ 4v)" 0 

for r<l. Solving (7-1) for k, 


(7-1) 


(7-2) 





so that if f is measured, k can be calculated. On the other hand, the second solution
leads to a decay law which makes R_λ tend to a constant value, and to the equations

/' + ^/' + (£ + 0-23W/' + 5/=0, (7-3) 

0*46 r 00 

k = l^Jo rrd ^ (7 ’ 4) 

which can be solved to give both f and k in terms of R_λ. Both solutions give the
correct value of k %. 



Figure 12. Triple correlation. + Experimental; ○ 1st solution;
× 2nd solution, both deduced from limited self-preservation.

A measurement of k for various values of r therefore provides a means of choosing
between the two alternative deductions from the hypothesis of limited self-preservation.
Figure 12 shows measurements of k at x/M = 40, for M = 2·54 cm,
U = 643 cm sec⁻¹, and while no great accuracy is claimed for the measurements,
they make it clear that the first solution is correct. The function k calculated from
(7·2) and (6·1) by numerical integration of the values of g measured under the same
conditions is shown in figure 12 and has the same shape as the measured curve for
r/λ < 4. The ordinates of the calculated and measured curves differ in magnitude but
this is probably due to the sensitivity of the integral in (7·2) to an error in the value
of λ. Calculations of k from (7·3) and (7·4) with the appropriate value of R_λ are shown
in figure 12 and do not give a curve of the correct shape. Hence, from the evidence
of this and the preceding section, it may be concluded that the initial period of decay
is characterized by a decay law such that R_λ is constant and by the relation (7·1)
between the double and triple correlations.




The measurements of k shown in figure 12 have considerable intrinsic interest,
although the authors hope to be able to make more accurate determinations. When
r is small, k is necessarily proportional to r³, and the curvature evidently decreases
to zero rapidly as r increases. The maximum value of k is near r/λ = 2·6, and is
followed by a slow fall to zero as r becomes large. The measurements of k are everywhere
negative, which is the basis for an assumption that $\int_0^\infty k\,dr < 0$ used in II in
the deduction that a completely self-preserving solution must occur when the time
of decay is large.


8. Interpretation of the initial period 

This paper would not be complete without some consideration of the physical
processes occurring in the initial period of decay, although such considerations can
be little more than speculation at the present time. The fundamental difficulty is
to explain or deduce the law of energy decay found for this period. No acceptable
deduction of this decay law (except that based on the assumption of limited
self-preservation, which merely transfers the difficulty) has yet been put forward.
Dimensional arguments have been used to deduce the energy decay law (Taylor
1935; Dryden 1943), but these assume that there is a single length characteristic of
the turbulence and the experiments herein show this to be untrue. Such arguments
can be successful if, and only if, they can show that the total amount of work done
against Reynolds stresses created by the smaller scale velocity fluctuations is
proportional to u'³/λ. Since the dissipation of energy by viscosity is proportional
to νu'²/λ², consistency of the two expressions leads to R_λ = constant.
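The last step, and the decay law it implies, can be written out explicitly. This is only a sketch: the energy equation du'²/dt = −10νu'²/λ² is assumed from the standard theory of isotropic turbulence (it is not stated in this section), and the time origin is taken at the virtual origin of the turbulence.

```latex
\begin{aligned}
\frac{u'^3}{\lambda}\propto\frac{\nu u'^2}{\lambda^2}
 &\;\Longrightarrow\; R_\lambda=\frac{u'\lambda}{\nu}=\text{constant},\\[4pt]
\frac{du'^2}{dt}=-\frac{10\,\nu u'^2}{\lambda^2}=-\frac{10\,u'^4}{R_\lambda^2\,\nu}
 &\;\Longrightarrow\; \frac{d}{dt}\bigl(u'^{-2}\bigr)=\frac{10}{R_\lambda^2\,\nu}
 \;\Longrightarrow\; u'^{-2}=\frac{10\,t}{R_\lambda^2\,\nu},
 \qquad \lambda^2=\frac{R_\lambda^2\,\nu^2}{u'^2}=10\,\nu t .
\end{aligned}
```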

But once the decay law of the initial period is accepted, it is possible to understand
(in the mathematical sense, at all events) why the period cannot continue
indefinitely. For instance it was shown (see (6·3)) that

$$\int_0^\infty \psi^4 f(\psi)\, d\psi \propto t^{-3/2}$$

in the initial period. Near the origin, f(ψ) ≈ 1 − ½ψ², so that if f is to remain single-valued
and positive, the above integral cannot decrease beyond a certain limit.
Physically, the termination of the initial period seems to be a consequence of a
continual narrowing of the range of sizes of eddies comprising the energy spectrum.
At one end of the scale there is the length λ, which increases as √t. λ⁻² is the mean
square wave-number of sinusoidal velocity fluctuations, weighted according to
their contribution to the total energy, so that λ is representative of the eddies of large
wave-number, i.e. of the small eddies. At the other end of the scale is the integral
scale L, representative of the largest eddies. L increases less rapidly than λ, so that
the range is continually diminishing. A crisis develops, and whatever its nature,
the net result is to reduce the rate of increase of λ.

The larger eddies lose their energy chiefly through the action of the inertia terms
in the Navier-Stokes equations by transfer to smaller eddies, whereas the smaller





eddies are dissipated chiefly by the action of viscosity. At the end of a long period
of decay, when the disparity in size between the largest and smallest eddies has
been reduced appreciably and the Reynolds number of the larger eddies has
diminished, viscosity becomes a significant factor in the reduction in energy of all
the eddies. It is reasonable to expect a fundamental change in the decay law then
to occur. During the transitional period the inertia terms continue to play some part
in the transfer of energy away from the larger eddies, but the Reynolds number is
decreasing with time, and in the final period of decay viscosity dominates the
mechanics of the turbulence. The large negative value at the minimum of the correlation
function g which holds in the final period (see equation (6·7) and figure 5) is
an indication that the eddies existing in the final period are more uniform in type
and size than in the initial period.

Thanks are due to Professor Sir Geoffrey Taylor for his stimulating interest in
the work. During the course of the research the authors were in receipt of a Senior
Studentship (G.K.B.) and an Overseas Scholarship (A.A.T.) from the Royal Commission
for the Exhibition of 1851. Component parts of the measuring equipment
were purchased with money provided by the Aeronautical Research Council.


References 

Batchelor, G. K. 1947 Proc. Camb. Phil. Soc. 43, 533-559.
Batchelor, G. K. 1948 Quart. App. Math. 6 (referred to in the text as II).
Batchelor, G. K. & Townsend, A. A. 1947 Proc. Roy. Soc. A, 190, 534 (referred to as I).
Dryden, H. L. 1943 Quart. App. Math. 1, 7-42.
Kármán, T. v. 1938 Proc. 5th Int. Cong. App. Mech. pp. 347-351.
Kármán, T. v. & Howarth, L. 1938 Proc. Roy. Soc. A, 164, 192-215.
Kolmogoroff, A. N. 1941 C.R. Acad. Sci. U.R.S.S. 30, 301-305 and 32, 16-18.
Taylor, G. I. 1935 Proc. Roy. Soc. A, 151, Part I, 421-444.
Townsend, A. A. 1947 Proc. Camb. Phil. Soc. 43, 560-570.



Interaction of mesons with a potential field 

By J. C. Gunn, University College, London
(Communicated by H. S. W. Massey, F.R.S.—Received 0 December 1947)


The Sakata-Taketani wave equation for mesons is developed to allow the solution of a
variety of problems concerning the behaviour of charged vector mesons in an electrostatic
field. In particular the form of the radial wave equations is discussed. The general elastic
scattering problem is also formulated, and the Born approximation for the phases is discussed
for the Coulomb field, for which, as is known, no exact solution is possible. The exact
treatment of scattering by a spherical well is considered. This is the only case for which exact
treatment in terms of known functions seems practicable, and might be of service in indicating
the limits of the validity of approximate processes in other cases.


1. Introduction 

A variety of formalisms are in current use for the treatment of the electromagnetic
interaction of mesons. The original wave equation of Proca (1936) as developed by
various authors, with its emphasis on the field aspect, is perhaps not the most
suitable for the single particle problems, which are the subject of this paper. The
first matrix treatment, due to Kemmer (1939), is more suitable in this respect, and
leads to a general formalism quite analogous to the Dirac electron theory. For
particular problems it suffers from the defect that of the ten wave function components
four are redundant (cf. Booth & Wilson (1940)). The most suitable formalism
for problems in which the specific meson interaction is not concerned appears to
be that developed almost simultaneously by Sakata & Taketani (1940a, b) and Tamm
(1940), which shows that, with the elimination of the redundant components from
the Kemmer theory, the wave equation can still be expressed in Hamiltonian form.
A clear exposition of the general theory has been given by Heitler (1943).

The plan of this paper is, then, the systematic development of the Sakata-Taketani
theory, as applied to the various basic one particle meson problems, and in
particular to the quest for bound states in a given potential field, and to the formulation
of the scattering problems. Throughout, the development will be very analogous
to that for the Dirac electron.

2. Fundamental equations 

We start from the Sakata-Taketani Hamiltonian form, of which a full discussion
is given by Heitler (1943):

$$H = e\phi + mc^2\beta + \beta(1+\gamma)\,\frac{p^2}{2m} - \beta\gamma\,\frac{(\sigma\cdot p)^2}{m}. \tag{1}$$

Here the σ's are the spin matrices for particles of spin 1, connected by the relations

» ie ijk cr k « ia {j and crj « <r iy ( 2 ) 



where we use the usual double suffix summation convention and ε-symbol. β, γ and
a third matrix ε (Heitler's notation) are related like the usual Pauli 2 × 2 spin matrices,
and are connected with the meson charge. e and m are the meson charge and rest
mass and φ is the field potential. The generalization to a general electromagnetic
field is found on replacing, as usual, p by p − (e/c)A, and adding the term

eh 
2 me 




i 


where A, H represent the field vector potential and magnetic intensity respectively.
The wave equation is of the usual form

$$i\hbar\,\frac{\partial\psi}{\partial t} = H\psi, \tag{3}$$


where ψ is a vector in the 6 (= 2 × 3)-dimensional space of the β, σ representations.
An essential difference from the Dirac theory is now that H is not Hermitian. This
is linked with the fact that the charge density is given by ρ = ψ*βψ, so that the
average value of an observable Q is given by ∫ψ*βQψ dτ. The condition for an operator
Q to represent an observable is in fact that βQ be Hermitian, and this condition
is satisfied by H.
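The statement about Hermiticity can be illustrated on the field-free Hamiltonian. The sketch below is a reduction made for illustration, not the paper's procedure: it assumes (1) reads H = mc²β + β(1+γ)p²/2m − βγ(σ·p)²/m with φ = 0, takes β = diag(1, −1) and γ with unit off-diagonal elements, sets c = 1, and replaces (σ·p)² by one of its eigenvalues q, which for spin 1 is either p² or 0. The function names are hypothetical.

```python
import math

def reduced_hamiltonian(m, p, q):
    """2x2 charge-space block of the assumed free Hamiltonian (c = 1).

    q is the eigenvalue of (sigma.p)^2, taken to be p^2 or 0.
    """
    a = p * p / (2.0 * m)
    b = q / m
    # m*beta + a*beta(1+gamma) - b*beta*gamma, written out element by element
    return [[m + a, a - b], [b - a, -(m + a)]]

def beta_times(h):
    """Left-multiply a 2x2 matrix by beta = diag(1, -1)."""
    return [h[0][:], [-x for x in h[1]]]

def is_symmetric(mat, tol=1e-12):
    """For a real matrix, Hermitian means symmetric."""
    return abs(mat[0][1] - mat[1][0]) < tol

def positive_eigenvalue(h):
    """A traceless real 2x2 matrix has eigenvalues +/- sqrt(-det)."""
    det = h[0][0] * h[1][1] - h[0][1] * h[1][0]
    return math.sqrt(-det)

m, p = 1.0, 0.7
for q in (p * p, 0.0):
    h = reduced_hamiltonian(m, p, q)
    print(is_symmetric(h), is_symmetric(beta_times(h)), positive_eigenvalue(h))
```

In both cases H itself is not symmetric, βH is, and the eigenvalues are ±√(m² + p²), as expected for a free relativistic particle.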


3. Motion in a central field 

We begin directly with the case of motion in a central field. This can, as in the Dirac
particle case, be developed in two ways. We may either, as with Darwin (1928),
use the explicit spherical harmonic forms for the angular wave functions, and
hence derive the radial wave equations, or we may work entirely algebraically
with the appropriate operators as in Dirac's original treatment. We shall in fact
compromise to some extent between the two methods.

In the first instance it may easily be verified that for a spherically symmetric
field, φ = φ(r), M = m + s = ħ(k + σ) is in fact a constant of the motion. Here we
denote m = r × p = ħk for the orbital angular momentum, and s = ħσ for the spin
angular momentum of the particle. The stationary states, if any, can thus be partially
classified by the values of the energy E, total angular momentum M, and
angular momentum component in a given direction, M_z. For the last two we introduce
the quantum numbers j and m.

From general principles we then know that a state of given j and m may be found
by superposing states with orbital quantum number l = (j − 1), j, (j + 1), i.e. states
which are separately eigenstates of both m² and M², though the complete wave
function is not in general an eigenstate of m², which is not a constant of the motion.
For the algebraic treatment it is convenient to consider the operator (k·σ). It may
easily be verified using the relation

$$\sigma_i\sigma_j\sigma_k + \sigma_k\sigma_j\sigma_i = \sigma_i\delta_{jk} + \sigma_k\delta_{ij} \tag{4}$$




deducible from (2) that

$$(k\cdot\sigma)^3 = ((k\cdot\sigma) + 1)\,k^2 - 2(k\cdot\sigma)^2 - (k\cdot\sigma) \tag{5}$$

or if we introduce K = ((σ·k) + 1) then K³ = l(l + 1)K + K². From this it follows
that the eigenvalues of (σ·m) are lħ, −ħ and −(l + 1)ħ, corresponding to the states
j = l + 1, l, and (l − 1) respectively. We shall denote the angular wave functions in
spin space by the symbol u with appropriate suffices, so that for given j and m we
have the three independent wave functions

$$u_{j-1,j,m}, \quad u_{j+1,j,m} \quad\text{and}\quad u_{j,j,m},$$

which are such that

$$(k\cdot\sigma)\,u_{j-1,j,m} = (j-1)\,u_{j-1,j,m}, \quad (k\cdot\sigma)\,u_{j,j,m} = -u_{j,j,m}, \quad (k\cdot\sigma)\,u_{j+1,j,m} = -(j+2)\,u_{j+1,j,m}. \tag{6}$$
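The algebraic relation (4) on which this argument rests can be checked numerically for an explicit set of spin-1 matrices. The sketch assumes (4) reads σ_iσ_jσ_k + σ_kσ_jσ_i = σ_iδ_jk + σ_kδ_ij and uses a representation with σ_z diagonal, with basis order m = +1, −1, 0; the helper names are illustrative.

```python
import math
from itertools import product

S = 1.0 / math.sqrt(2.0)
# Assumed spin-1 matrices, basis ordered (m = +1, m = -1, m = 0)
SIGMA = [
    [[0, 0, S], [0, 0, S], [S, S, 0]],                          # sigma_x
    [[0, 0, -1j * S], [0, 0, 1j * S], [1j * S, -1j * S, 0]],    # sigma_y
    [[1, 0, 0], [0, -1, 0], [0, 0, 0]],                         # sigma_z
]

def mul(a, b):
    """3x3 matrix product."""
    return [[sum(a[i][k] * b[k][j] for k in range(3)) for j in range(3)]
            for i in range(3)]

def lin(ca, a, cb, b):
    """Linear combination ca*a + cb*b of two 3x3 matrices."""
    return [[ca * a[i][j] + cb * b[i][j] for j in range(3)] for i in range(3)]

def max_abs(a):
    return max(abs(x) for row in a for x in row)

def triple_relation_error():
    """Largest violation of relation (4) over all index triples."""
    worst = 0.0
    for i, j, k in product(range(3), repeat=3):
        si, sj, sk = SIGMA[i], SIGMA[j], SIGMA[k]
        lhs = lin(1, mul(mul(si, sj), sk), 1, mul(mul(sk, sj), si))
        rhs = lin(float(j == k), si, float(i == j), sk)
        worst = max(worst, max_abs(lin(1, lhs, -1, rhs)))
    return worst

def commutator_error():
    """Deviation of [sigma_x, sigma_y] from i * sigma_z."""
    sx, sy, sz = SIGMA
    comm = lin(1, mul(sx, sy), -1, mul(sy, sx))
    return max_abs(lin(1, comm, -1j, sz))

print(triple_relation_error(), commutator_error())
```

Both numbers should vanish to rounding error. The relation is representation independent, so the particular basis order chosen here does not matter.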

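If we take an explicit representation for the σ in which, say, σ_z is diagonal with matrix

$$\sigma_z = \begin{pmatrix} 1 & 0 & 0 \\ 0 & -1 & 0 \\ 0 & 0 & 0 \end{pmatrix},$$

and the phase factors are chosen so that

$$\sigma_x = \frac{1}{\sqrt2}\begin{pmatrix} 0 & 0 & 1 \\ 0 & 0 & 1 \\ 1 & 1 & 0 \end{pmatrix}, \qquad
\sigma_y = \frac{1}{\sqrt2}\begin{pmatrix} 0 & 0 & -i \\ 0 & 0 & i \\ i & -i & 0 \end{pmatrix}, \tag{7}$$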


then the expressions for the normalized angular eigenfunctions are easily deduced: 

( li(j + m)(j + m-\)\ v 


, m ~1 


U 


j~U.m 


m) 


v 

l j~hn 


U 




j+l,m +1 h 




l((j -m- 1 )(j- 

VI 2 j( 2 j - 1 ) 

H(j~m)(j + m)\ v 

M (j± 1 -mUJ+ 2-m)) Y 
2(T+l)(2j + 3) ""| 

II 0 +i±f^01+ 2 + m)l „ 

Vi 2(j+l)W+3T“i 1 

U (j +l-m)( j + 1 + m)l „ 

0+ l)(2i + 3) ~ j i+l - 

Vi Wi) r"”*- 1 \ 

/1 Q - m ) 0 + wl + ] ) | Y 

V 1 ' 2j0+l) i <• 


V00+!)}'•“ 


( 8 ) 





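Here Y_{l,m} is used to denote the normalized spherical harmonic, defined by

$$Y_{l,m}(\theta,\phi) = \frac{1}{\sqrt{2\pi}}\,P_l^m(\cos\theta)\,e^{im\phi},$$

where

$$P_l^m(x) = \sqrt{\frac{2l+1}{2}\,\frac{(l-m)!}{(l+m)!}}\;\frac{(1-x^2)^{m/2}}{2^l\,l!}\,\frac{d^{\,l+m}}{dx^{\,l+m}}\,(x^2-1)^l$$

is the normalized associated Legendre function (m positive or negative).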


4. Operators (σ·p)², (σ·x)², etc.

In the wave equation (3) the operator (σ·p)² occurs, and is indeed the chief novelty
of the meson theory. It is the term called by Heitler (1943) the gyroscopic term,
describing the motion of a gyroscope with inertia tensor t_ik = σ_iσ_k + σ_kσ_i. The
direct evaluation of this operator applied to the angular wave-functions is a matter
of some elaboration, and an algebraic approach is preferable. The operators (σ·x)²
and (σ·x)(σ·p), which also occur in the theory, may be considered at the same time.
Corresponding to the eigenvalue 1 of the spin, there are no simple properties
associated with the squares of the spin operators; it is always necessary to go to
the third order to obtain useful commutation properties. In particular the relation
(4) is often helpful. The details of the derivation of the results below will not be given
as this is quite straightforward.

(a) Operator (σ·x)²

Using the usual commutation relationships and (4) it may be established that

$$(\sigma\cdot m)(\sigma\cdot x)^2 + (\sigma\cdot x)^2(\sigma\cdot m) = (\sigma\cdot m)\,x^2 - 3\hbar(\sigma\cdot x)^2 + 2\hbar x^2.$$

If we define C = (σ·x)²(σ·m) − (σ·m)(σ·x)², then

$$(\sigma\cdot m)\,C + C\,(\sigma\cdot m) = -3\hbar C. \tag{9}$$

Applying this result in the various cases we find (dropping the suffix m)

$$C u_{j,j} = 0, \quad C u_{j-1,j} = \lambda\,u_{j+1,j}, \quad C u_{j+1,j} = \mu\,u_{j-1,j}. \tag{10}$$

From the first it follows that (σ·x)² u_{j,j} = r² u_{j,j}. For the other two states
C has exchanging properties. The values of λ, μ will of course depend on the phase
constants of the representation. They are determined by

A = u }+i,j n = 

and from the properties of C, λ = −μ*. The product λμ is found from the value of
C² = −r⁴{(σ·m + (3/2)ħ)² − ¼ħ²}, so that for operation on u_{j−1,j} or u_{j+1,j}

$$C^2 = -j(j+1)\,\hbar^2 r^4 = \lambda\mu.$$




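We choose the values, corresponding with the explicit form for the u's already
given, so that

$$C u_{j-1,j} = \sqrt{j(j+1)}\,\hbar r^2\,u_{j+1,j}, \qquad C u_{j+1,j} = -\sqrt{j(j+1)}\,\hbar r^2\,u_{j-1,j}, \tag{11}$$

or alternatively

$$\begin{aligned}
(\sigma\cdot x)^2\,u_{j-1,j} &= \frac{r^2}{2j+1}\left[(j+1)\,u_{j-1,j} + \sqrt{j(j+1)}\,u_{j+1,j}\right],\\
(\sigma\cdot x)^2\,u_{j+1,j} &= \frac{r^2}{2j+1}\left[j\,u_{j+1,j} + \sqrt{j(j+1)}\,u_{j-1,j}\right].
\end{aligned} \tag{12}$$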
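The coefficients in (11) and (12) can be cross-checked with a small matrix calculation. The sketch assumes that, on the pair (u_{j−1,j}, u_{j+1,j}), the operator (σ·x)²/r² is represented by the matrix [[j+1, s], [s, j]]/(2j+1) with s = √(j(j+1)), and that (σ·m)/ħ is diagonal with entries j−1 and −(j+2). Since (σ·x)³ = r²(σ·x), the first matrix should be a projector, and C = (σ·x)²(σ·m) − (σ·m)(σ·x)² should have off-diagonal elements ±s in units of ħr². The function names are illustrative.

```python
import math

def sx2_block(j):
    """Assumed 2x2 block of (sigma.x)^2 / r^2 on (u_{j-1,j}, u_{j+1,j})."""
    s = math.sqrt(j * (j + 1.0))
    d = 2.0 * j + 1.0
    return [[(j + 1.0) / d, s / d], [s / d, j / d]]

def matmul2(a, b):
    """2x2 matrix product."""
    return [[sum(a[r][k] * b[k][c] for k in range(2)) for c in range(2)]
            for r in range(2)]

def exchange_block(j):
    """C = A M - M A in units of hbar r^2, with M = diag(j - 1, -(j + 2)).

    Column index labels the state acted on, row index the resulting component.
    """
    a = sx2_block(j)
    m_diag = [[j - 1.0, 0.0], [0.0, -(j + 2.0)]]
    am, ma = matmul2(a, m_diag), matmul2(m_diag, a)
    return [[am[r][c] - ma[r][c] for c in range(2)] for r in range(2)]

for j in (1, 2, 3):
    print(j, exchange_block(j))
```

For each j the diagonal of C vanishes and the off-diagonal entries are ±√(j(j+1)), in line with the exchanging property described in the text.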

(b) Operator (σ·x)(σ·p)

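For the operator (σ·x)(σ·p) we can establish the relation

$$(\sigma\cdot m + \tfrac32\hbar)(\sigma\cdot x)(\sigma\cdot p) + (\sigma\cdot x)(\sigma\cdot p)(\sigma\cdot m + \tfrac32\hbar)
= (\sigma\cdot m + 2\hbar)(x\cdot p) + i\{(\sigma\cdot m + \hbar)^2 - \hbar^2\}. \tag{13}$$

An exchanging operator D similar to C above can be formed. It is
D = (σ·x)(σ·p)(σ·m) − (σ·m)(σ·x)(σ·p)
and satisfies

$$(\sigma\cdot m)\,D + D\,(\sigma\cdot m) = -3\hbar D. \tag{14}$$

For the state l = j, D = 0, and from (13) we immediately conclude

$$(\sigma\cdot x)(\sigma\cdot p)\,u_{j,j} = (x\cdot p - i\hbar)\,u_{j,j} = -i\hbar\,u_{j,j}\left[r\frac{\partial}{\partial r} + 1\right]. \tag{15}$$

For the states l = j ± 1, since (σ·x)(σ·p) commutes with M the form

$$(\sigma\cdot x)(\sigma\cdot p)\,u_{j-1,j} = \alpha\,u_{j-1,j} + \beta\,u_{j+1,j}, \qquad (\sigma\cdot x)(\sigma\cdot p)\,u_{j+1,j} = \gamma\,u_{j-1,j} + \delta\,u_{j+1,j} \tag{16}$$

must hold (the property of D excludes a term in u_{j,j}). The coefficients α, β, γ, δ can
be determined using (12) and (13), when the result of multiplying both sides of (16)
by (σ·x)² is used. Here it must be noted that (σ·x)³ = r²(σ·x). It is finally found that

$$\begin{aligned}
(\sigma\cdot x)(\sigma\cdot p)\,u_{j-1,j} &= -i\hbar\,\frac{j+1}{2j+1}\left\{r\frac{\partial}{\partial r} - (j-1)\right\}\left[u_{j-1,j} + \sqrt{\frac{j}{j+1}}\;u_{j+1,j}\right],\\
(\sigma\cdot x)(\sigma\cdot p)\,u_{j+1,j} &= -i\hbar\,\frac{j}{2j+1}\left\{r\frac{\partial}{\partial r} + (j+2)\right\}\left[u_{j+1,j} + \sqrt{\frac{j+1}{j}}\;u_{j-1,j}\right].
\end{aligned} \tag{17}$$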


(c) Operator (σ·p)²

An exactly similar technique can be used in this case. The relation comparable
to (13),

$$(\sigma\cdot m)(\sigma\cdot p)^2 + (\sigma\cdot p)^2(\sigma\cdot m) = (\sigma\cdot m)\,p^2 - 3\hbar(\sigma\cdot p)^2 + 2\hbar p^2, \tag{18}$$

now holds, and the operator B = (σ·p)²(σ·m) − (σ·m)(σ·p)² has similar properties
to the C defined earlier, such that

$$(\sigma\cdot m)\,B + B\,(\sigma\cdot m) = -3\hbar B. \tag{19}$$




B = 0 for the state j = l, for which

$$(\sigma\cdot p)^2 = p^2. \tag{20}$$

B has again the property of interchanging the states j = l ± 1 (it is this property
which corresponds with the spin orbit coupling caused by the appearance of (σ·p)²
in the Hamiltonian). This shows that, for example,

$$B\,u_{j-1,j} = u_{j+1,j}\,L,$$

where L is a second-order differential operator in (d/dr). The explicit form of L is
perhaps easiest found by considering the relation

$$(\sigma\cdot m)(\sigma\cdot p)^2 = i\left[(x\cdot p)(\sigma\cdot p)^2 - (\sigma\cdot x)(\sigma\cdot p)\,p^2\right] \tag{21}$$

together with the result for (σ·x)(σ·p) already established. This leads to

Bu j+i.j = h N{j(j+ „ (22) 


or alternatively 

(n w n a, w VO* O' 0} pa 

(O.p) u^j ~ 2iTl P A *- 1 ’ 


(a n) 2 u =s £ vfiu - VOOjjl JL 11 Wu G% 

(o.pj (2j ~\-1 (2/+ 1) ™ 

The differential operators G are defined by 

- [d - j) [d - t 1 ) . 

where ∂ = d/dr.

If we also define 

"i - - Hr)H) - ( J -^) Hr) ■ 

} 


then 

and 


EU&i-i - I>U E U< Wn E U = ^-1 = 

G? + i^5 +1 - />?-xO? + i • 

These results will be found useful later in deriving the radial equations. 


(23) 


(24) 

(25) 


(26) 


5. Radial wave equations 

We have now carried out all the formal work necessary to reduce the wave
equation to radial form, and shall proceed to find the radial equations directly,
rather than by trying to cast the Hamiltonian into radial form. The wave function
may be regarded as made up of two components in the charge-space, ψ = (ψ₁, ψ₂),
where ψ₁, ψ₂ are both still vectors in the three-dimensional spin space, assuming
we are dealing with a vector meson field. The particular (ψ₁, ψ₂) for a given ψ will,






of course, depend on the representation chosen for the charge operators. Generally
we take

$$\beta = \begin{pmatrix} 1 & 0 \\ 0 & -1 \end{pmatrix}, \qquad \gamma = \begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix}. \tag{27}$$

As contribution to ψ for a given j and m we shall have

$$\psi_1 = f_{j-1,j,m}(r)\,u_{j-1,j,m} + f_{j+1,j,m}(r)\,u_{j+1,j,m} + f_{j,j,m}(r)\,u_{j,j,m} \tag{28}$$

and a similar expression for ψ₂ with, say, g in place of f. As we have already seen in
the investigation of the operators, these components subdivide into two non-combining
sets: that for u_{j,j} and that from u_{j−1,j} and u_{j+1,j}. The suffix m may be
dropped as it does not enter the equations. The equation for f_{j,j} provides no difficulty,
being simply the Klein-Gordon equation. For the other states it might be thought
that, apart from the higher order, the position was no worse than for the Dirac
electron. In this latter case, with the present notation, a solution is found in which

ft 1 5=2 “ ft+t.&ttj+i,*’ 

thus involving two simultaneous first order equations in the two radial functions. No such
separation is possible in the meson case. The basic reason for this is the difference in parity
in the electron case between u_{j−½,j} and u_{j+½,j}; no such difference exists between
u_{j−1,j} and u_{j+1,j}. Lacking such a separation we must then expect four simultaneous
second-order equations for the combining functions f_{j−1,j}, f_{j+1,j}, g_{j−1,j}, g_{j+1,j}. The
total eighth order is only apparent, and as remarked by Tamm (1940), using different
representations, two simultaneous equations of the second order can be found to
describe these radial functions.

The radial equations for the stationary states may be found explicitly quite simply
by substituting the form (28) for ψ in Hψ = Eψ, and using the expression for (σ·p)²
(23). There results

(ii) (®-F- 2 E 0 + ^OJ + ,) /w ( - ^- [ -D} tlS , tu+ 2 Mj+ !)}«?_, 

( 20 ) 

where E₀ = mc², and V = eφ. Before discussing in detail the solution for particular
cases we give briefly the general method of reduction. Introducing

$$f_1 = f_{j-1,j} + g_{j-1,j}, \quad g_1 = f_{j+1,j} + g_{j+1,j}, \quad f_2 = f_{j-1,j} - g_{j-1,j}, \quad g_2 = f_{j+1,j} - g_{j+1,j},$$





and transferring to 'meson units' (the analogue of Hartree atomic units with the
appropriate rest mass), we find the simplified equations,


(i) 

(2j+l) 

(E-V)f l + \ 

f i+± E 0 )f^ =-V00+l )}(% + i9» 

(ii) 

(2j +1) 

(E-V)U +1 

= VOO+W+iJfc. 

(iii) 

(2j+l) 

L 1 

= -V00+ 1 A, 

(iv) 

( 2 j+l) 

(E-V)g 2 + j 

fl} +1 - Jf 0 ) ft] = V00+1)} ^- 1 / 1 . ' 


Two of the functions, say f₂, g₂, can now be directly eliminated. The elimination is
tedious, but direct, leading to two second-order equations. Perhaps a simpler
method of simplification is that which leads to equations of the form found by Tamm
(1940). To find these we define

Fl = VoJ+i) } +1)i7l_ ViA)* °i = r Ui +VO + 0/1} (32) 

and similarly F , B , G 2 . 

Then, writing v - (E- V)hj{j(j+ 1)}, E' 0 « E 0 /j(j 4 * 1 ), the equations (31) may 
be transformed to yield 

or,-Kir. 

- E ° F ' 


A 


a. 


(33) 


Two simultaneous equations of the second order may be found, for example, by
eliminating F₂ and G₁. We find

{A-2tfJ*r*/(l + E?r*)}F x + RG 2 = 0, AQ 2 + RF X » 0/ 
where * -'£{*/(l + *;r% . (34) 

A - i?i(r ~ + 1) {r |;/( 1 + ^r*)}+t**/( 1 + r*) - £ 0 . 

It is not proposed to discuss these equations further here. No exact solutions to
them have been found, other than in the field-free case, and, as noted in a recent
paper by Bartlett (1947), their numerical treatment would present a huge problem.




It may be noted that the form of the equations is similar to those treated by Stueckelberg
(1932) by Jeffreys' approximation. This technique might prove valuable under
suitable conditions when the Born approximation is not applicable. In the case of
a Coulomb field it may quite directly be shown that in the neighbourhood of the
origin the equations have an essential singularity. This point was noted by Corben
& Schwinger (1940) from a wave treatment and by Tamm (1940) using equations
similar to ours. The latter makes it clear that the singularity arises from the meson
magnetic moment, producing a term of order 1/r³ in the second order radial equation
which is the approximate form of (34) in the neighbourhood of the origin.


6. Formulation of scattering problem for mesons

We now consider the formulation of the single particle elastic scattering problem
for mesons. The problem is analogous to that treated by Mott (1929, 1932) for
Dirac electrons, and similar polarization effects may be expected.

The incident particles are supposed to have momentum p along the z-axis. They
may be represented, using the same β- and σ-representations as previously, by the
wave function

$$\begin{pmatrix} A \\ B \\ C \end{pmatrix} e^{ipz/\hbar}. \tag{35}$$

As in the electron case the other charge-space component need not be explicitly
introduced, and the scattering phenomenon can be described in terms of ψ₁ alone.
In fact, as is seen from (1),

$$\psi_2 = \frac{E - E_0}{E + E_0}\begin{pmatrix} A \\ B \\ -C \end{pmatrix} e^{ipz/\hbar}, \tag{36}$$

where E is the energy of the incident particle so that E² = E₀² + c²p². Thus the charge
density in the incident beam is given by

$$\rho = e\,\psi^*\beta\psi = e(\psi_1^*\psi_1 - \psi_2^*\psi_2) = e\,\frac{4EE_0}{(E+E_0)^2}\,\{|A|^2 + |B|^2 + |C|^2\}, \tag{37}$$

and the current density in the z-direction may be verified from the general expression 
to be 

* M c IV- <»» 

In the scattered wave, which can be regarded as made up of spherical waves, it will
equally be necessary, for the asymptotic behaviour, to consider only the wave
function ψ₁.
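The relation between the two charge-space components of the incident wave, and the resulting factor in the charge density, can be verified on a reduced form of the field-free Hamiltonian. The sketch assumes (1) with φ = 0, β = diag(1, −1), γ with unit off-diagonal elements and c = 1; for momentum along z it assumes (σ·p)² takes the value p² on the components A, B and 0 on the component C, so that ψ₂/ψ₁ should be +(E − E₀)/(E + E₀) and −(E − E₀)/(E + E₀) respectively. The function names are hypothetical.

```python
import math

def reduced_hamiltonian(e0, p, q):
    """2x2 charge-space block of the assumed free Hamiltonian (c = 1, m = e0).

    q is the eigenvalue of (sigma.p)^2: p^2 for components A, B and 0 for C.
    """
    a = p * p / (2.0 * e0)
    b = q / e0
    return [[e0 + a, a - b], [b - a, -(e0 + a)]]

def plane_wave_residual(e0, p, q, ratio):
    """Largest component of H psi - E psi for psi = (1, ratio)."""
    energy = math.sqrt(e0 * e0 + p * p)
    h = reduced_hamiltonian(e0, p, q)
    psi = (1.0, ratio)
    return max(abs(h[i][0] * psi[0] + h[i][1] * psi[1] - energy * psi[i])
               for i in range(2))

E0, p = 1.0, 0.7
E = math.sqrt(E0 ** 2 + p ** 2)
r = (E - E0) / (E + E0)

res_AB = plane_wave_residual(E0, p, p * p, r)   # components A and B
res_C = plane_wave_residual(E0, p, 0.0, -r)     # component C
density_factor = 1.0 - r * r                    # |psi_1|^2 - |psi_2|^2 per unit amplitude
print(res_AB, res_C, density_factor, 4.0 * E * E0 / (E + E0) ** 2)
```

Both residuals should vanish to rounding error, and the last two printed numbers should coincide, reproducing the factor 4EE₀/(E + E₀)² in the charge density.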

The incident plane wave may be expanded in a set of spherical waves. For this 
purpose it is convenient to consider separately the different incident polarizations. 




From (8) it is found, using the orthogonal properties of the coefficients of the u's


that 


Ui - M ’ 1 + J{w+i)j + ^ Uixi 

J {2(21 +T)} “ u+1 * ~ 1+ J {2(2/+1)) u, - l ~ 1 ’ - 1 " ^ Ulx ~ 1 

~y{§rn} 



( 39 ) 


For l = 0, l = 1 the general formulae have to be replaced by special ones. The terms
w z,i~i,±i n °f * n appear for l * 0,1, and for i = 0. 

It is convenient to consider the various polarizations of the incident wave in turn.
(a) Spin along positive z-axis (m = 1). If ψ₁ denotes the complete stationary state
wave function for incident and scattered beams, which is supposed expanded in
the usual form


and the incident wave J x is given by 



then 


Jc — pjh j 


(^i“ / x ) — 2 {«?,!+!,it/j,i+ li i“"*V{2 ff (^ + 2)}j i (ir)] 


tV{^(2*+l)WJ}, 


( 41 ) 


( 42 ) 


where again the middle term does not occur for l = 0, 1 nor the last for l = 0. The
j_l denote the usual spherical Bessel functions. Now in general each of the f's will
involve two arbitrary constants; one of these will be determined by the regularity
condition at r = 0. The other is determined by the fact that asymptotically the
right-hand side of (42) must represent only a scattered wave. If the asymptotic
form of, say, f_{l,l+1,1} is given by

/m+(«) 


(this form will be valid provided the scattering potential falls off quickly enough at
large distances), and similar expressions hold for the other f's, then from the
asymptotic condition

C%i+ i,i ~ ** yj{2n(l + 2)} exp (»^j +1> i),) 

Ci, 1 - 1.1 * 9 l)}exp(ttf u _ Itl ),[ (44) 

Cui - i‘<J{2n(2l+ l)}exp(ifi UI ). J 





The asymptotic form for the scattered wave can then be found. It may be written
in the form


fn - am 

^ 12 = SF 2 ( 0 ) e 2 '^ with S = e ikr /r, 
f ia = SF t (d)e**, 


( 45 ) 


where, in terms of the usual Legendre functions 
* 1 ( 0 ) - + (e«*w+ m- 1) +l)(e»“w-M -1) +J(2i+ i)(e*«y.«-l)]P,(cos 0 ), 

m " i- 1 )+4*' 1 - 1 )“2§TT) 1 ~ 1 >] p?(ooa 6) ' 

m = >■> - 1 )+^Tj (e “' Wt - ^(0080). 

The properties of the scattered beam are thus determined in terms of the phases of 
the various angular momentum components into which it may be resolved. 
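
As an illustration of how the phases fix the scattered amplitude, the following Python sketch evaluates the non-spin-flip amplitude for $m = 1$. It assumes the form $F_1(\theta) = (2ik)^{-1}\sum_l[\tfrac12(l+2)(e^{2i\delta_{l,l+1,1}}-1) + \tfrac12(l-1)(e^{2i\delta_{l,l-1,1}}-1) + \tfrac12(2l+1)(e^{2i\delta_{l,l,1}}-1)]P_l(\cos\theta)$, with the middle term absent for $l = 0, 1$ and the last for $l = 0$. The function names and the layout of the phase table are hypothetical choices made for this example only.

```python
import cmath
import math


def legendre(lmax, x):
    """Return [P_0(x), ..., P_lmax(x)] from the standard three-term recurrence."""
    p = [1.0, x]
    for l in range(1, lmax):
        p.append(((2 * l + 1) * x * p[l] - l * p[l - 1]) / (l + 1))
    return p[: lmax + 1]


def f1_amplitude(k, phases, theta):
    """Non-flip amplitude F_1(theta) for an incident beam with m = 1.

    phases[l] is a tuple (d_up, d_down, d_mid) holding the phases for
    j = l + 1, j = l - 1 and j = l (an assumed layout for this sketch).
    """
    p = legendre(len(phases) - 1, math.cos(theta))
    total = 0j
    for l, (d_up, d_down, d_mid) in enumerate(phases):
        term = 0.5 * (l + 2) * (cmath.exp(2j * d_up) - 1)
        if l >= 2:  # the j = l - 1 term does not occur for l = 0, 1
            term += 0.5 * (l - 1) * (cmath.exp(2j * d_down) - 1)
        if l >= 1:  # the j = l term does not occur for l = 0
            term += 0.5 * (2 * l + 1) * (cmath.exp(2j * d_mid) - 1)
        total += term * p[l]
    return total / (2j * k)
```

When the three phases belonging to a given $l$ coincide, the weights add up to $2l+1$ and the expression reduces to the familiar amplitude for a spinless particle, which gives a convenient check on the coefficients.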

The cases where $m = -1$ and $m = 0$ for the incident beam may be similarly 
treated. The results are quoted below. 

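(b) Incident particle, spin in negative z-direction ($m = -1$). With the same notation 
as previously 

$$\begin{aligned}
c_{l,l+1,-1} &= i^l\sqrt{2\pi(l+2)}\,\exp(i\delta_{l,l+1,-1}),\\
c_{l,l-1,-1} &= i^l\sqrt{2\pi(l-1)}\,\exp(i\delta_{l,l-1,-1}),\\
c_{l,l,-1} &= -i^l\sqrt{2\pi(2l+1)}\,\exp(i\delta_{l,l,-1}),
\end{aligned} \qquad (47)$$

and for the scattered wave 

$$\psi_{11} = S F_2(\theta)\,e^{-2i\phi}, \qquad \psi_{12} = S F_1(\theta), \qquad \psi_{13} = -S F_3(\theta)\,e^{-i\phi}. \qquad (48)$$

(c) $m = 0$. 

$$\begin{aligned}
c_{l,l+1,0} &= -i^l\sqrt{4\pi(l+1)}\,\exp(i\delta_{l,l+1,0}),\\
c_{l,l-1,0} &= i^l\sqrt{4\pi l}\,\exp(i\delta_{l,l-1,0}),
\end{aligned} \qquad (49)$$

and for the scattered wave 

$$\psi_{11} = S H_1(\theta)\,e^{-i\phi}, \qquad \psi_{12} = -S H_1(\theta)\,e^{i\phi}, \qquad \psi_{13} = S H_3(\theta), \qquad (50)$$

where 

$$H_1(\theta) = \frac{1}{2\sqrt{2}\,ik}\sum_l\left(e^{2i\delta_{l,l+1,0}} - e^{2i\delta_{l,l-1,0}}\right)P_l^1(\cos\theta),$$

$$H_3(\theta) = \frac{1}{2ik}\sum_l\left[(l+1)\left(e^{2i\delta_{l,l+1,0}} - 1\right) + l\left(e^{2i\delta_{l,l-1,0}} - 1\right)\right]P_l(\cos\theta).$$

It should be noted that the independence on $m$ of the radial equations, together 
with the similarity of the asymptotic form of the radial wave functions for $m = \pm 1$, 
ensures that the phases for $m = \pm 1$ are identical. Different values, however, appear 
for $m = 0$. The use of $F_1$, $F_2$, $F_3$ for $m = -1$, with no distinction from the case $m = 1$, 
is thus justified. 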





The general case when the incident wave is given by 


may now be obtained by superposition. If the scattered wave is given by 



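then 

$$\begin{aligned}
u_1 &= A F_1(\theta) + B F_2(\theta)\,e^{-2i\phi} + C H_1(\theta)\,e^{-i\phi},\\
u_2 &= A F_2(\theta)\,e^{2i\phi} + B F_1(\theta) - C H_1(\theta)\,e^{i\phi},\\
u_3 &= A F_3(\theta)\,e^{i\phi} - B F_3(\theta)\,e^{-i\phi} + C H_3(\theta).
\end{aligned} \qquad (52)$$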


The differential scattering cross-section is given by $\sum |u|^2$. This will depend on the 
polarization of the incident meson beam as determined by the ratios of $A$, $B$ and $C$. 
It is clear that the polarization effects, described by Mott for the electron, will also 
arise for the meson. Thus, for example, if the incident beam is completely unpolarized, 
which we may represent by putting $|A| = |B| = |C| = 1/\sqrt{3}$, it will not 
follow that $|u_1| = |u_2| = |u_3|$, and the scattered beam will generally exhibit 
some preferential direction of spin. 

In order to consider the matter in greater detail it is necessary to investigate how 
the direction of the spin axis of the meson is determined by $A$, $B$ and $C$. For this 
purpose the polar angles of the spin direction will be denoted by $(\chi, \omega)$. The spin 
measured along this direction has the value 1, so that 

® (A\ 

= I B J. (53) 

Using the representation (7) for the $\sigma$'s we find the normalized set of constants 

$$A = \cos^2\tfrac12\chi\;e^{-i\omega}, \qquad B = \sin^2\tfrac12\chi\;e^{i\omega}, \qquad C = \frac{1}{\sqrt2}\sin\chi. \qquad (54)$$
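
The following Python sketch checks numerically that a set of constants of the form $A = \cos^2\tfrac12\chi\,e^{-i\omega}$, $B = \sin^2\tfrac12\chi\,e^{i\omega}$, $C = \sin\chi/\sqrt2$ is normalized and describes unit spin along the direction $(\chi, \omega)$. It assumes the standard spin-1 matrices in the basis $m = +1, 0, -1$, which need not coincide in sign conventions with the representation (7) of the paper, and it assumes that $A$, $B$, $C$ multiply the $m = +1$, $m = -1$ and $m = 0$ components respectively. All function names are illustrative.

```python
import cmath
import math

SQRT2 = math.sqrt(2.0)


def spin_constants(chi, omega):
    """Assumed constants (A, B, C) for spin 1 along the polar angles (chi, omega)."""
    a = math.cos(chi / 2) ** 2 * cmath.exp(-1j * omega)
    b = math.sin(chi / 2) ** 2 * cmath.exp(1j * omega)
    c = math.sin(chi) / SQRT2
    return a, b, c


def spin_along(chi, omega):
    """Matrix of n.S in the standard spin-1 representation, basis (m = +1, 0, -1)."""
    s, c = math.sin(chi), math.cos(chi)
    up = s * cmath.exp(-1j * omega) / SQRT2
    dn = s * cmath.exp(1j * omega) / SQRT2
    return [[c, up, 0.0], [dn, 0.0, up], [0.0, dn, -c]]


def residual(chi, omega):
    """Largest component of (n.S) v - v, which vanishes for an eigenvalue +1."""
    a, b, c = spin_constants(chi, omega)
    v = [a, c, b]  # reorder (A, B, C) to the basis (m = +1, 0, -1)
    m = spin_along(chi, omega)
    mv = [sum(m[i][j] * v[j] for j in range(3)) for i in range(3)]
    return max(abs(mv[i] - v[i]) for i in range(3))


def mean_weights(n=2000):
    """Average |A|^2, |B|^2, |C|^2 over all spin directions (midpoint rule in cos chi)."""
    sums = [0.0, 0.0, 0.0]
    for i in range(n):
        chi = math.acos(-1.0 + (i + 0.5) * 2.0 / n)
        for idx, z in enumerate(spin_constants(chi, 0.0)):
            sums[idx] += abs(z) ** 2
    return tuple(s / n for s in sums)
```

Averaging over all directions gives each of $|A|^2$, $|B|^2$, $|C|^2$ the weight $1/3$, in agreement with the representation of an unpolarized beam by $|A| = |B| = |C| = 1/\sqrt3$.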


With the use of these values the differential cross-section in the direction $(\theta, \phi)$ when 
the incident beam is polarized in the direction $(\chi, \omega)$ is found to be 

S| * | a ~ <00»4* + BtaHx>{| F i | 2 + l F % |*+ I F 9 |*}4- isin^{2|^| 2 +|ff 8 | 8 } 

+ £ sin** cos 2(fti-0) {F*F Z + F$F X - | F z |*} 

4- sin x{i sin (w - <f>) 4- cos x 008 (<*> ” 0)} {( F * + F$) H x + FJ # 8 } 

4- conj. complex. ' (55) 

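The case of an unpolarized incident beam is obtained by averaging over $\chi$ and $\omega$, 
and has the obvious value 

$$\sum |u|^2 = \tfrac13\left\{2\left(|F_1|^2 + |F_2|^2 + |F_3|^2\right) + 2|H_1|^2 + |H_3|^2\right\}. \qquad (56)$$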




7. Born approximation for scattering of vector mesons 

The Born approximation formula for the scattering of vector mesons by a potential 
field has been considered by several authors. The first approximation formula was 
derived originally by Laporte (1938) and later by Massey & Corben (1939), using 
wave theory in both cases. The second approximation was found by Ma (1942) 
using the Kemmer matrix technique. These approximate methods yield a finite 
solution for the scattering cross-section by a Coulomb field, whereas as we have seen, 
no exact solution to this problem exists. In an attempt to assess the validity of the 
approximation we investigate its deduction by considering the decomposition into 
harmonic components. 

From equations (29), using meson units, the Born approximation for the phases 
may be easily deduced. It is convenient to alter the form of the equations. Thus 

2 \IU(j + 1 )} 1 x (i) ~ lJ ln x (*») “ ( 2 j + 1 ) -frf+i x ( iv) 

yields 

2 VW + -E 0 - F)/,_ u }- Df + 1 {(E-E 0 - V)f j+1 J 

-C2j+m +1 {(E + E 0 -V) 9i+1 J = 0 , 

and from similar other relations 


in~x{(E-E 0 - + 2 M3 + 1)} - E 0 - V)f j+i J 

- (2 j + \)I))_ y {(E + E 0 -V) g } _ u } = 0, 

2 VW + ^Ef-tUE + Eo-V)g J _ 1 J-D^ 1 {(E + E 0 -V)g J+hi } 

- (2 3 + 1 )»U{(E- E 0 -V)f j+1 J = 0,' 
E +E 0 -V) + 2 V{ j{j + 1)} 0% i {(E + E 0 - V)g j+l J 

- (2j + 1 )DU{(E-E 0 - D/y-u) - 0. 


(57) 


Combining these with (29), and writing k 2 


E*-El 

A 


(E, E 0 in meson units) we find 


P7-i + **)/y-W - + 1 )}G? +1 (*W 

-(2 j+ l)l>l- l (Vf j „ 1 ' i ) + 2(E+E 0 )(2j+ 1) VU 

+ **)/,*w - 2l^+T)t 2 VW+ (Vg^-DUtWg,^,) 

- (2j + \)D) +l (Vf j+hj ) + 2(2j +1 )(E+ E 0 ) 

WU+Wg,-!., = ze^T) [ “ ~ 2 Mj+ 1 )} QUA v f)+u) 

+ [2j +1) DU(Vgj-u) + m +l)(E-E 0 ) Vg^',1 

09+1+*% + w - [ “ 2 v ' {i(j +1)} EUi Vfi - lj)+ 

+ (2 j +1) DUi( Vg j+lJ ) + 2(2 \j +1) ( E-E 0 ) Vg j+tJ l 




If $V$ can be treated as small, a first approximation to the solution will be found 
by giving to $f$ and $g$ on the right-hand side their undisturbed values, i.e. their values 
in the harmonic decomposition of the incident plane wave in the scattering problem. 
The approximation thus found should agree with the Born approximation found by 
other methods. 

With this assumption the equations are all of the form 

$$(D_l + k^2)\,f(r) = -F(r),$$

the general solution to which is 

$$f = c\,j_l(kr) + c'\,n_l(kr) + k\left[j_l(kr)\int_0^r n_l(ks)\,s^2F(s)\,ds - n_l(kr)\int_0^r j_l(ks)\,s^2F(s)\,ds\right], \qquad (58)$$

where $j_l$, $n_l$ are the spherical Bessel functions. If the unperturbed solution is $f = j_l(kr)$, 
and the perturbed solution is such that $f \sim (kr)^{-1}\sin(kr - \tfrac12 l\pi + \delta_l)$, then if the integrals 
are both small, we may write 

$$\delta_l = k\int_0^\infty j_l(ks)\,s^2F(s)\,ds. \qquad (60)$$
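
The working of the phase formula $\delta_l = k\int_0^\infty j_l(ks)\,s^2F(s)\,ds$ may be illustrated on a case that can be solved exactly. The Python sketch below is not the vector meson problem: it assumes a spinless Schrödinger particle, $l = 0$, and an attractive square well $U = -u_0$ for $r < a$, so that $(D_0 + k^2)f = Uf$ and, to first order, $F = u_0\,j_0(kr)$ inside the well. The names `born_phase_numeric`, `born_phase_closed` and `exact_phase` are introduced here for illustration.

```python
import math


def j0(x):
    """Spherical Bessel function j_0(x) = sin(x)/x."""
    return 1.0 if x == 0.0 else math.sin(x) / x


def born_phase_numeric(k, u0, a, n=2000):
    """delta = k * int_0^a j_0(ks) s^2 F(s) ds with F = u0 * j_0(ks).

    Evaluated with the composite Simpson rule on n (even) intervals.
    """
    h = a / n
    total = 0.0
    for i in range(n + 1):
        s = i * h
        weight = 1 if i in (0, n) else (4 if i % 2 else 2)
        total += weight * j0(k * s) ** 2 * s * s * u0
    return k * total * h / 3.0


def born_phase_closed(k, u0, a):
    """The same integral done analytically: (u0/k) * (a/2 - sin(2ka)/(4k))."""
    return (u0 / k) * (a / 2.0 - math.sin(2.0 * k * a) / (4.0 * k))


def exact_phase(k, u0, a):
    """Exact s-wave phase from continuity of the wave function and its slope at r = a."""
    kp = math.sqrt(k * k + u0)  # wave number inside the well
    return math.atan2(k * math.sin(kp * a), kp * math.cos(kp * a)) - k * a
```

For a shallow well the first-order phase and the exact phase agree closely, and the discrepancy grows with $u_0$, which is the sense in which the integrals in (58) must be small.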

For the vector meson it will appear that this latter condition is always unsatisfied, 
through the divergence of $\int_0^\infty n_l(ks)\,s^2F(s)\,ds$. This divergence may not be a very 
important one as it is solely due to the singularity of the Coulomb field at the origin, 
and will be removed if the field be supposed cut off at some distance, say of the order 
of the meson classical radius $\sim 10^{-15}$ cm. There can be little physical meaning in 
effects produced by singularities within this radius, and it can be regarded as a purely 
formal difference between electrons and vector mesons that the relativistic magnetic 
moment of the latter makes it necessary explicitly to introduce the cut-off. 

Whilst, apart from the general theoretical difficulties, the qualitative introduction 
of a cut-off seems quite permissible, the matter can scarcely be left to rest there. 
It must be investigated how the phases and cross-sections will depend on the cut-off 
radius. Such a variation is bound to occur at sufficiently high energies and in this 
way it might even be hoped that experimental results on the Coulomb elastic scattering 
of vector mesons could eventually throw light on the quantitative behaviour 
of the cut-off. This possibility has, however, been considered too distant to justify 
detailed calculations at the present stage. In general it may be noted that the idea 
of the cut-off fits well into the statistical theories (e.g. Rosen (1947)) now being 
considered by various authors, and if it could be made quantitative, might provide 
information for their development. 

8. Born approximation. Coulomb field 

The Born approximation for the phases will now be briefly discussed for the 
particular case of the Coulomb field. For a repulsive field of charge $Z$, we have 
$V = Ze^2/r$. It is as usual convenient to consider the various polarizations of the 
incident beam in turn. 




(a) $m = +1$. Apart from an insignificant proportionality factor the zero approximation 
functions in this case are given by 

$$f^{0}_{j+1,j,1} = -\sqrt{j/(j+1)}\;j_{j+1}(kr), \qquad f^{0}_{j,j,1} = i\sqrt{(2j+1)/(j+1)}\;j_j(kr). \qquad (61)$$

It is convenient to work with ordinary units, using, however, as unit of length the 
particle wave length $1/k$. To determine $f_{j-1,j,1}$ we then have the equation 


(D*_x+ i — i 


2JS7 0 (2j+l) 


h > [ D! -(? #«•■) +2 voo+1» 


where 

Introducing the notation 


*• 


*z*> 
E + E 0 


me a _ v _ 

T’ P ~ c~ 


J3\ 

E*J' 


then, after some reduction, (62) can be put in the form 

m> +1)f _ ggij-flfr), «* __i._ 

p r + lc 0 (2j + 1) 




«]• 


The first term on the right-hand side gives the usual Rutherford scattering. The 
second term gives to the first order in $\alpha$ the effects of meson magnetic moment, etc. 
The method used in deducing (64) has removed the usual second order relativity 
ot® 

term —^5 j,_i, which would have to be replaced if Z is not small. 

Ignoring the Rutherford term, the relativity correction to the phase is given by 

w. - a*. 


Some care must be observed in carrying out the integration for the case $j = 1$, when 
the integrals multiplying $(j-1)$ are in fact divergent. As is shown later this corresponds 
with the singularity of $\nabla^2 V$ at the origin in a Coulomb field. The final result 
is that 

. «®a& 1 .. 

Vw,i ~ “ 2*r (J^T] y-!»-.•••)• 


(66) 





With $\alpha = 1/137$ we see that all these phases will be less than 1, until $k \approx 100k_0$. This 
implies a wave-length of the order of $10^{-15}$ cm for the incident mesons, or an energy 
of the order of $10^4$ MeV. At such an energy the assumed departure from the Coulomb 
law at very small distances $\sim 10^{-15}$ cm must have begun to take effect. In fact, 
if the cut-off is made at a distance $a$, then the lower limit in the integral for $\delta$ becomes 
$ka$, which is $O(1)$ for the case considered. The cut-off ultimately leads to a diminution 
in the phases as the incident energy is further increased. The asymptotic form is 


r Y~u 


3 3 
4 ak 0 lca' 


In general then we see that, provided $\alpha$ is small, the perturbation method in itself 
will not lead to serious errors in computing the relativistic effects, unless the cut-off 
distance is materially less than $10^{-15}$ cm. Deviation from its predicted results up 
to an energy of the order of 10 s MeV would be an indication of departure from the 
Coulomb field. 
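
The orders of magnitude quoted for $k \approx 100k_0$ may be checked by simple arithmetic. The Python sketch below assumes a meson rest mass of 200 electron masses, a value chosen here purely for illustration and not stated in the text, together with standard values of $\hbar c$, the electron rest energy and the fine-structure constant. It evaluates the length $1/k$, the total energy, and the classical radius $e^2/mc^2$.

```python
import math

HBAR_C_MEV_FM = 197.327        # hbar * c in MeV fm
ELECTRON_REST_MEV = 0.510999   # electron rest energy in MeV
ALPHA = 1.0 / 137.036          # fine-structure constant
FM_TO_CM = 1.0e-13


def meson_scales(mass_in_electron_masses=200.0, k_over_k0=100.0):
    """Return (1/k in cm, total energy in MeV, classical radius in cm).

    The mass is an assumed input; k_0 = mc/hbar is the reciprocal of the
    reduced Compton wave-length.
    """
    rest_energy = mass_in_electron_masses * ELECTRON_REST_MEV
    compton_cm = HBAR_C_MEV_FM / rest_energy * FM_TO_CM   # hbar / mc
    wavelength_cm = compton_cm / k_over_k0                # 1 / k
    energy_mev = rest_energy * math.sqrt(1.0 + k_over_k0 ** 2)
    classical_radius_cm = ALPHA * compton_cm              # e^2 / mc^2
    return wavelength_cm, energy_mev, classical_radius_cm
```

With these assumptions the wave-length and the classical radius both come out of the order of $10^{-15}$ cm and the energy of the order of $10^4$ MeV, so that $ka$ is of order unity when the cut-off distance $a$ is taken as the classical radius.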

The derivation of the remaining phases will not be discussed in detail. The results 
are merely quoted below 


tij+XJA 


8 2 ock 1 


O'= 1,2,...), 


I 


dj-l.J.O 


8*txk 1 
1 

a£(i+i) 


= o 


0 = 1 , 2 , 

0=1,2,...), 

0 - 0 ). 


(67) 


(b) Phases for $m = -1$ are equal to those for $m = 1$. For the $j = l$ components the 
relativistic phase changes are of order $\alpha^2$, being similar to those for scalar mesons. 
In certain cases (scattering by heavy nuclei) such terms are important, but they will 
be ignored in the present investigation. 

The conditions under which these phases are likely to be valid have already been 
discussed. When they are small, we may replace $e^{i\delta}$ by $(1 + i\delta)$, and hence determine 
the total angular distribution of the scattering from (46) and (51). Considering only 
the relativistic terms, 


F, = — 


«®<x& 

2k n 


cos® \6, 


s 2 ak 

2 JL 


008 * \ 0 , 


«*«& 1 


(cot ±0- sin d) - F 3 , H t m2F v 


( 68 ) 


8 2fc 0 V2’ 

It may be of interest to note that in the derivation of these functions only the 
$l = 0, 1$ waves contribute to $F_1$ and $H_3$. For higher $l$-values there is complete 
cancellation between the $l = (j+1)$ and $l = (j-1)$ components, in our approximation. 




The total differential cross-section may be found by adding the Rutherford term 
— —^ cosec 2 \d to F x and H z , and using (56), The result agrees with that found by 
various authors, Laporte (1938), Massey & Corben (1939) 

/ Ze 2 \a 2 / Zt>% \* 

(69) 

It can of course be most speedily derived with absorption and emission matrix 
elements and the spur technique (Sakata & Taketani (1940a)). 


9. Second-order wave equation—perturbation treatment 

The sources of the various phases may be to some extent clarified by a consideration 
of the second-order wave equation, which in any case provides a speedier 
derivation of the angular distribution of the scattered waves. For a charged vector 
meson in a potential field it is found from the first order equation 

[E—V — — ft( \ +^)p 2 /2w + /?^(o.p) 2 / m ] - (70) 


by operating on it with 

jB?— V + mc 2 /? + /J(l + ^)p 2 / 2 m~/fo/(a.p) 2 /m. 

The equation is found in the form 

(V + *■) t - [ig: _ ^ - JL «1 + ,) (V*K - FV>) + &L «.. V)> V - l'(o. V)')] i,. 

(71) 

A solution to this equation representing an incident wave 


ftf = ItiJ e ikz and tfp - 


j A \ 

/ B \e ikz , 


together with a scattered wave, treated as a perturbation, may be formed by replacing 
$\psi$ on the right-hand side by the incident wave (see, for example, Mott & Massey (1933)). 
The equation for $\psi_1$ then has the form 

$$(\nabla^2 + 1)\,\psi_1 = U(r, \theta, \phi)\,e^{iz}, \qquad (72)$$

where once more the unit of length $1/k$ is used. 
This gives as the scattered wave 

$$u(\theta, \phi)\,e^{ir}/r, \qquad (73)$$

where 

$$u(\theta, \phi) = -\frac{1}{4\pi}\int e^{iKr'\cos\gamma}\,U(\mathbf{r}')\,d\mathbf{r}', \qquad K = 2\sin\tfrac12\theta, \qquad (74)$$

$\gamma$ being the angle between the directions $(\theta', \phi')$ and $(\tfrac12(\pi - \theta), \pi + \phi)$. $U$ is easily found 
in the general case. For the Coulomb field, when $V = Ze^2/r$, care must be taken to 





add the term $\nabla^2 V = -4\pi Ze^2\,\delta(\mathbf{r})$ and a similar term in the expression for $(\boldsymbol{\sigma}\cdot\nabla)^2 V$. 
These $\delta$-function terms correspond to the peculiarity mentioned earlier in the 
evaluation of the phases $\delta_{0,1,\pm 1}$ and $\delta_{0,1,0}$. It may easily be verified, with this precaution, 
that the expressions for $F_1$, $H_1$, etc. are as found before. 
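
The way in which an integral of the type $u = -(4\pi)^{-1}\int e^{iKr'\cos\gamma}\,U(\mathbf{r}')\,d\mathbf{r}'$ leads to the Rutherford angular factor may be seen from the following Python sketch. It treats only a scalar, spherically symmetric $U(r) = g\,e^{-\mu r}/r$; the screening factor $e^{-\mu r}$ and the strength $g$ are assumptions introduced here to make the integral converge and do not appear in the text. After the angular integration the amplitude is $-g\int_0^\infty (\sin Kr/K)\,e^{-\mu r}\,dr = -g/(K^2 + \mu^2)$.

```python
import math


def momentum_transfer(theta):
    """K = 2 sin(theta/2), in units where the incident wave number is 1."""
    return 2.0 * math.sin(theta / 2.0)


def born_amplitude_numeric(theta, g, mu, n=40000):
    """Radial Born integral for U = g exp(-mu r)/r by the composite Simpson rule.

    The upper limit 40/mu is an assumed truncation; the neglected tail is
    of relative order exp(-40).
    """
    big_k = momentum_transfer(theta)
    r_max = 40.0 / mu
    h = r_max / n
    total = 0.0
    for i in range(n + 1):
        r = i * h
        weight = 1 if i in (0, n) else (4 if i % 2 else 2)
        total += weight * math.sin(big_k * r) * math.exp(-mu * r) / big_k
    return -g * total * h / 3.0


def born_amplitude_closed(theta, g, mu):
    """Closed form -g / (K^2 + mu^2); mu = 0 gives the unscreened limit."""
    big_k = momentum_transfer(theta)
    return -g / (big_k ** 2 + mu ** 2)
```

In the limit $\mu \to 0$ the closed form becomes $-\tfrac14 g\,\mathrm{cosec}^2\tfrac12\theta$, which is the angular dependence of the Rutherford term.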


10. Scattering by spherical well 


Special theoretical interest attaches to the case of scattering by a spherical 
potential well, as this is the only form of field for which an exact solution can readily 
be given. We consider a well of the form 


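$$V = -V_0 \quad (r < r_0), \qquad V = 0 \quad (r > r_0),$$

and write 

$$k^2 = \frac{E^2 - E_0^2}{\hbar^2c^2}, \qquad k'^2 = \frac{(E + V_0)^2 - E_0^2}{\hbar^2c^2}. \qquad (75)$$

The equations for the radial wave functions outside the well are then of the form 

$$(D_{j-1} + k^2)\begin{pmatrix} f_{j-1,j} \\ g_{j-1,j}\end{pmatrix} = 0, \qquad (D_{j+1} + k^2)\begin{pmatrix} f_{j+1,j} \\ g_{j+1,j}\end{pmatrix} = 0, \qquad (D_j + k^2)\begin{pmatrix} f_{j,j} \\ g_{j,j}\end{pmatrix} = 0. \qquad (76)$$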

Inside the well similar equations hold with $k$ replaced by $k'$. Each of the functions 
contains two arbitrary constants; externally one is determined by the asymptotic 
behaviour at infinity, and internally one from the behaviour at the origin. The 
remaining two constants are fixed by the continuity conditions across the boundary 
r = r 0 . 
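
The fixing of the constants by the conditions at the origin, at infinity and at $r = r_0$ can be illustrated for the simplest analogue. The Python sketch below assumes a single radial function obeying $(D_l + k'^2)f = 0$ inside and $(D_l + k^2)f = 0$ outside, with $f$ and $df/dr$ continuous at $r_0$, as for a scalar particle; it is not the coupled vector meson system, whose boundary conditions are different. Units with $\hbar = c = 1$ and $E_0 = 1$ are assumed, and the function names are illustrative.

```python
import math


def wave_numbers(energy, v0, e0=1.0):
    """k outside and k' inside the well: k^2 = E^2 - E0^2, k'^2 = (E + V0)^2 - E0^2."""
    return math.sqrt(energy ** 2 - e0 ** 2), math.sqrt((energy + v0) ** 2 - e0 ** 2)


def sph_bessel(l, x):
    """Return (j_l, n_l, j_l', n_l') at x > 0 by upward recurrence (adequate for small l)."""
    j_prev, j_cur = math.cos(x) / x, math.sin(x) / x      # j_{-1}, j_0
    n_prev, n_cur = math.sin(x) / x, -math.cos(x) / x     # n_{-1}, n_0
    for m in range(l):
        j_prev, j_cur = j_cur, (2 * m + 1) / x * j_cur - j_prev
        n_prev, n_cur = n_cur, (2 * m + 1) / x * n_cur - n_prev
    dj = j_prev - (l + 1) / x * j_cur                     # f_l' = f_{l-1} - (l+1) f_l / x
    dn = n_prev - (l + 1) / x * n_cur
    return j_cur, n_cur, dj, dn


def tan_phase(l, k, kp, r0):
    """tan(delta_l) from matching A j_l(k'r) to cos(delta) j_l(kr) - sin(delta) n_l(kr)."""
    j_out, n_out, dj_out, dn_out = sph_bessel(l, k * r0)
    j_in, _, dj_in, _ = sph_bessel(l, kp * r0)
    numerator = k * dj_out * j_in - kp * dj_in * j_out
    denominator = k * dn_out * j_in - kp * dj_in * n_out
    return numerator / denominator
```

The interior solution is taken proportional to $j_l(k'r)$ alone, which is the condition at the origin, and the exterior combination is normalized by its asymptotic form, so that only the phase remains to be fixed by the two continuity conditions.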

As in the earlier problems we approach the case of general meson spin by superposing 
the simple ones, and consider firstly $m = 1$, so that the incident wave is 
determined by 

\jr l » e ik * 

From the asymptotic conditions it may then easily be verified that outside the 
well $\psi_1$ is of the form 

S - »V(i + 1 ) hfU(kr) +' tiller)} 

+ i hf +l (kr) + hf +1 (lcr)} u j+1J1 

+y/(2j + l){e iU M.ihf\kr) + hf(kr)}u }JA ]. (78) 

Here $h^{(1)}$, $h^{(2)}$ are used to denote the spherical Hankel functions of the first and second 
kinds, defined so that 

$$h_n^{(1,2)}(\rho) = \sqrt{\frac{\pi}{2\rho}}\;H^{(1,2)}_{n+\frac12}(\rho). \qquad (79)$$

From the condition at r = 0, inside the well must be of the form 

\jr x = S + a j+\j, d]+i(k' r ) v, }+i,),i+ a j,i,ijj{k' r ) (80) 

i 

The forms for $\psi_2$ may immediately be deduced from the above, but as they are rather 
cumbersome will not be quoted here. 






Boundary conditions 

To complete the solution the $a$'s and $\delta$'s must be determined by the boundary 
conditions at $r = r_0$. It is not immediately evident, as for Schrödinger and Dirac 
particles, what these boundary conditions must be. However, they may be derived, 
either from the degenerate form of the field equations, or by considering the normal 
component of the current density vector, the conservation of which must be contained 
in the field equations. There is, in general, no need for charge density to be 
continuous across a boundary, a condition in fact satisfied for Schrödinger and Dirac 
particles but not by mesons. In this respect the hydrodynamical analogue to the 
meson behaviour at a potential barrier is the formation of shock waves in compressible 
fluid flow. 

The simplest physical approach is to consider the normal component of the 
current density vector. The space components of this vector are given (cf. Heitler 
(1943)) by 

»* = ^ ~2m Pk ^~ ^ 2wc• °) + (P • °) < r k}'/ r + ^(P x a) k f + oonj. complex. 

(81) 

It may immediately be verified that for a steady state they satisfy the continuity 
equation $\operatorname{div}\mathbf{i} = 0$. 


It is convenient to modify this expression so as to render obvious the requirements 
for the continuity of i„ . This may most compactly be performed by introducing 
the operators ^ + . a) — (p. a) <r k = p k + t(p x o) fc ,l 

Pk = i>*-^fc(P-o)-(P-o)o-*- > 

If r k ■* (1 — 2<r|), so that = 1, then with the help of the <r-commutation properties 

(4) it follows that _ a „ „ „ o 

T/cPk = a fc. T k Ct k ~ Pk- 


From (81) we have 

$$2mc\,i_k = (\psi_1 + \tau_k\psi_2)^{\dagger}(\alpha_k\psi_1 + \beta_k\psi_2) + \text{conj.} \qquad (83)$$

So, in general, the normal current density component $i_n$ will be continuous across 
a boundary if $(\psi_1 + \tau_n\psi_2)$ and $(\alpha_n\psi_1 + \beta_n\psi_2)$ are both continuous. 

The investigation of the boundary conditions does not strictly end here, for the 
conservation of momentum requires separate proof. This may be obtained from 
a straightforward demonstration that (83) is equivalent to the degenerate form of 
the field equations, which will be omitted here. 

The boundary conditions may be simplified slightly in the present case where the 
potential is a constant. With the help of the field equations it will here be found that 

fv where a* - (E-E 0 )/(E+E 0 ). (84) 


Thus the second boundary condition may be expressed 


(1 — a) a n fa continuous. 


(85) 





Determination of phases 


With $\psi_1$ as given by (78) and (80) the phases $\delta_{j\pm 1,j,1}$, $\delta_{j,j,1}$ can now be 
determined from the boundary conditions. The formulae concerned are straightforward 
but their derivation is long, and only the final results will be quoted. It is 
useful to define 


Aj-u — ^jj(^r 0 )hf}.i(lcr 0 )~kj^ 1 (kr 0 )h^-\kr 0 )A 
Aj+u = kj } (k r 0 ) k^lj(/cr 0 ) — kjj+Jk r 0 ) h^\krj, j 


C,- 


¥ 


. - } j^' r o) up,j, r \ _ K j }(k’ r o ) . a>/ jl_ \ 

\k(l-s*) fc'(l-*'*)/ r 0 *' (ICr °> ~ 2E 0 8s' r 0 

so that — (2j*l* I)(l ~ ss and Aj~. i,y = Aj~\ t 

In terms of these quantities it is found that 


(87) 

(88) 


= FJH, s**m i,*t = GJH, 

where F x = Aj +1 jAf„ l j — 2js& r {Aj~ l jC* — CjAf„ l j — 2(j +1)C^CJ}/ 


(89) 


For the non-relativistic case, when $s \to 0$, it is immediately evident that these 
formulae reduce to the usual ones for Schrödinger particles with spin 1. 

The formula for $\delta_{j,j,1}$ is much simpler, being in fact identical with that for a scalar 
meson; this peculiarity of the $j = l$ components has already been noted. In this way 

= - 4 ^ • (90) 

A i,i +1 A U -1 

The phases for $m = -1$ are identical with those for $m = 1$. This is formally evident 
from the fact that the asymptotic form for $\psi_1$ in this case is identical with that for 
$m = 1$, except for a change in sign of the $j = l$ component. The differential equations 
governing the radial functions are independent of $m$, so the phases in the two cases 
must agree. For $m = 0$ the asymptotic form for $\psi_1$ is different, and this difference 
in boundary condition has the effect of altering the phases. Outside the well 

fa - > + hflj + J(j + l){e 2 «y +l .i.o^ 1 > 1 + ^»| 1 }]. (91) 


On applying the boundary conditions at r = r 0 it is found that 


.).* = F 0 /H, c*«y+i.*• - GJH, 


where F 0 — A j+1 j A*„ lif - 2(j +1) ss'{A t ^ lif Cf - CjAf^j — 2jC } Cf}, | 

G 0 = Aj~ hj —‘2jss'{Aj +1 j Cj—C jA* +1 j — 2(j + 

and H is as defined before (89). 

For the present it has not been considered worth while to apply these formulae 
to the calculation of the phases in any specific case of spherical well scattering. Their 
chief interest might be to provide for one field at least, though admittedly a rather 
specialized one, an exact picture of the difference in behaviour between relativistic 
electrons and vector mesons. To this extent they may throw light on the scattering 
of mesons by other types of field, for which at present only the Born approximation 
treatment is available. The rather unsatisfactory position of the Coulomb field in this 
respect has already been discussed. 

11. Concluding remarks 

It has been the aim of this paper to formulate the principal one-body problems for 
the relativistic meson in interaction with the electro-magnetic field, in a uniform 
manner and analogously to the standard treatments of the relativistic electron 
problems. In theory the programme can be satisfactorily carried through, but the 
fourth order equations involved, combined with the different polarizations to be 
considered in each case, make the exact solution of any scattering or stationary state 
problem a considerable numerical task. However, even for Schrödinger particles 
exact solutions are formidable enough and frequently not required. It may be that, 
as with them, a combination of the Born and Jeffreys approximation methods in 
the general formalism would be adequate for the consideration of many problems 
of elastic meson scattering by an electric field. The matter must remain to some extent 
academic until the experimental possibility arises of the observation of such scattering, 
divorced from scattering by the intrinsic meson field. 

Acknowledgement and thanks are due to Professor H. S. W. Massey, F.R.S. for 
suggesting this investigation and for helpful discussions during its progress. 


References 

Booth, F. & Wilson, A. H. 1940 Proc. Roy. Soc. A, 175, 483. 

Bartlett, J. H. 1947 Phys. Rev. 72, 219. 

Corben, H. C. & Schwinger, J. 1940 Phys. Rev. 58, 953. 

Darwin, C. G. 1928 Proc. Roy. Soc. A, 118, 187. 

Heitler, W. 1943 Proc. Roy. Irish Acad. A, 49, 1. 

Kemmer, N. 1939 Proc. Roy. Soc. A, 173, 91. 

Laporte, O. 1938 Phys. Rev. 54, 905. 

Ma, S. T. 1942 Phys. Rev. 62, 403. 

Massey, H. S. W. & Corben, H. C. 1939 Proc. Camb. Phil. Soc. 35, 463. 

Mott, N. F. 1929 Proc. Roy. Soc. A, 124, 425. 

Mott, N. F. 1932 Proc. Roy. Soc. A, 135, 429. 

Mott, N. F. & Massey, H. S. W. 1933 Theory of atomic collisions. Oxford: Clarendon Press. 

Proca, A. 1936 J. Phys. et Radium, 7, 347. 

Rosen, N. 1947 Phys. Rev. 72, 298. 

Sakata, S. & Taketani, M. 1940a Inst. Phys. and Chem. Res., Tokyo, 38, 1. 

Sakata, S. & Taketani, M. 1940b Proc. Phys. Math. Soc., Japan, 22, 767. 

Stueckelberg, E. C. G. 1932 Helv. Phys. Acta, 5, 370. 

Tamm, I. 1940 C.R. Acad. Sci. U.R.S.S. 29, 551. 



INDEX TO VOLUME 193 (A) 

Accuracy of atomic co-ordinates derived from Fourier series in X-ray crystallography. V. 
(Booth & Britten), 305. 

Anisotropic metals, theory of yielding (Hill), 281. 

Atmospherics, propagation of the ‘slow’ or tail component (Hales), 60. 

Axon, H. J. & Hume-Rothery, W. The lattice spacings of solid solutions of different 
elements in aluminium, 1. 

Bakerian Lecture. Some problems in connexion with the development of a high speed 
Diesel engine (Ricardo), 210. 

Batchelor, G. K. & Townsend, A. A. Decay of isotropic turbulence in the initial period, 
539. 

Boas, W. & Hargreaves, M. E. On the inhomogeneity of plastic deformation in the 
crystals of an aggregate, 89. 

Booth, A. D. & Britten, K. H. V. The accuracy of atomic co-ordinates derived 
from Fourier series in X-ray crystallography. V, 305. 

Britten, K. H. V. See Booth & Britten. 

Burgoyne, J. H. & Williams-Leir, G. The influence of incombustible vapours on the 
limits of inflammability of gases and vapours in air, 525. 

Carbon dioxide-carbon reaction (Gadsby, Long, Sleightholm & Sykes), 357. 

Chalmers, B., King, R. & Shuttleworth, R. The thermal etching of silver, 465. 

Control of the temperature of endothermic reactions in flow systems (Gadsby & Sykes), 
400. 

Coulson, C. A. & Longuet-Higgins, H. C. The electronic structure of conjugated systems. 
III. Bond orders in unsaturated molecules, 447. IV. Force constants and interaction 
constants in unsaturated hydrocarbons, 456. 

Decay of isotropic turbulence in the initial period (Batchelor & Townsend), 539. 

Diesel engine, development of a high speed (Ricardo), 210. 

Egerton, Sir Alfred & Powling, J. The limits of flame propagation at atmospheric pressure. 
I. The influence of ‘promoters’, 172. II. The influence of changes in the physical 
properties, 190. 

Electronic structure of conjugated systems (Coulson & Longuet-Higgins). III. Bond 
orders in unsaturated molecules, 447. IV. Force constants and interaction constants in 
unsaturated hydrocarbons, 456. 

Excitation of nuclei by electrons (Sneddon & Touschek), 344. 

Foucault test, interpretation (Linfoot), 248. 

Gadsby, J., Long, F. J., Sleightholm, P. & Sykes, K. W. The mechanism of the carbon 
dioxide-carbon reaction, 357. 

Gadsby, J. & Sykes, K. W. The control of the temperature of endothermic reactions in 
flow systems, 400. 

Galvanomagnetic and thermomagnetic effects in metals (Sondheimer), 484. 

Gilles, D. C. The use of interlacing nets for the application of relaxation methods to 
problems involving two dependent variables, 407. 


Green, A. E. & Willmore, T. J. Three-dimensional stress systems in isotropic plates. II, 229. 

Gunn, J. C. Interaction of mesons with a potential field, 559. 

Hales, A. L. A possible mode of propagation of the ‘slow’ or tail component in 
atmospherics, 60. 

Hargreaves, M. E. See Boas & Hargreaves. 

Hartree, D. R. & Hartree, W. Self-consistent field, with exchange, for nitrogen and 
sodium, 299. 

Hartree, W. See Hartree & Hartree. 

Hill, R. A theory of the yielding and plastic flow of anisotropic metals, 281. 

Hume-Rothery, W. See Axon & Hume-Rothery. 

Hydrodynamics of non-Newtonian fluids. I (Rivlin), 260. 

Inflammability of gases and vapours in air (Burgoyne & Williams-Leir), 525. 

Influence of incombustible vapours on the limits of inflammability of gases and vapours 
in air (Burgoyne & Williams-Leir), 525. 

Interaction of mesons with a potential field (Gunn), 559. 

Investigation of radio-frequency radiation from the sun (Ryle & Vonberg), 98. 

Isotropic turbulence in the initial period (Batchelor & Townsend), 539. 

Kellaway, C. H. The Wellcome Research Institution, 435. 

King, R. See Chalmers, King & Shuttleworth. 

Klaukens, H. & Wolfhard, H. G. Measurements in the reaction zone of a Bunsen flame, 
512. 

Lattice spacings of solid solutions of different elements in aluminium (Axon & 
Hume-Rothery), 1. 

Lawton, H. & Stewart, K. H. Magnetization curves for ferromagnetic single crystals, 72. 

Limits of flame propagation at atmospheric pressure (Egerton & Powling). I. The 
influence of ‘promoters’, 172. II. The influence of changes in the physical properties, 
190. 

Linfoot, E. H. On the interpretation of the Foucault test, 248. 

Long, F. J. See Gadsby, Long, Sleightholm & Sykes. 

Long, F. J. & Sykes, K. W. The mechanism of the steam-carbon reaction, 377. 

Longuet-Higgins, H. C. See Coulson & Longuet-Higgins. 

Magnetization curves for ferromagnetic single crystals (Lawton & Stewart), 72. 

Martyn, D. F. Solar radiation in the radio spectrum. I. Radiation from the quiet sun, 44. 

Matrices of integers associated with self-transformations of surfaces (Room), 25. 

Measurements in the reaction zone of a Bunsen flame (Klaukens & Wolfhard), 512. 

Mechanism of the carbon dioxide-carbon reaction (Gadsby, Long, Sleightholm & Sykes), 
357. 

Mechanism of the steam-carbon reaction (Long & Sykes), 377. 

Natural evaporation from open water, bare soil and grass (Penman), 120. 

On the inhomogeneity of plastic deformation in the crystals of an aggregate (Boas & 
Hargreaves), 89. 

On the interpretation of the Foucault test (Linfoot), 248. 


Penman, H. L. Natural evaporation from open water, bare soil and grass, 120. 

Plastic deformation in the crystals of an aggregate (Boas & Hargreaves), 89. 

Possible mode of propagation of the ‘slow’ or tail component in atmospherics (Hales), 60. 

Powling, J. See Egerton & Powling. 

Radiation from the sun (Ryle & Vonberg), 98. 

Ratcliffe, J. A. & Shaw, I. J. A study of the interaction of radio waves, 311. 

Relaxation methods applied to engineering problems. XIII. The flexure and extension 
of perforated elastic plates (Southwell), 147. 

Ricardo, H. R. Bakerian Lecture. Some problems in connexion with the development of 
a high speed Diesel engine, 210. 

Rivlin, R. S. The hydrodynamics of non-Newtonian fluids. I, 260. 

Room, T. G. Matrices of integers associated with self-transformations of surfaces, 25. 

Ryle, M. & Vonberg, D. D. An investigation of radio-frequency radiation from the sun, 98. 

Self-consistent field, with exchange, for nitrogen and sodium (Hartree & Hartree), 299. 

Shaw, I. J. See Ratcliffe & Shaw. 

Shuttleworth, R. See Chalmers, King & Shuttleworth. 

Sleightholm, P. See Gadsby, Long, Sleightholm & Sykes. 

Sneddon, I. N. & Touschek, B. F. The excitation of nuclei by electrons, 344. 

Solar radiation in the radio spectrum. I. Radiation from the quiet sun (Martyn), 44. 

Some problems in connexion with the development of a high speed Diesel engine. 
Bakerian Lecture (Ricardo), 210. 

Sondheimer, E. H. The theory of the galvanomagnetic and thermomagnetic effects in 
metals, 484. 

Southwell, R. V. Relaxation methods applied to engineering problems. XIII. The flexure 
and extension of perforated elastic plates, 147. 

Steam-carbon reaction (Long & Sykes), 377. 

Stewart, K. H. See Lawton & Stewart. 

Study of the interaction of radio waves (Ratcliffe & Shaw), 311. 

Sykes, K. W. See Gadsby, Long, Sleightholm & Sykes; Gadsby & Sykes; and Long & 
Sykes. 

Theory of the galvanomagnetic and thermomagnetic effects in metals (Sondheimer), 484. 

Theory of the yielding and plastic flow of anisotropic metals (Hill), 281. 

Thermal etching of silver (Chalmers, King & Shuttleworth), 465. 

Three-dimensional stress systems in isotropic plates. II (Green & Willmore), 229. 

Touschek, B. F. See Sneddon & Touschek. 

Townsend, A. A. See Batchelor & Townsend. 

Use of interlacing nets for the application of relaxation methods to problems involving 
two dependent variables (Gilles), 407. 

Vonberg, D. D. See Ryle & Vonberg. 

Wellcome Research Institution (Kellaway), 435. 

Williams-Leir, G. See Burgoyne & Williams-Leir. 

Willmore, T. J. See Green & Willmore. 

Wolfhard, H. G. See Klaukens & Wolfhard. 

END OF THE ONE HUNDRED AND NINETY-THIRD VOLUME (SERIES A) 






